cturan/llama.cpp @ 22da05536ff4ad963080773bef1fb839fdab95d3

Georgi Gerganov 6b0a7420d0 llama : KV cache view API + better KV cache management (#4170)		hace 2 años
..
CMakeLists.txt	381efbf480 llava : expose as a shared library for downstream projects (#3613)	hace 2 años
base64.hpp	381efbf480 llava : expose as a shared library for downstream projects (#3613)	hace 2 años
build-info.cpp.in	b12fa0d1c1 build : link against build info instead of compiling against it (#3879)	hace 2 años
common.cpp	6b0a7420d0 llama : KV cache view API + better KV cache management (#4170)	hace 2 años
common.h	6b0a7420d0 llama : KV cache view API + better KV cache management (#4170)	hace 2 años
console.cpp	3aefaab9e5 check C++ code with -Wmissing-declarations (#3184)	hace 2 años
console.h	6381d4e110 gguf : new file format with flexible meta data (beta) (#2398)	hace 2 años
grammar-parser.cpp	f439e506e8 ggml : fix rope + llama minor optimizations (#3560)	hace 2 años
grammar-parser.h	6381d4e110 gguf : new file format with flexible meta data (beta) (#2398)	hace 2 años
log.h	a2758d08e4 log : make generating separate log files optional (#3787)	hace 2 años
sampling.cpp	e75dfdd31b sampling : null grammar field after reset (#3885)	hace 2 años
sampling.h	238657db23 samplers : Min-P sampler implementation [alternative to Top P/Top K] (#3841)	hace 2 años
stb_image.h	370359e5ba examples: support LLaVA v1.5 (multimodal model) (#3436)	hace 2 años
train.cpp	ba4cf5c0bf train : move number of gpu layers argument parsing to common/train.cpp (#4074)	hace 2 años
train.h	4760e7cc0b sync : ggml (backend v2) (#3912)	hace 2 años