cturan/llama.cpp @ 8a7b2fa528f130631a5f43648481596ab320ed5a

Georgi Gerganov bcc0eb4591 llama : per-layer KV cache + quantum K cache (#4309)		před 2 roky
..
CMakeLists.txt	15f5d96037 build : fix build info generation and cleanup Makefile (#3920)	před 2 roky
base64.hpp	381efbf480 llava : expose as a shared library for downstream projects (#3613)	před 2 roky
build-info.cpp.in	b12fa0d1c1 build : link against build info instead of compiling against it (#3879)	před 2 roky
common.cpp	bcc0eb4591 llama : per-layer KV cache + quantum K cache (#4309)	před 2 roky
common.h	bcc0eb4591 llama : per-layer KV cache + quantum K cache (#4309)	před 2 roky
console.cpp	3aefaab9e5 check C++ code with -Wmissing-declarations (#3184)	před 2 roky
console.h	6381d4e110 gguf : new file format with flexible meta data (beta) (#2398)	před 2 roky
grammar-parser.cpp	4fa44e84ad grammar-parser : fix typo (#4318)	před 2 roky
grammar-parser.h	6381d4e110 gguf : new file format with flexible meta data (beta) (#2398)	před 2 roky
log.h	a2758d08e4 log : make generating separate log files optional (#3787)	před 2 roky
sampling.cpp	caa9249217 common : fix compile warning	před 2 roky
sampling.h	52c8bc3cf3 sampling : custom samplers order (#4285)	před 2 roky
stb_image.h	370359e5ba examples: support LLaVA v1.5 (multimodal model) (#3436)	před 2 roky
train.cpp	ba4cf5c0bf train : move number of gpu layers argument parsing to common/train.cpp (#4074)	před 2 roky
train.h	4760e7cc0b sync : ggml (backend v2) (#3912)	před 2 roky