cturan/llama.cpp @ 39842a7f73012eb42816ca4f26411782bd3da7c5

Georgi Gerganov 6b64f74b55 batched-bench : fix unified KV cache handling + pp timing (#15562)		4 сар өмнө
..
batched-bench	6b64f74b55 batched-bench : fix unified KV cache handling + pp timing (#15562)	4 сар өмнө
cvector-generator	745aa5319b llama : deprecate llama_kv_self_ API (#14030)	7 сар өмнө
export-lora	749e0d27f0 mtmd : fix 32-bit narrowing issue in export-lora and mtmd clip (#14503)	5 сар өмнө
gguf-split	e9b6350e61 scripts : make the shell scripts cross-platform (#14341)	6 сар өмнө
imatrix	19f68fa5a4 imatrix : warn when GGUF imatrix is saved without .gguf suffix (#15076)	5 сар өмнө
llama-bench	9ebebef62f llama : remove KV cache defragmentation logic (#15473)	4 сар өмнө
main	f75b830647 chat : include kwargs in template example (#15309)	5 сар өмнө
mtmd	e288693669 readme : model : mtdm : lfm2 improvements (#15476)	4 сар өмнө
perplexity	3ea913f1ce perplexity: give more information about constraints on failure (#15303)	5 сар өмнө
quantize	fd1234cb46 llama : add gpt-oss (#15091)	5 сар өмнө
rpc	c508256db2 rpc : Fix build on OpenBSD (#13541)	7 сар өмнө
run	a457551332 cmake : do not search for curl libraries by ourselves (#14613)	6 сар өмнө
server	9ebebef62f llama : remove KV cache defragmentation logic (#15473)	4 сар өмнө
tokenize	1d36b3670b llama : move end-user examples to tools directory (#13249)	8 сар өмнө
tts	d2fcd91cf9 server : disable context shift by default (#15416)	5 сар өмнө
CMakeLists.txt	9b61acf060 mtmd : rename llava directory to mtmd (#13311)	8 сар өмнө