cturan/llama.cpp @ d8914fc47e8a69b28c670325cb1c8ce33e3a2960

Copilot d8914fc47e common : add --override-tensor-draft, --cpu-moe-draft and --n-cpu-moe-draft parameters (#15191)		5 mesi fa
..
batched-bench	225e7a1438 llama : add high-throughput mode (#14363)	6 mesi fa
cvector-generator	745aa5319b llama : deprecate llama_kv_self_ API (#14030)	7 mesi fa
export-lora	749e0d27f0 mtmd : fix 32-bit narrowing issue in export-lora and mtmd clip (#14503)	5 mesi fa
gguf-split	e9b6350e61 scripts : make the shell scripts cross-platform (#14341)	6 mesi fa
imatrix	19f68fa5a4 imatrix : warn when GGUF imatrix is saved without .gguf suffix (#15076)	5 mesi fa
llama-bench	476aa3fd57 Fixed name `-override-tensors` to `-override-tensor` (#15129)	5 mesi fa
main	c82d48ec23 llama : fix `--reverse-prompt` crashing issue (#14794)	6 mesi fa
mtmd	cf9e5648a7 mtmd : Fix MinicpmV model converter and clip to avoid using hardcode. (#14750)	5 mesi fa
perplexity	1ebbaddff2 perplexity : update comments/error msg to use decode [no ci] (#15227)	5 mesi fa
quantize	fd1234cb46 llama : add gpt-oss (#15091)	5 mesi fa
rpc	c508256db2 rpc : Fix build on OpenBSD (#13541)	7 mesi fa
run	a457551332 cmake : do not search for curl libraries by ourselves (#14613)	6 mesi fa
server	d8914fc47e common : add --override-tensor-draft, --cpu-moe-draft and --n-cpu-moe-draft parameters (#15191)	5 mesi fa
tokenize	1d36b3670b llama : move end-user examples to tools directory (#13249)	8 mesi fa
tts	53f925074d sync : vendor (#13901)	7 mesi fa
CMakeLists.txt	9b61acf060 mtmd : rename llava directory to mtmd (#13311)	8 mesi fa