cturan/llama.cpp @ 7ea15bb64c81e3813eb0babf9a57e1bc5697f569

Georgi Gerganov bc07349a7f server : dynamic token limit for prompt cache (#16560)		3 mesi fa
..
batched-bench	07808ebb07 cmake : Do not install tools on iOS targets (#15903)	4 mesi fa
cvector-generator	07808ebb07 cmake : Do not install tools on iOS targets (#15903)	4 mesi fa
export-lora	07808ebb07 cmake : Do not install tools on iOS targets (#15903)	4 mesi fa
gguf-split	1d660d2fae ci : use smaller model (#16168)	3 mesi fa
imatrix	07808ebb07 cmake : Do not install tools on iOS targets (#15903)	4 mesi fa
llama-bench	3df2244df4 llama : add --no-host to disable host buffers (#16310)	3 mesi fa
main	2f61c0f5bf llama-cli: prevent spurious assistant token (#16202)	3 mesi fa
mtmd	c08002a198 chat : Granite Docling stopping (#16438)	3 mesi fa
perplexity	3ffd0fae47 perplexity : show more kl-divergence data (#16321)	3 mesi fa
quantize	1d660d2fae ci : use smaller model (#16168)	3 mesi fa
rpc	c61ae20d05 rpc : update documentation (#16441)	3 mesi fa
run	4201deae9c common: introduce http.h for httplib-based client (#16373)	3 mesi fa
server	bc07349a7f server : dynamic token limit for prompt cache (#16560)	3 mesi fa
tokenize	07808ebb07 cmake : Do not install tools on iOS targets (#15903)	4 mesi fa
tts	34fcc5a4ac model : Apertus model implementation (#15852)	3 mesi fa
CMakeLists.txt	9b61acf060 mtmd : rename llava directory to mtmd (#13311)	8 mesi fa