cturan/llama.cpp @ 13f2cfad4170c096c51a02c24a6a158cb47f1480

Aleksander Grygier 13f2cfad41 Enable per-conversation loading states to allow having parallel conversations (#16327)		2 months ago
..
batched-bench	07808ebb07 cmake : Do not install tools on iOS targets (#15903)	4 months ago
cvector-generator	07808ebb07 cmake : Do not install tools on iOS targets (#15903)	4 months ago
export-lora	07808ebb07 cmake : Do not install tools on iOS targets (#15903)	4 months ago
gguf-split	1d660d2fae ci : use smaller model (#16168)	3 months ago
imatrix	07808ebb07 cmake : Do not install tools on iOS targets (#15903)	4 months ago
llama-bench	3df2244df4 llama : add --no-host to disable host buffers (#16310)	3 months ago
main	2f61c0f5bf llama-cli: prevent spurious assistant token (#16202)	3 months ago
mtmd	1bb4f43380 mtmd : support home-cooked Mistral Small Omni (#14928)	3 months ago
perplexity	3ffd0fae47 perplexity : show more kl-divergence data (#16321)	3 months ago
quantize	1d660d2fae ci : use smaller model (#16168)	3 months ago
rpc	41386cf365 rpc : report actual free memory (#16616)	3 months ago
run	4201deae9c common: introduce http.h for httplib-based client (#16373)	3 months ago
server	13f2cfad41 Enable per-conversation loading states to allow having parallel conversations (#16327)	2 months ago
tokenize	07808ebb07 cmake : Do not install tools on iOS targets (#15903)	4 months ago
tts	34fcc5a4ac model : Apertus model implementation (#15852)	3 months ago
CMakeLists.txt	9b61acf060 mtmd : rename llava directory to mtmd (#13311)	8 months ago