Aleksander Grygier 13f2cfad41 Enable per-conversation loading states to allow having parallel conversations (#16327) 2 months ago
..
batched-bench 07808ebb07 cmake : Do not install tools on iOS targets (#15903) 4 months ago
cvector-generator 07808ebb07 cmake : Do not install tools on iOS targets (#15903) 4 months ago
export-lora 07808ebb07 cmake : Do not install tools on iOS targets (#15903) 4 months ago
gguf-split 1d660d2fae ci : use smaller model (#16168) 3 months ago
imatrix 07808ebb07 cmake : Do not install tools on iOS targets (#15903) 4 months ago
llama-bench 3df2244df4 llama : add --no-host to disable host buffers (#16310) 3 months ago
main 2f61c0f5bf llama-cli: prevent spurious assistant token (#16202) 3 months ago
mtmd 1bb4f43380 mtmd : support home-cooked Mistral Small Omni (#14928) 3 months ago
perplexity 3ffd0fae47 perplexity : show more kl-divergence data (#16321) 3 months ago
quantize 1d660d2fae ci : use smaller model (#16168) 3 months ago
rpc 41386cf365 rpc : report actual free memory (#16616) 3 months ago
run 4201deae9c common: introduce http.h for httplib-based client (#16373) 3 months ago
server 13f2cfad41 Enable per-conversation loading states to allow having parallel conversations (#16327) 2 months ago
tokenize 07808ebb07 cmake : Do not install tools on iOS targets (#15903) 4 months ago
tts 34fcc5a4ac model : Apertus model implementation (#15852) 3 months ago
CMakeLists.txt 9b61acf060 mtmd : rename llava directory to mtmd (#13311) 8 months ago