aa956 d67341dc18 server : add server parameters for draft model cache type (#13782) 7 months ago
batched-bench 745aa5319b llama : deprecate llama_kv_self_ API (#14030) 7 months ago
cvector-generator 745aa5319b llama : deprecate llama_kv_self_ API (#14030) 7 months ago
export-lora 1d36b3670b llama : move end-user examples to tools directory (#13249) 8 months ago
gguf-split 1d36b3670b llama : move end-user examples to tools directory (#13249) 8 months ago
imatrix 745aa5319b llama : deprecate llama_kv_self_ API (#14030) 7 months ago
llama-bench fffcce535e llama-bench : add --no-warmup flag (#14224) (#14270) 7 months ago
main 745aa5319b llama : deprecate llama_kv_self_ API (#14030) 7 months ago
mtmd 413977de32 mtmd : refactor llava-uhd preprocessing logic (#14247) 7 months ago
perplexity 745aa5319b llama : deprecate llama_kv_self_ API (#14030) 7 months ago
quantize e5c834f718 quantize : improve tensor-type pattern matching (#13033) 8 months ago
rpc c508256db2 rpc : Fix build on OpenBSD (#13541) 8 months ago
run 745aa5319b llama : deprecate llama_kv_self_ API (#14030) 7 months ago
server d67341dc18 server : add server parameters for draft model cache type (#13782) 7 months ago
tokenize 1d36b3670b llama : move end-user examples to tools directory (#13249) 8 months ago
tts 53f925074d sync : vendor (#13901) 7 months ago
CMakeLists.txt 9b61acf060 mtmd : rename llava directory to mtmd (#13311) 8 months ago