compilade 90083283ec imatrix : use GGUF to store importance matrices (#9400) 6 bulan lalu
..
batched-bench 225e7a1438 llama : add high-throughput mode (#14363) 6 bulan lalu
cvector-generator 745aa5319b llama : deprecate llama_kv_self_ API (#14030) 7 bulan lalu
export-lora 1d36b3670b llama : move end-user examples to tools directory (#13249) 8 bulan lalu
gguf-split e9b6350e61 scripts : make the shell scripts cross-platform (#14341) 6 bulan lalu
imatrix 90083283ec imatrix : use GGUF to store importance matrices (#9400) 6 bulan lalu
llama-bench fffcce535e llama-bench : add --no-warmup flag (#14224) (#14270) 7 bulan lalu
main abf241045d main : honor --verbose-prompt on interactive prompts (#14350) 6 bulan lalu
mtmd 28657a8229 ggml : implement GEGLU_ERF and GEGLU_QUICK ops (#14445) 6 bulan lalu
perplexity 745aa5319b llama : deprecate llama_kv_self_ API (#14030) 7 bulan lalu
quantize 90083283ec imatrix : use GGUF to store importance matrices (#9400) 6 bulan lalu
rpc c508256db2 rpc : Fix build on OpenBSD (#13541) 7 bulan lalu
run a457551332 cmake : do not search for curl libraries by ourselves (#14613) 6 bulan lalu
server 6ffd4e9c44 server : pre-calculate EOG logit biases (#14721) 6 bulan lalu
tokenize 1d36b3670b llama : move end-user examples to tools directory (#13249) 8 bulan lalu
tts 53f925074d sync : vendor (#13901) 7 bulan lalu
CMakeLists.txt 9b61acf060 mtmd : rename llava directory to mtmd (#13311) 8 bulan lalu