Ed Addario e9192bec56 quantize : fix using combined imatrix GGUFs (multiple datasets) (#14973) hai 5 meses
..
batched-bench 225e7a1438 llama : add high-throughput mode (#14363) hai 6 meses
cvector-generator 745aa5319b llama : deprecate llama_kv_self_ API (#14030) hai 7 meses
export-lora 749e0d27f0 mtmd : fix 32-bit narrowing issue in export-lora and mtmd clip (#14503) hai 5 meses
gguf-split e9b6350e61 scripts : make the shell scripts cross-platform (#14341) hai 6 meses
imatrix d1aa0cc5d1 imatrix: add option to display importance score statistics for a given imatrix file (#12718) hai 6 meses
llama-bench c556418b60 llama-bench : use local GPUs along with RPC servers (#14917) hai 5 meses
main c82d48ec23 llama : fix `--reverse-prompt` crashing issue (#14794) hai 6 meses
mtmd 00fa15fedc mtmd : add support for Voxtral (#14862) hai 5 meses
perplexity 745aa5319b llama : deprecate llama_kv_self_ API (#14030) hai 7 meses
quantize e9192bec56 quantize : fix using combined imatrix GGUFs (multiple datasets) (#14973) hai 5 meses
rpc c508256db2 rpc : Fix build on OpenBSD (#13541) hai 7 meses
run a457551332 cmake : do not search for curl libraries by ourselves (#14613) hai 6 meses
server 41e78c567e server : add support for `embd_normalize` parameter (#14964) hai 5 meses
tokenize 1d36b3670b llama : move end-user examples to tools directory (#13249) hai 8 meses
tts 53f925074d sync : vendor (#13901) hai 7 meses
CMakeLists.txt 9b61acf060 mtmd : rename llava directory to mtmd (#13311) hai 8 meses