Sergey Alirzaev d82f6aa34a server : removed obsolete doc (#15670) 4 months ago
batched-bench b3964c1e89 metal : optimize FA vec for large sequences and BS <= 8 (#15566) 4 months ago
cvector-generator 745aa5319b llama : deprecate llama_kv_self_ API (#14030) 7 months ago
export-lora 749e0d27f0 mtmd : fix 32-bit narrowing issue in export-lora and mtmd clip (#14503) 5 months ago
gguf-split e9b6350e61 scripts : make the shell scripts cross-platform (#14341) 6 months ago
imatrix 19f68fa5a4 imatrix : warn when GGUF imatrix is saved without .gguf suffix (#15076) 5 months ago
llama-bench 9ebebef62f llama : remove KV cache defragmentation logic (#15473) 4 months ago
main d35a1e8c41 cli : change log to warning to explain reason for stopping (#15604) 4 months ago
mtmd 8ce3ff1d91 mtmd : fix mtmd ios build (#15579) 4 months ago
perplexity 3ea913f1ce perplexity: give more information about constraints on failure (#15303) 5 months ago
quantize fd1234cb46 llama : add gpt-oss (#15091) 5 months ago
rpc c508256db2 rpc : Fix build on OpenBSD (#13541) 7 months ago
run a457551332 cmake : do not search for curl libraries by ourselves (#14613) 6 months ago
server d82f6aa34a server : removed obsolete doc (#15670) 4 months ago
tokenize 1d36b3670b llama : move end-user examples to tools directory (#13249) 8 months ago
tts d2fcd91cf9 server : disable context shift by default (#15416) 5 months ago
CMakeLists.txt 9b61acf060 mtmd : rename llava directory to mtmd (#13311) 8 months ago