| .. |
|
batched-bench
|
6b64f74b55
batched-bench : fix unified KV cache handling + pp timing (#15562)
|
4 months ago |
|
cvector-generator
|
745aa5319b
llama : deprecate llama_kv_self_ API (#14030)
|
7 months ago |
|
export-lora
|
749e0d27f0
mtmd : fix 32-bit narrowing issue in export-lora and mtmd clip (#14503)
|
5 months ago |
|
gguf-split
|
e9b6350e61
scripts : make the shell scripts cross-platform (#14341)
|
6 months ago |
|
imatrix
|
19f68fa5a4
imatrix : warn when GGUF imatrix is saved without .gguf suffix (#15076)
|
5 months ago |
|
llama-bench
|
9ebebef62f
llama : remove KV cache defragmentation logic (#15473)
|
4 months ago |
|
main
|
f75b830647
chat : include kwargs in template example (#15309)
|
5 months ago |
|
mtmd
|
c4e9239064
model : support MiniCPM-V 4.5 (#15575)
|
4 months ago |
|
perplexity
|
3ea913f1ce
perplexity: give more information about constraints on failure (#15303)
|
5 months ago |
|
quantize
|
fd1234cb46
llama : add gpt-oss (#15091)
|
5 months ago |
|
rpc
|
c508256db2
rpc : Fix build on OpenBSD (#13541)
|
7 months ago |
|
run
|
a457551332
cmake : do not search for curl libraries by ourselves (#14613)
|
6 months ago |
|
server
|
9ebebef62f
llama : remove KV cache defragmentation logic (#15473)
|
4 months ago |
|
tokenize
|
1d36b3670b
llama : move end-user examples to tools directory (#13249)
|
8 months ago |
|
tts
|
d2fcd91cf9
server : disable context shift by default (#15416)
|
5 months ago |
|
CMakeLists.txt
|
9b61acf060
mtmd : rename llava directory to mtmd (#13311)
|
8 months ago |