| .. |
|
batched-bench
|
a885dcff11
batched-bench : fix llama_synchronize usage during prompt processing (#15835)
|
vor 4 Monaten |
|
cvector-generator
|
745aa5319b
llama : deprecate llama_kv_self_ API (#14030)
|
vor 7 Monaten |
|
export-lora
|
749e0d27f0
mtmd : fix 32-bit narrowing issue in export-lora and mtmd clip (#14503)
|
vor 5 Monaten |
|
gguf-split
|
e9b6350e61
scripts : make the shell scripts cross-platform (#14341)
|
vor 6 Monaten |
|
imatrix
|
19f68fa5a4
imatrix : warn when GGUF imatrix is saved without .gguf suffix (#15076)
|
vor 5 Monaten |
|
llama-bench
|
360d6533db
ggml-backend : add GGML_BACKEND_DEVICE_TYPE_IGPU device type (#15797)
|
vor 4 Monaten |
|
main
|
d1c6f11f47
doc : update documentation for --tensor-split (#15980)
|
vor 4 Monaten |
|
mtmd
|
50f4281a6f
llama : allow using iGPUs with --device (#15951)
|
vor 4 Monaten |
|
perplexity
|
3ea913f1ce
perplexity: give more information about constraints on failure (#15303)
|
vor 5 Monaten |
|
quantize
|
fd1234cb46
llama : add gpt-oss (#15091)
|
vor 5 Monaten |
|
rpc
|
50f4281a6f
llama : allow using iGPUs with --device (#15951)
|
vor 4 Monaten |
|
run
|
a457551332
cmake : do not search for curl libraries by ourselves (#14613)
|
vor 6 Monaten |
|
server
|
f088b6a84f
server : adjust prompt similarity thold + add logs (#15913)
|
vor 4 Monaten |
|
tokenize
|
1d36b3670b
llama : move end-user examples to tools directory (#13249)
|
vor 8 Monaten |
|
tts
|
e92d53b29e
sampling : optimize samplers by reusing bucket sort (#15665)
|
vor 4 Monaten |
|
CMakeLists.txt
|
9b61acf060
mtmd : rename llava directory to mtmd (#13311)
|
vor 8 Monaten |