cturan/llama.cpp @ 261e6a20ffdb79c4875e674b4f6b514bc73cff8f

Radoslav Gerganov 918b26f197 rpc : fix regression when --device is used (#15981)		před 4 měsíci
..
batched-bench	a885dcff11 batched-bench : fix llama_synchronize usage during prompt processing (#15835)	před 4 měsíci
cvector-generator	745aa5319b llama : deprecate llama_kv_self_ API (#14030)	před 7 měsíci
export-lora	749e0d27f0 mtmd : fix 32-bit narrowing issue in export-lora and mtmd clip (#14503)	před 6 měsíci
gguf-split	e9b6350e61 scripts : make the shell scripts cross-platform (#14341)	před 6 měsíci
imatrix	19f68fa5a4 imatrix : warn when GGUF imatrix is saved without .gguf suffix (#15076)	před 5 měsíci
llama-bench	360d6533db ggml-backend : add GGML_BACKEND_DEVICE_TYPE_IGPU device type (#15797)	před 4 měsíci
main	d1c6f11f47 doc : update documentation for --tensor-split (#15980)	před 4 měsíci
mtmd	50f4281a6f llama : allow using iGPUs with --device (#15951)	před 4 měsíci
perplexity	3ea913f1ce perplexity: give more information about constraints on failure (#15303)	před 5 měsíci
quantize	fd1234cb46 llama : add gpt-oss (#15091)	před 5 měsíci
rpc	918b26f197 rpc : fix regression when --device is used (#15981)	před 4 měsíci
run	a457551332 cmake : do not search for curl libraries by ourselves (#14613)	před 6 měsíci
server	f088b6a84f server : adjust prompt similarity thold + add logs (#15913)	před 4 měsíci
tokenize	1d36b3670b llama : move end-user examples to tools directory (#13249)	před 8 měsíci
tts	e92d53b29e sampling : optimize samplers by reusing bucket sort (#15665)	před 4 měsíci
CMakeLists.txt	9b61acf060 mtmd : rename llava directory to mtmd (#13311)	před 8 měsíci