Georgi Gerganov f088b6a84f server : adjust prompt similarity thold + add logs (#15913) 4 months ago
batched-bench a885dcff11 batched-bench : fix llama_synchronize usage during prompt processing (#15835) 4 months ago
cvector-generator 745aa5319b llama : deprecate llama_kv_self_ API (#14030) 8 months ago
export-lora 749e0d27f0 mtmd : fix 32-bit narrowing issue in export-lora and mtmd clip (#14503) 6 months ago
gguf-split e9b6350e61 scripts : make the shell scripts cross-platform (#14341) 7 months ago
imatrix 19f68fa5a4 imatrix : warn when GGUF imatrix is saved without .gguf suffix (#15076) 6 months ago
llama-bench 360d6533db ggml-backend : add GGML_BACKEND_DEVICE_TYPE_IGPU device type (#15797) 4 months ago
main d35a1e8c41 cli : change log to warning to explain reason for stopping (#15604) 5 months ago
mtmd 70cd37dbbe requirements : update transformers/torch for Embedding Gemma (#15828) 4 months ago
perplexity 3ea913f1ce perplexity: give more information about constraints on failure (#15303) 5 months ago
quantize fd1234cb46 llama : add gpt-oss (#15091) 6 months ago
rpc c508256db2 rpc : Fix build on OpenBSD (#13541) 8 months ago
run a457551332 cmake : do not search for curl libraries by ourselves (#14613) 6 months ago
server f088b6a84f server : adjust prompt similarity thold + add logs (#15913) 4 months ago
tokenize 1d36b3670b llama : move end-user examples to tools directory (#13249) 9 months ago
tts e92d53b29e sampling : optimize samplers by reusing bucket sort (#15665) 5 months ago
CMakeLists.txt 9b61acf060 mtmd : rename llava directory to mtmd (#13311) 9 months ago