Isaac McFadyen 6a2bc8bfb7 server : added --no-prefill-assistant flag (#13608) 8 mesi fa
..
batched-bench b89d605a91 batched-bench : fix pp batch contents (#13492) 8 mesi fa
cvector-generator 1d36b3670b llama : move end-user examples to tools directory (#13249) 8 mesi fa
export-lora 1d36b3670b llama : move end-user examples to tools directory (#13249) 8 mesi fa
gguf-split 1d36b3670b llama : move end-user examples to tools directory (#13249) 8 mesi fa
imatrix efb8b47eda imatrix : Add --parse-special for enabling parsing of special tokens in imatrix calculation (#13389) 8 mesi fa
llama-bench 6c8b91500e llama-bench : fix -ot with dl backends (#13563) 8 mesi fa
main 27ebfcacba llama : do not crash if there is no CPU backend (#13395) 8 mesi fa
mtmd 71bdbdb587 clip : clip.h become private API (⚠️ breaking change) (#13510) 8 mesi fa
perplexity 51fb96b1ff context : remove logits_all flag (#13284) 8 mesi fa
quantize e5c834f718 quantize : improve tensor-type pattern matching (#13033) 8 mesi fa
rpc 27ebfcacba llama : do not crash if there is no CPU backend (#13395) 8 mesi fa
run 0527771dd8 llama-run: add support for downloading models from ModelScope (#13370) 8 mesi fa
server 6a2bc8bfb7 server : added --no-prefill-assistant flag (#13608) 8 mesi fa
tokenize 1d36b3670b llama : move end-user examples to tools directory (#13249) 8 mesi fa
tts 1d36b3670b llama : move end-user examples to tools directory (#13249) 8 mesi fa
CMakeLists.txt 9b61acf060 mtmd : rename llava directory to mtmd (#13311) 8 mesi fa