cturan/llama.cpp @ 95e18884fc7ea4031f70f1a518d5d1df616e5717

City c104023994 mtmd : Use RMS norm for InternVL 3 38B and 78B mmproj (#13459)		8 months ago
..
batched-bench	1d36b3670b llama : move end-user examples to tools directory (#13249)	8 months ago
cvector-generator	1d36b3670b llama : move end-user examples to tools directory (#13249)	8 months ago
export-lora	1d36b3670b llama : move end-user examples to tools directory (#13249)	8 months ago
gguf-split	1d36b3670b llama : move end-user examples to tools directory (#13249)	8 months ago
imatrix	efb8b47eda imatrix : Add --parse-special for enabling parsing of special tokens in imatrix calculation (#13389)	8 months ago
llama-bench	7f323a589f Add `--no-op-offload` to improve `-ot` pp perf in MoE models like llama4 400B (#13386)	8 months ago
main	27ebfcacba llama : do not crash if there is no CPU backend (#13395)	8 months ago
mtmd	c104023994 mtmd : Use RMS norm for InternVL 3 38B and 78B mmproj (#13459)	8 months ago
perplexity	51fb96b1ff context : remove logits_all flag (#13284)	8 months ago
quantize	1d36b3670b llama : move end-user examples to tools directory (#13249)	8 months ago
rpc	27ebfcacba llama : do not crash if there is no CPU backend (#13395)	8 months ago
run	0527771dd8 llama-run: add support for downloading models from ModelScope (#13370)	8 months ago
server	9a390c4829 tools : fix uninitialized llama_batch in server (#13436)	8 months ago
tokenize	1d36b3670b llama : move end-user examples to tools directory (#13249)	8 months ago
tts	1d36b3670b llama : move end-user examples to tools directory (#13249)	8 months ago
CMakeLists.txt	9b61acf060 mtmd : rename llava directory to mtmd (#13311)	8 months ago