Georgi Gerganov 3637576288 server : disable speculative decoding for SWA models (#13970) il y a 7 mois
..
batched-bench b89d605a91 batched-bench : fix pp batch contents (#13492) il y a 8 mois
cvector-generator 1d36b3670b llama : move end-user examples to tools directory (#13249) il y a 8 mois
export-lora 1d36b3670b llama : move end-user examples to tools directory (#13249) il y a 8 mois
gguf-split 1d36b3670b llama : move end-user examples to tools directory (#13249) il y a 8 mois
imatrix efb8b47eda imatrix : Add --parse-special for enabling parsing of special tokens in imatrix calculation (#13389) il y a 8 mois
llama-bench 053b1539c0 threading: support for GGML_SCHED_PRIO_LOW, update thread info on Windows to avoid throttling (#12995) il y a 7 mois
main 27ebfcacba llama : do not crash if there is no CPU backend (#13395) il y a 8 mois
mtmd bfd322796c mtmd : fix memory leak in mtmd_helper_eval_chunk_single (#13961) il y a 7 mois
perplexity 51fb96b1ff context : remove logits_all flag (#13284) il y a 8 mois
quantize e5c834f718 quantize : improve tensor-type pattern matching (#13033) il y a 8 mois
rpc c508256db2 rpc : Fix build on OpenBSD (#13541) il y a 7 mois
run 53f925074d sync : vendor (#13901) il y a 7 mois
server 3637576288 server : disable speculative decoding for SWA models (#13970) il y a 7 mois
tokenize 1d36b3670b llama : move end-user examples to tools directory (#13249) il y a 8 mois
tts 53f925074d sync : vendor (#13901) il y a 7 mois
CMakeLists.txt 9b61acf060 mtmd : rename llava directory to mtmd (#13311) il y a 8 mois