Xuan-Son Nguyen f08c4c0d8d mtmd : clean up clip_n_output_tokens (#15391) 5 月之前
..
batched-bench 225e7a1438 llama : add high-throughput mode (#14363) 6 月之前
cvector-generator 745aa5319b llama : deprecate llama_kv_self_ API (#14030) 7 月之前
export-lora 749e0d27f0 mtmd : fix 32-bit narrowing issue in export-lora and mtmd clip (#14503) 5 月之前
gguf-split e9b6350e61 scripts : make the shell scripts cross-platform (#14341) 6 月之前
imatrix 19f68fa5a4 imatrix : warn when GGUF imatrix is saved without .gguf suffix (#15076) 5 月之前
llama-bench 476aa3fd57 Fixed name `-override-tensors` to `-override-tensor` (#15129) 5 月之前
main f75b830647 chat : include kwargs in template example (#15309) 5 月之前
mtmd f08c4c0d8d mtmd : clean up clip_n_output_tokens (#15391) 5 月之前
perplexity 3ea913f1ce perplexity: give more information about constraints on failure (#15303) 5 月之前
quantize fd1234cb46 llama : add gpt-oss (#15091) 5 月之前
rpc c508256db2 rpc : Fix build on OpenBSD (#13541) 7 月之前
run a457551332 cmake : do not search for curl libraries by ourselves (#14613) 6 月之前
server d1d8241600 server : fix incoming tasks not process in order (#15395) 5 月之前
tokenize 1d36b3670b llama : move end-user examples to tools directory (#13249) 8 月之前
tts 53f925074d sync : vendor (#13901) 7 月之前
CMakeLists.txt 9b61acf060 mtmd : rename llava directory to mtmd (#13311) 8 月之前