Georgi Gerganov 9ebebef62f llama : remove KV cache defragmentation logic (#15473) há 4 meses atrás
..
batched-bench f0d3c7405c batched-bench : use rand tokens (#15398) há 5 meses atrás
cvector-generator 745aa5319b llama : deprecate llama_kv_self_ API (#14030) há 7 meses atrás
export-lora 749e0d27f0 mtmd : fix 32-bit narrowing issue in export-lora and mtmd clip (#14503) há 5 meses atrás
gguf-split e9b6350e61 scripts : make the shell scripts cross-platform (#14341) há 6 meses atrás
imatrix 19f68fa5a4 imatrix : warn when GGUF imatrix is saved without .gguf suffix (#15076) há 5 meses atrás
llama-bench 9ebebef62f llama : remove KV cache defragmentation logic (#15473) há 4 meses atrás
main f75b830647 chat : include kwargs in template example (#15309) há 5 meses atrás
mtmd e288693669 readme : model : mtdm : lfm2 improvements (#15476) há 4 meses atrás
perplexity 3ea913f1ce perplexity: give more information about constraints on failure (#15303) há 5 meses atrás
quantize fd1234cb46 llama : add gpt-oss (#15091) há 5 meses atrás
rpc c508256db2 rpc : Fix build on OpenBSD (#13541) há 7 meses atrás
run a457551332 cmake : do not search for curl libraries by ourselves (#14613) há 6 meses atrás
server 9ebebef62f llama : remove KV cache defragmentation logic (#15473) há 4 meses atrás
tokenize 1d36b3670b llama : move end-user examples to tools directory (#13249) há 8 meses atrás
tts d2fcd91cf9 server : disable context shift by default (#15416) há 5 meses atrás
CMakeLists.txt 9b61acf060 mtmd : rename llava directory to mtmd (#13311) há 8 meses atrás