cturan/llama.cpp

Autor	SHA1 Mensagem	Data
slaren	6c89eb0b47 ci : disable rocm image creation (#9340)	1 ano atrás
Xuan Son Nguyen	9b2c24c099 server : simplify state machine for slot (#9283)	1 ano atrás
Aarni Koskela	134bc38ecf llama-bench : log benchmark progress (#9287)	1 ano atrás
Aarni Koskela	815b1fb20a batched-bench : add `--output-format jsonl` option (#9293)	1 ano atrás
Changyeon Kim	409dc4f8bb ggml : fix build break for the vulkan-debug (#9265)	1 ano atrás
Xuan Son Nguyen	4a1411b4f1 server : fix missing lock (#9334)	1 ano atrás
Markus Tavenrath	8ebe8ddebd Improve Vulkan shader build system (#9239)	1 ano atrás
compilade	9bc6db28d0 ggml-quants : ternary packing for TriLMs and BitNet b1.58 (#8151)	1 ano atrás
awatuna	32b2ec88bc Update build.yml (#9184)	1 ano atrás
Michael Podvitskiy	1031771faa CMake fix: host for msvc compiler can only be x86 or x64 (#8624)	1 ano atrás
slaren	4db04784f9 cuda : fix defrag with quantized KV (#9319)	1 ano atrás
slaren	bdf314f38a llama-bench : fix NUL terminators in CPU name (#9313)	1 ano atrás
Srihari-mcw	581c305186 ggml : AVX2 support for Q4_0_8_8 (#8713)	1 ano atrás
Ouadie EL FAROUKI	5910ea9427 [SYCL] Fix DMMV dequantization (#9279)	1 ano atrás
杨朱 · Kiki	c8671ae282 Fix broken links in docker.md (#9306)	1 ano atrás
Radoslav Gerganov	82e3b03c11 rpc : make RPC servers come first in the device list (#9296)	1 ano atrás
Pascal Patry	9379d3cc17 readme : rename result_format to response_format (#9300)	1 ano atrás
Georgi Gerganov	7605ae7daf flake.lock: Update (#9261)	1 ano atrás
Aarni Koskela	8962422b1c llama-bench : add JSONL (NDJSON) output mode (#9288)	1 ano atrás
Georgi Gerganov	b69a480af4 readme : refactor API section + remove old hot topics	1 ano atrás
Xuan Son Nguyen	48baa61ecc server : test script : add timeout for all requests (#9282)	1 ano atrás
Zhenwei Jin	f1485161e5 src: make tail invalid when kv cell is intersection for mamba (#9249)	1 ano atrás
slaren	048de848ee docker : fix missing binaries in full-cuda image (#9278)	1 ano atrás
yuri@FreeBSD	f771d064a9 ggml : add pthread includes on FreeBSD (#9258)	1 ano atrás
Xuan Son Nguyen	6e7d133a5f server : refactor multitask handling (#9274)	1 ano atrás
Guoliang Hua	b60074f1c2 llama-cli : remove duplicated log message (#9275)	1 ano atrás
Tushar	9c1ba55733 build(nix): Package gguf-py (#5664)	1 ano atrás
Georgi Gerganov	c6d4cb4655 llama : minor style	1 ano atrás
Molly Sophia	8f1d81a0b6 llama : support RWKV v6 models (#8980)	1 ano atrás
Echo Nolan	a47667cff4 nix: fix CUDA build - replace deprecated autoAddOpenGLRunpathHook	1 ano atrás

Mais recente Mais Antigo

Histórico de commits Buscar

Histórico de commits