cturan/llama.cpp

Author	SHA1 Message	Date
Michael Podvitskiy	1031771faa CMake fix: host for msvc compiler can only be x86 or x64 (#8624)	1 year ago
slaren	4db04784f9 cuda : fix defrag with quantized KV (#9319)	1 year ago
slaren	bdf314f38a llama-bench : fix NUL terminators in CPU name (#9313)	1 year ago
Srihari-mcw	581c305186 ggml : AVX2 support for Q4_0_8_8 (#8713)	1 year ago
Ouadie EL FAROUKI	5910ea9427 [SYCL] Fix DMMV dequantization (#9279)	1 year ago
杨朱 · Kiki	c8671ae282 Fix broken links in docker.md (#9306)	1 year ago
Radoslav Gerganov	82e3b03c11 rpc : make RPC servers come first in the device list (#9296)	1 year ago
Pascal Patry	9379d3cc17 readme : rename result_format to response_format (#9300)	1 year ago
Georgi Gerganov	7605ae7daf flake.lock: Update (#9261)	1 year ago
Aarni Koskela	8962422b1c llama-bench : add JSONL (NDJSON) output mode (#9288)	1 year ago
Georgi Gerganov	b69a480af4 readme : refactor API section + remove old hot topics	1 year ago
Xuan Son Nguyen	48baa61ecc server : test script : add timeout for all requests (#9282)	1 year ago
Zhenwei Jin	f1485161e5 src: make tail invalid when kv cell is intersection for mamba (#9249)	1 year ago
slaren	048de848ee docker : fix missing binaries in full-cuda image (#9278)	1 year ago
yuri@FreeBSD	f771d064a9 ggml : add pthread includes on FreeBSD (#9258)	1 year ago
Xuan Son Nguyen	6e7d133a5f server : refactor multitask handling (#9274)	1 year ago
Guoliang Hua	b60074f1c2 llama-cli : remove duplicated log message (#9275)	1 year ago
Tushar	9c1ba55733 build(nix): Package gguf-py (#5664)	1 year ago
Georgi Gerganov	c6d4cb4655 llama : minor style	1 year ago
Molly Sophia	8f1d81a0b6 llama : support RWKV v6 models (#8980)	1 year ago
Echo Nolan	a47667cff4 nix: fix CUDA build - replace deprecated autoAddOpenGLRunpathHook	1 year ago
Srihari-mcw	ea5d7478b1 sgemm : improved Q4_0 and Q8_0 performance via 4xN and Mx4 gemm (#8908)	1 year ago
Daniel Bevenius	49271efbaf llama : fix typo in xcda_array_view comment [no ci] (#9132)	1 year ago
Sutou Kouhei	0ab30f8d82 llama : fix llama_split_mode enum values in main_gpu document (#9057)	1 year ago
蕭澧邦	cddae4884c Correct typo run_llama2.sh > run-llama2.sh (#9149)	1 year ago
tc-mb	7ea8d80d53 llava : the function "clip" should be int (#9237)	1 year ago
Faisal Zaghloul	42c76d1358 Threadpool: take 2 (#8672)	1 year ago
Jan Boon	9f7d4bcf5c server : fix crash when error handler dumps invalid utf-8 json (#9195)	1 year ago
Georgi Gerganov	1d1ccce676 flake.lock: Update (#9162)	1 year ago
slaren	9fe94ccac9 docker : build images only once (#9225)	1 year ago


Commit History
