cturan/llama.cpp

Author	SHA1 Message	Date
Piotr Wilkin (ilintar)	c753d7bed0 server : proper error handling for missing elements in messages array (OpenAI compatible backend) (#13540)	8 months ago
Georgi Gerganov	b2838049cc bench : handle decode errors (#13548)	8 months ago
Olivier Chafik	aa48e373f2 `server`: inject date_string in llama 3.x template + fix date for firefunction v2 (#12802)	8 months ago
Georgi Gerganov	e3a9421b78 kv-cache : fix out-of-bounds view during reserve graph (#13547)	8 months ago
Yibo Cai	5ab5d5fb25 arm64: optimize q6_k_q8_k kernel with i8mm (#13519)	8 months ago
Olivier Chafik	3198405e98 `common`: add partial regex support (#12808)	8 months ago
Sigbjørn Skjæret	f5170c1d7a editorconfig : fix trailing whitespace from #13542 (#13546)	8 months ago
Gilad S.	017f10b5fa fix: crash when calling `llama_state_get_size` on a context without a KV cache (#13542)	8 months ago
Johannes Gäßler	4696d56749 CUDA: fix crash on large batch size for quant. MoE (#13537)	8 months ago
Diego Devesa	b7d2672082 llama : fix quantize with dl backends (#13539)	8 months ago
Johannes Gäßler	6da34fa276 CUDA: faster Deepseek FA, add Turing support (#13435)	8 months ago
Gabe Goodhart	5e7d95e22e fix: Move build_inp_pos to the top of the graph section for build_granite (#13538)	8 months ago
Georgi Gerganov	053174436f server : passthrough the /models endpoint during loading (#13535)	8 months ago
Xuan-Son Nguyen	360a9c98e1 server : fix cache_tokens bug with no cache_prompt (#13533)	8 months ago
bandoti	09d13d94fb cmake: simplify vulkan shader test logic (#13263)	8 months ago
Jeff Bolz	24e86cae72 vulkan: KHR_coopmat flash attention (#13506)	8 months ago
Xuan-Son Nguyen	bb1681fbd5 webui : use fflate for more deterministic gzip compress (#13525)	8 months ago
Luca Stefani	d486dd3e8e webui: Allow pasting file from clipboard (#13526)	8 months ago
ddpasa	21ca987fba docs: Update link to ggml-org in multimodal.md (#13513)	8 months ago
Sigbjørn Skjæret	be1d4a13db scripts : fix compare-llama-bench.py show parameter (#13514)	8 months ago
Jeff Bolz	ab3971f2a0 vulkan: workaround FA compile failures on macos (#13517)	8 months ago
Ed Addario	e5c834f718 quantize : improve tensor-type pattern matching (#13033)	8 months ago
Xuan-Son Nguyen	71bdbdb587 clip : clip.h become private API (⚠️ breaking change) (#13510)	8 months ago
Georgi Gerganov	f0995d28ce metal : use FA-vec kernel up to batch size 20 (#13496)	8 months ago
Georgi Gerganov	c252e0c409 metal : optimize multi-sequence FA vec kernel (#13493)	8 months ago
Dan Johansson	4f711afed5 ggml-cpu: Update KleidiAI to v1.6 and fix include directives (#13509)	8 months ago
Georgi Gerganov	b89d605a91 batched-bench : fix pp batch contents (#13492)	8 months ago
Xuan-Son Nguyen	b4726345ac mtmd : remove libllava, remove clip-quantize-cli (⚠️ breaking change) (#13460)	8 months ago
Sigbjørn Skjæret	bf79371120 scripts : support arbitrary input file formats in compare-llama-bench.py (#13455)	8 months ago
Gabe Goodhart	d590cd4c24 model : Granite MoE shared (#13269)	8 months ago

Commit History