cturan/llama.cpp

Autor	SHA1 Mensaxe	Data
Sigbjørn Skjæret	65c797c4fa chat : fix yandex chat template (#15116)	hai 6 meses
stevenkuang	25726898e8 chat : fix hunyuan auto-detection (#15114)	hai 6 meses
Chenguang Li	2241453252 CANN: add support for ACL Graph (#15065)	hai 6 meses
Reese Levine	9515c6131a ggml: WebGPU disable SET_ROWS for now (#15078)	hai 6 meses
Georgi Gerganov	fd1234cb46 llama : add gpt-oss (#15091)	hai 6 meses
Sigbjørn Skjæret	f324a3b715 chat : only remove double bos/eos if added (#15086)	hai 6 meses
Georgi Gerganov	be42642581 readme : update hot topics (#15097)	hai 6 meses
Romain Biessy	3306ceabf0 sycl: fix mul_mat selection (#15092)	hai 6 meses
Juk Armstrong	c81de6e107 Fix `glm4moe` bug (#15088)	hai 6 meses
Alex Wu	22f060c9c4 webui: fix markdown table (#15081)	hai 6 meses
compilade	ee3a9fcf88 context : fix index overflow on huge outputs (#15080)	hai 6 meses
Diego Devesa	ec428b02c3 llama : add --n-cpu-moe option (#15077)	hai 6 meses
compilade	19f68fa5a4 imatrix : warn when GGUF imatrix is saved without .gguf suffix (#15076)	hai 6 meses
Christian Kastner	41613437ff cmake: Add GGML_BACKEND_DIR option (#15074)	hai 6 meses
Sigbjørn Skjæret	e5bebe5251 gguf-py : add --chat-template-file to gguf_new_metadata (#15075)	hai 6 meses
Sam	ef0144c087 model: support GLM 4.5 family of models (#14939)	hai 6 meses
Sigbjørn Skjæret	2721257e3e quantize : fix confusing error message if ftype is invalid (#15071)	hai 6 meses
Reese Levine	587d0118f5 ggml: WebGPU backend host improvements and style fixing (#14978)	hai 6 meses
Jeff Bolz	5aa1105da2 vulkan: fix build when using glslang that does not support coopmat2 (#15062)	hai 6 meses
compilade	d31192b4ee imatrix : use GGUF by default (#14842)	hai 6 meses
compilade	0a2f5496be imatrix : fix 3d activation handling for hybrid and recurrent models (#14994)	hai 6 meses
compilade	11a3811164 memory : handle kv_unified for hybrid models (#15050)	hai 6 meses
Csaba Kecskemeti	97366dc6ab vocab : JetBrains Mellum pre-tokenizer (#15045)	hai 6 meses
Gabriel Larson	83bc2f288c model : add text-only support for Kimi-VL (and find special tokens in text_config) (#15051)	hai 6 meses
Jeff Bolz	6c7a441161 vulkan: Use coopmat2 for conv2d (#14982)	hai 6 meses
lhez	5c0eb5ef54 opencl: fix adreno compiler detection logic (#15029)	hai 6 meses
Johannes Gäßler	03d4698218 CUDA: use mma FA kernel for gqa > 4 on RTX 4000 (#15035)	hai 6 meses
leejet	3303c19b16 cuda: make im2col a little faster (#15025)	hai 6 meses
Daniel Bevenius	4fdea540bd kv-cache : skip alignment of n_stream in kv-cache log msg [no ci] (#15040)	hai 6 meses
Georgi Gerganov	a4569c41fd llama : enable LLAMA_SET_ROWS=1 by default (#14959)	hai 6 meses

Posterior Anterior

Commit History Buscar

Commit History