cturan/llama.cpp

Autor	SHA1 Mensaje	Fecha
Xuan-Son Nguyen	b4726345ac mtmd : remove libllava, remove clip-quantize-cli (⚠️ breaking change) (#13460)	hace 8 meses
Sigbjørn Skjæret	bf79371120 scripts : support arbitrary input file formats in compare-llama-bench.py (#13455)	hace 8 meses
Gabe Goodhart	d590cd4c24 model : Granite MoE shared (#13269)	hace 8 meses
Georgi Gerganov	1e2809bc4b sync : ggml	hace 8 meses
Diego Devesa	cf0a43bb64 llama-bench : add defrag-thold, check for invalid ranges (#13487)	hace 8 meses
lhez	f0d46ef157 opencl: remove unnecessary assert for `add` (#13257)	hace 8 meses
Xuan-Son Nguyen	de4c07f937 clip : cap max image size 1024 for qwen vl model (#13478)	hace 8 meses
Johannes Gäßler	10d2af0eaa llama/ggml: add LLM training support (#10544)	hace 8 meses
Georgi Gerganov	064cc596ac context : fix state io for memory-less contexts (#13470)	hace 8 meses
Anudit Nagar	91159ee9df server : allow content to be null in oaicompat_completion_params_parse (#13477)	hace 8 meses
Diego Devesa	22cdab343b llama-bench : accept ranges for integer parameters (#13410)	hace 8 meses
Dan Johansson	a71a4075cd ggml-cpu: Integrate fp32=bf16xbf16 SME KleidiAI kernel (#13053)	hace 8 meses
Johannes Gäßler	95e18884fc CUDA: fix misaligned synchronization in FA (#13469)	hace 8 meses
Xuan-Son Nguyen	df8491922f ggml : add mrope kernel for metal (#13457)	hace 8 meses
Atharva Dubey	14492144c2 enable dpcpp nightly builds with libraries (#13406)	hace 8 meses
City	c104023994 mtmd : Use RMS norm for InternVL 3 38B and 78B mmproj (#13459)	hace 8 meses
Anthony Umfer	9a390c4829 tools : fix uninitialized llama_batch in server (#13436)	hace 8 meses
Sigbjørn Skjæret	09232370fc scripts : exit compare-llama-bench.py gracefully when there's nothing to compare (#13451)	hace 8 meses
Johannes Gäßler	7474e00b34 CUDA: fix crash with partial offloading of MoE (#13439)	hace 8 meses
David Huang	7f323a589f Add `--no-op-offload` to improve `-ot` pp perf in MoE models like llama4 400B (#13386)	hace 8 meses
City	3eac209319 mtmd : support InternVL 3 38B and 78B mmproj (#13443)	hace 8 meses
Xuan-Son Nguyen	a634d75d1b mtmd : move helpers to dedicated file (#13442)	hace 8 meses
Thomas Germer	62d4250e52 docs : Fix typo in InternVL3 model name (#13440)	hace 8 meses
Johannes Gäßler	0208355f42 CUDA: fix race conditions FlashAttention kernels (#13438)	hace 8 meses
Sigbjørn Skjæret	d2a4ef05c6 vocab : add ByteDance-Seed/Seed-Coder (#13423)	hace 8 meses
Xuan-Son Nguyen	15e6125a39 mtmd : add hard limit on image resolution for qwen2vl / qwen2.5vl (#13434)	hace 8 meses
Xuan-Son Nguyen	3b24d26c22 server : update docs (#13432)	hace 8 meses
Sigbjørn Skjæret	43dfd741a5 llguidance : set tokenizer slices to default (#13424)	hace 8 meses
Thammachart Chinvarapon	b064a51a4e ci: free_disk_space flag enabled for intel variant (#13426)	hace 8 meses
Xuan-Son Nguyen	053367d149 mtmd : support InternVL 2.5 and 3 (#13422)	hace 8 meses

Posterior Anterior

Historial de Commits Buscar

Historial de Commits