cturan/llama.cpp

Author	SHA1 Message	Date
Georgi Gerganov	67ae5312e2 metal : fix thread-safety (#14300)	7 months ago
Georgi Gerganov	692e3cdd0a memory : rename interface to llama_memory_context_i (#14296)	7 months ago
Daniel Han	b23fa0b3f4 convert : fix Llama 4 conversion (#14311)	7 months ago
Georgi Gerganov	06cbedfca1 sync : ggml	7 months ago
Acly	b7147673f2 Add `ggml_roll` (ggml/1274)	7 months ago
David Chiu	d860dd99a4 docs : fix the link to llama.h (#14293)	7 months ago
Aman Gupta	c959f462a0 CUDA: add conv_2d_transpose (#14287)	7 months ago
Sigbjørn Skjæret	22015b2092 lint : remove trailing whitepace (#14304)	7 months ago
Ruikai Peng	dd6e6d0b6a vocab : prevent tokenizer overflow (#14301)	7 months ago
Nicolò Scipione	8308f98c7f sycl: add usage of enqueue_functions extension (#14244)	7 months ago
Christian Kastner	6369be0735 Implement GGML_CPU_ALL_VARIANTS for PowerPC (#14286)	7 months ago
Sigbjørn Skjæret	88fc854b4b llama : improve sep token handling (#14272)	7 months ago
Diego Devesa	e28c1b93fd cuda : synchronize graph capture and cublas handle destruction (#14288)	7 months ago
Georgi Gerganov	d27b3ca175 ggml : fix repack work size for mul_mat_id (#14292)	7 months ago
Charles Xu	9230dbe2c7 ggml: Update KleidiAI to v1.9.0 (#14277)	7 months ago
Georgi Gerganov	812939a9e9 model : more uniform output id handling (#14275)	7 months ago
Georgi Gerganov	4c9fdfbe15 ubatch : new splitting logic (#14217)	7 months ago
Aman Gupta	9eaa51e7f0 CUDA: add conv_2d_dw (#14265)	7 months ago
Diego Devesa	8f71d0f3e8 ggml-cpu : remove unnecesary arm feature detection (#14281)	7 months ago
Alex Trotta	381174bbda gguf-py : make sentencepiece optional (#14200)	7 months ago
aa956	d67341dc18 server : add server parameters for draft model cache type (#13782)	7 months ago
fanyang	456af35eb7 build : suppress gcc15 compile warnings (#14261)	7 months ago
Anton Mitkov	600e3e9b50 sycl: Cleanup codepaths in Get Rows in sycl backend (#14215)	7 months ago
bashayer hijji	fffcce535e llama-bench : add --no-warmup flag (#14224) (#14270)	7 months ago
pqnet	5fc7856815 convert : fix remote option in Windows (#14100)	7 months ago
Aaron Teo	faed5a5f5d llamafile : support s390x SIMD instruction set (#14273)	7 months ago
0cc4m	10bb545c5b Vulkan: Set device max size for host memory to avoid OOM warning and fallback to CPU buffer (#14249)	7 months ago
Gabe Goodhart	edc4a29eff memory : Hybrid recurrent cache (#13979)	7 months ago
Georgi Gerganov	ed3290ab34 metal : add mean kernel (#14267)	7 months ago
Aaron Teo	8d94713654 docs: add s390x build documentation (#14264)	7 months ago

