cturan/llama.cpp

Author	SHA1 Message	Date
Juk Armstrong	daa422881a llama : DeepSeek V2/V3 MLA implementation (#12801)	9 months ago
Srihari-mcw	eccc7a1602 ggml : Add AVX512 implementation of GEMM - Q4_Kx8 (#12829)	9 months ago
Chenguang Li	0019279bb5 CANN: Opt ROPE optimization (#12865)	9 months ago
Xinpeng Dou	b0c75ac9f9 CANN: Optimize CANN buffer pool memory management (#12875)	9 months ago
Russyyds	d6d2c2ab8c Add performance print for gemma3 in example (#12929)	9 months ago
Akarshan Biswas	75afa0ae31 SYCL: Fix im2col (#12910)	9 months ago
Radoslav Gerganov	c772d54926 rpc : use ggml_context_ptr (#12938)	9 months ago
Neo Zhang Jianyu	81c7e64fc2 dsiable curl lib check, this action is missed by commit bd3f59f81289b920bcc597a208c14f55e39ed37e (#12761) (#12937)	9 months ago
Georgi Gerganov	526739b879 sync : ggml	9 months ago
cmdr2	a25355e264 cpu: fix cpu backend's supports-op for GET_ROWS_BACK. fixes a fatal when running test-backend-ops with only the CPU backend (ggml/1190)	9 months ago
SXX	e959d32b1c ggml: use _mm[512/256]_dpbusd[_avx]_epi32 to directly accumulate into the result register (#12773)	9 months ago
Alan Gray	307bfa253d ggml: disable CUDA graphs for unsupported DUP and CONT node types (#12891)	9 months ago
Ed Addario	71e90e8813 quantize: Handle user-defined quantization levels for additional tensors (#12511)	9 months ago
Prajwal B Mehendarkar	bc091a4dc5 common : Define cache directory on AIX (#12915)	9 months ago
Jeff Bolz	a4837577aa vulkan: use aligned loads for flash attention mask (#12853)	9 months ago
Matt Clayton	e59ea539b8 llava: Fix cpu-only clip image encoding sefault (#12907)	9 months ago
Georgi Gerganov	c94085df28 server : add VSCode's Github Copilot Chat support (#12896)	9 months ago
yuri@FreeBSD	e8a62631b3 rpc : Set cache directory in rpc-server.cpp on FreeBSD (#12903)	9 months ago
Olivier Chafik	b6930ebc42 `tool-call`: fix non-tool-calling grammar crashes w/ Qwen / Hermes 2 templates (#12900)	9 months ago
yuri@FreeBSD	68b08f36d0 common : Define cache directory on FreeBSD (#12892)	9 months ago
Ewan Crawford	578754b315 sycl: Support sycl_ext_oneapi_limited_graph (#12873)	9 months ago
tastelikefeet	b2034c2b55 contrib: support modelscope community (#12664)	9 months ago
Yuxuan Zhang	06bb53ad9b llama-model : add Glm4Model implementation for GLM-4-0414 (#12867)	9 months ago
Xuan-Son Nguyen	0c50923944 clip : use smart pointer (⚠️ breaking change) (#12869)	9 months ago
Akarshan Biswas	fccf9cae83 SYCL: Add fp16 type support to unary op kernels (#12788)	9 months ago
Daniel Han	ec6c09d0fa convert : Llama4 RoPE fix (#12889)	9 months ago
R0CKSTAR	8ac9f5d765 ci : Replace freediskspace to free_disk_space in docker.yml (#12861)	9 months ago
Daniel Bevenius	12e9158f25 xcf : add check for visionos build version (#12854)	9 months ago
Xuan-Son Nguyen	5b1f13cb64 convert : proper tensor name mapping for llama4 (#12870)	9 months ago
Xuan-Son Nguyen	8b91d5355a llama : correct rms norm for llama 4 (#12882)	9 months ago

