cturan/llama.cpp

Author	SHA1 Message	Date
fanyang	456af35eb7 build : suppress gcc15 compile warnings (#14261)	7 months ago
Anton Mitkov	600e3e9b50 sycl: Cleanup codepaths in Get Rows in sycl backend (#14215)	7 months ago
bashayer hijji	fffcce535e llama-bench : add --no-warmup flag (#14224) (#14270)	7 months ago
pqnet	5fc7856815 convert : fix remote option in Windows (#14100)	7 months ago
Aaron Teo	faed5a5f5d llamafile : support s390x SIMD instruction set (#14273)	7 months ago
0cc4m	10bb545c5b Vulkan: Set device max size for host memory to avoid OOM warning and fallback to CPU buffer (#14249)	7 months ago
Gabe Goodhart	edc4a29eff memory : Hybrid recurrent cache (#13979)	7 months ago
Georgi Gerganov	ed3290ab34 metal : add mean kernel (#14267)	7 months ago
Aaron Teo	8d94713654 docs: add s390x build documentation (#14264)	7 months ago
Aaron Teo	50d2227953 ggml-cpu: reduce asm calls for hsum (#14037)	7 months ago
Aaron Teo	6231c5cd6d ggml-cpu: fix uncaught underscore terminators (#14023)	7 months ago
Charles Xu	ef035803eb ggml: Add Apple support for GGML_CPU_ALL_VARIANTS (#14258)	7 months ago
Xuan-Son Nguyen	413977de32 mtmd : refactor llava-uhd preprocessing logic (#14247)	7 months ago
Xuan-Son Nguyen	95402553a5 llama-chat : fix multiple system message for gemma, orion (#14246)	7 months ago
Sigbjørn Skjæret	3865cff4f5 convert : fix null head_dim AutoConfig regression (#14248)	7 months ago
Georgi Gerganov	d03172cc79 sync : ggml	7 months ago
Daniel Bevenius	dd8e59f443 ggml : disable warnings for tests when using MSVC (ggml/1273)	7 months ago
Daniel Bevenius	bbe98d2784 ggml : remove unused ggml_context_container (ggml/1272)	7 months ago
Daniel Bevenius	c2056ed6d4 examples : include examples in msvc disable warn (ggml/1270)	7 months ago
bandoti	c46503014d cmake: remove shader-gen step-targets from ggml-vulkan (#14226)	7 months ago
xctan	860a9e4eef ggml-cpu : remove the weak alias trick (#14221)	7 months ago
R0CKSTAR	fe9d60e74a musa: fix build warning (unused variable) (#14231)	7 months ago
Sigbjørn Skjæret	e434e69183 common : suggest --jinja when autodetection fails (#14222)	7 months ago
Georgi Gerganov	89fea80d29 server : fix incorrect usage of llama_get_embeddings() (#14225)	7 months ago
Diego Devesa	6adc3c3ebc llama : add thread safety test (#14035)	7 months ago
bandoti	0dbcabde8c cmake: clean up external project logic for vulkan-shaders-gen (#14179)	7 months ago
Đinh Trọng Huy	ad590be98c model : add NeoBERT (#14164)	7 months ago
uvos	7d6d91babf HIP: disable rocwmma on gfx12 by default until rocm 7.0 (#14202)	7 months ago
Georgi Gerganov	d3e64b9f49 llama : rework embeddings logic (#14208)	7 months ago
Charles Xu	3ba0d843c6 ggml: Add Android support for GGML_CPU_ALL_VARIANTS (#14206)	7 months ago

Newer Older

Commit History Find

Commit History