Russyyds
|
d6d2c2ab8c
Add performance print for gemma3 in example (#12929)
|
9 months ago |
Akarshan Biswas
|
75afa0ae31
SYCL: Fix im2col (#12910)
|
9 months ago |
Radoslav Gerganov
|
c772d54926
rpc : use ggml_context_ptr (#12938)
|
9 months ago |
Neo Zhang Jianyu
|
81c7e64fc2
dsiable curl lib check, this action is missed by commit bd3f59f81289b920bcc597a208c14f55e39ed37e (#12761) (#12937)
|
9 months ago |
Georgi Gerganov
|
526739b879
sync : ggml
|
9 months ago |
cmdr2
|
a25355e264
cpu: fix cpu backend's supports-op for GET_ROWS_BACK. fixes a fatal when running test-backend-ops with only the CPU backend (ggml/1190)
|
9 months ago |
SXX
|
e959d32b1c
ggml: use _mm[512/256]_dpbusd[_avx]_epi32 to directly accumulate into the result register (#12773)
|
9 months ago |
Alan Gray
|
307bfa253d
ggml: disable CUDA graphs for unsupported DUP and CONT node types (#12891)
|
9 months ago |
Ed Addario
|
71e90e8813
quantize: Handle user-defined quantization levels for additional tensors (#12511)
|
9 months ago |
Prajwal B Mehendarkar
|
bc091a4dc5
common : Define cache directory on AIX (#12915)
|
9 months ago |
Jeff Bolz
|
a4837577aa
vulkan: use aligned loads for flash attention mask (#12853)
|
9 months ago |
Matt Clayton
|
e59ea539b8
llava: Fix cpu-only clip image encoding sefault (#12907)
|
9 months ago |
Georgi Gerganov
|
c94085df28
server : add VSCode's Github Copilot Chat support (#12896)
|
9 months ago |
yuri@FreeBSD
|
e8a62631b3
rpc : Set cache directory in rpc-server.cpp on FreeBSD (#12903)
|
9 months ago |
Olivier Chafik
|
b6930ebc42
`tool-call`: fix non-tool-calling grammar crashes w/ Qwen / Hermes 2 templates (#12900)
|
9 months ago |
yuri@FreeBSD
|
68b08f36d0
common : Define cache directory on FreeBSD (#12892)
|
9 months ago |
Ewan Crawford
|
578754b315
sycl: Support sycl_ext_oneapi_limited_graph (#12873)
|
9 months ago |
tastelikefeet
|
b2034c2b55
contrib: support modelscope community (#12664)
|
9 months ago |
Yuxuan Zhang
|
06bb53ad9b
llama-model : add Glm4Model implementation for GLM-4-0414 (#12867)
|
9 months ago |
Xuan-Son Nguyen
|
0c50923944
clip : use smart pointer (⚠️ breaking change) (#12869)
|
9 months ago |
Akarshan Biswas
|
fccf9cae83
SYCL: Add fp16 type support to unary op kernels (#12788)
|
9 months ago |
Daniel Han
|
ec6c09d0fa
convert : Llama4 RoPE fix (#12889)
|
9 months ago |
R0CKSTAR
|
8ac9f5d765
ci : Replace freediskspace to free_disk_space in docker.yml (#12861)
|
9 months ago |
Daniel Bevenius
|
12e9158f25
xcf : add check for visionos build version (#12854)
|
9 months ago |
Xuan-Son Nguyen
|
5b1f13cb64
convert : proper tensor name mapping for llama4 (#12870)
|
9 months ago |
Xuan-Son Nguyen
|
8b91d5355a
llama : correct rms norm for llama 4 (#12882)
|
9 months ago |
Aaron Teo
|
0fed24c347
ggml: fix compilation error s390x (#12848)
|
9 months ago |
Georgi Gerganov
|
47ba87d0a4
sync : ggml
|
9 months ago |
Georgi Gerganov
|
1d2b613445
tests : fix init order (#0)
|
9 months ago |
Georgi Gerganov
|
eb420e1148
sync : ggml
|
9 months ago |