AN Long
|
cd6983d56d
ggml : fix field name when new ggml_backend (#14944)
|
5 months ago |
Jeff Bolz
|
c4f53563df
vulkan: support fattn sinks (#15126)
|
5 months ago |
Jeff Bolz
|
a0552c8bee
vulkan: Add env var to disable host visible vidmem (#15109)
|
5 months ago |
Georgi Gerganov
|
fd1234cb46
llama : add gpt-oss (#15091)
|
5 months ago |
Jeff Bolz
|
5aa1105da2
vulkan: fix build when using glslang that does not support coopmat2 (#15062)
|
5 months ago |
Jeff Bolz
|
6c7a441161
vulkan: Use coopmat2 for conv2d (#14982)
|
5 months ago |
Jeff Bolz
|
4cb208c93c
vulkan: coopmat2 mul_mat optimizations (#14934)
|
5 months ago |
Jeff Bolz
|
ec0b18802c
vulkan: Support ne[3]>1 in noncontig matrix-vector multiply (#15015)
|
5 months ago |
Jeff Bolz
|
a9f7541ec2
vulkan: optimizations for direct convolution (#14933)
|
5 months ago |
Ruben Ortlam
|
e08a98826b
Vulkan: Fix minor debug mode issues (#14899)
|
5 months ago |
Kai Pastor
|
73a8e5ca03
vulkan : fix 32-bit builds (ggml/1313)
|
6 months ago |
Erik Scholz
|
89d1029559
vulkan : add fp16 support for the conv_2d kernel (#14872)
|
6 months ago |
Jeff Bolz
|
f1a4e72de5
vulkan: skip empty set_rows to avoid invalid API usage (#14860)
|
6 months ago |
Jeff Bolz
|
84712b6043
vulkan: fix rms_norm_mul to handle broadcasting dim0 (#14817)
|
6 months ago |
Ervin Áron Tasnádi
|
a979ca22db
ggml: adds CONV_2D op and direct GEMM Vulkan implementation (#14316)
|
6 months ago |
Peter0x44
|
d4b91ea7b2
vulkan: Add logging for bf16 features to ggml_vk_print_gpu_info (#13274) (#14707)
|
6 months ago |
Jeff Bolz
|
ba1ceb3456
vulkan: fix noncontig check for mat_mul_id splitting (#14683)
|
6 months ago |
Jeff Bolz
|
10a0351a97
vulkan: add RTE variants for glu/add/sub/mul/div (#14653)
|
6 months ago |
Georgi Gerganov
|
3120413ccd
vulkan : remove unused vars (#0)
|
6 months ago |
Acly
|
74bb294591
vulkan : implement bilinear interpolation (ggml/1291)
|
6 months ago |
Acly
|
3e303b1107
vulkan : implement ggml_roll (ggml/1290)
|
6 months ago |
Jeff Bolz
|
b3ad3a0191
vulkan: support SET_ROWS (#14587)
|
6 months ago |
Jeff Bolz
|
98197e5c98
vulkan: optimizations for deepseek prompt processing (#14555)
|
6 months ago |
Xuan-Son Nguyen
|
98bab638fb
ggml : add ggml_scale_bias (#14417)
|
6 months ago |
Jeff Bolz
|
6efcd65945
vulkan: optimize flash attention split_k_reduce (#14554)
|
6 months ago |
Jeff Bolz
|
e592be1575
vulkan: fix rms_norm+mul fusion (#14545)
|
6 months ago |
Jeff Bolz
|
a0374a67e2
vulkan: Handle updated FA dim2/3 definition (#14518)
|
6 months ago |
Sigbjørn Skjæret
|
28657a8229
ggml : implement GEGLU_ERF and GEGLU_QUICK ops (#14445)
|
6 months ago |
Jeff Bolz
|
2b72bedec1
vulkan: support mixed/deepseekR1 FA head sizes (#14509)
|
6 months ago |
Georgi Gerganov
|
a70c8a0c4b
kv-cache : use ggml_set_rows (#14285)
|
6 months ago |