Ruben Ortlam
|
7f3e9d339c
vulkan: iGPU memory reporting fix (#17110)
|
2 months ago |
Ruben Ortlam
|
8a3519b708
vulkan: fix mmq out of bounds reads (#17108)
|
2 months ago |
Jeff Bolz
|
80a6cf6347
vulkan: fuse mul_mat_id + mul (#17095)
|
2 months ago |
Jeff Bolz
|
53d7d21e61
vulkan: Use spec constants for conv2d s/d/p and kernel W/H (#16978)
|
2 months ago |
Jeff Bolz
|
b4e335d8dc
vulkan: fuse rms_norm + mul + rope (+ view + set_rows) (#16977)
|
2 months ago |
Jeff Bolz
|
d6fe40fa00
vulkan: Fix test-thread-safety crashes (#17024)
|
2 months ago |
Acly
|
ac76d36201
vulkan : refactor buffer handling in vk_op_f32 (#16840)
|
2 months ago |
Jeff Bolz
|
a44d77126c
vulkan: Fix GGML_VULKAN_CHECK_RESULTS to better handle fusion (#16919)
|
2 months ago |
Jeff Bolz
|
ad51c0a720
vulkan: remove the need for the dryrun (#16826)
|
2 months ago |
Jeff Bolz
|
5d8bb900bc
vulkan: Fix multi_add invalid descriptor usage (#16899)
|
2 months ago |
Jeff Bolz
|
2e76e01360
vulkan: fuse mul_mat+add and mul_mat_id+add_id (#16868)
|
2 months ago |
Masato Nakasaka
|
2976b0374d
vulkan: Fix crash when FP16 mul_mat accumulation is not supported (#16796)
|
2 months ago |
JJJYmmm
|
d261223d24
model: add support for qwen3vl series (#16780)
|
3 months ago |
Jeff Bolz
|
052df28b0e
vulkan: Handle argsort with a large number of rows (#16851)
|
3 months ago |
Jeff Bolz
|
b9ce940177
vulkan: Fuse rope+set_rows (#16769)
|
3 months ago |
Jeff Bolz
|
10fcc41290
vulkan: Update topk_moe fusion to handle gpt's late softmax (#16656)
|
3 months ago |
Ruben Ortlam
|
bcf5bda6f5
Vulkan MMQ Integer Dot Refactor and K-Quant support (#16536)
|
3 months ago |
Jeff Bolz
|
f549b0007d
vulkan: Call ggml_vk_buffer_write_2d from ggml_vk_buffer_copy (#16793)
|
3 months ago |
Acly
|
10640e31aa
ggml : fix interpolate with align-corners and ne=1 (#16700)
|
3 months ago |
Gilad S.
|
3cfa9c3f12
vulkan: deduplicate Microsoft Direct3D12 devices (#16689)
|
3 months ago |
Giuseppe Scrivano
|
f90b4a8efe
vulkan: delete dead code (#16732)
|
3 months ago |
Jeff Bolz
|
8423d01931
vulkan: Optimize SSM_SCAN (#16645)
|
3 months ago |
Jeff Bolz
|
e56abd2098
vulkan: Implement topk_moe fused shader, ported from CUDA (#16641)
|
3 months ago |
Giuseppe Scrivano
|
3d4e86bbeb
vulkan: Add State Space Model (SSM) Operations Support (#16463)
|
3 months ago |
Jeff Bolz
|
4258e0cfe7
vulkan: Support FA with K/V in F32 (#16543)
|
3 months ago |
Jeff Bolz
|
2aaf0a2a20
vulkan: Replace uses of maxMemoryAllocationSize and VK_WHOLE_SIZE (#16354)
|
3 months ago |
Jeff Bolz
|
e308efda8e
vulkan: in flash attention, bounds check against nem1 (don't rely on GGML_KQ_MASK_PAD) (#16316)
|
3 months ago |
Eve
|
132d673554
vulkan: make ggml_vk_default_dispatcher support older vulkan headers (#16345)
|
3 months ago |
Jeff Bolz
|
d8359f5fde
vulkan: 64-bit im2col (#16135)
|
4 months ago |
Jeff Bolz
|
1384abf8b8
vulkan: handle mat_mul with A matrix > 4GB (#16176)
|
4 months ago |