rmatif
|
92f7f0a53c
ggml: add `conv3d` op (#15182)
|
4 miesięcy temu |
Jeff Bolz
|
96452a3fa4
vulkan: Reuse conversion results in prealloc_y (#15410)
|
5 miesięcy temu |
Jeff Bolz
|
de5627910d
vulkan: Optimize argsort (#15354)
|
5 miesięcy temu |
Jeff Bolz
|
1fe00296f5
vulkan: fuse adds (#15252)
|
5 miesięcy temu |
Jeff Bolz
|
2e2b22ba66
vulkan: Add missing bounds checking to scalar/coopmat1 mul_mat_id (#15334)
|
5 miesięcy temu |
Georgi Gerganov
|
5edf1592fd
vulkan : fix out-of-bounds access in argmax kernel (#15342)
|
5 miesięcy temu |
Jonathan Graehl
|
5cdb27e091
finetune: SGD optimizer, more CLI args (#13873)
|
5 miesięcy temu |
Oliver Simons
|
6028bf7435
CUDA: Optimize `reduce_rows_f32` kernel, leading up to 25x perf improvement on kernel-level and 10% perf increase for Gemma3n (#15132)
|
5 miesięcy temu |
Georgi Gerganov
|
fd1234cb46
llama : add gpt-oss (#15091)
|
5 miesięcy temu |
Jeff Bolz
|
ec0b18802c
vulkan: Support ne[3]>1 in noncontig matrix-vector multiply (#15015)
|
5 miesięcy temu |
Sigbjørn Skjæret
|
138b288b59
cuda : add softcap fusion (#14907)
|
5 miesięcy temu |
Leonard Mosescu
|
bda62193b2
test-backend-ops : extend test case filtering (#14865)
|
5 miesięcy temu |
Erik Scholz
|
89d1029559
vulkan : add fp16 support for the conv_2d kernel (#14872)
|
5 miesięcy temu |
Aman Gupta
|
446595b9b3
Docs: add instructions for adding backends (#14889)
|
5 miesięcy temu |
Georgi Gerganov
|
18f3b5ff9e
tests : add non-cont K,V FA tests
|
6 miesięcy temu |
Aman Gupta
|
8c988fa41d
CUDA: add fused rms norm (#14800)
|
5 miesięcy temu |
Jeff Bolz
|
c2e058f1b4
vulkan/cuda: Fix im2col when KW!=KH (#14789)
|
6 miesięcy temu |
Ervin Áron Tasnádi
|
a979ca22db
ggml: adds CONV_2D op and direct GEMM Vulkan implementation (#14316)
|
6 miesięcy temu |
Georgi Gerganov
|
bf9087f59a
metal : fuse add, mul + add tests (#14596)
|
6 miesięcy temu |
Georgi Gerganov
|
225e7a1438
llama : add high-throughput mode (#14363)
|
6 miesięcy temu |
Tarek Dakhran
|
c31e60647d
tests : cover lfm2 cases in test_ssm_conv (#14651)
|
6 miesięcy temu |
Acly
|
3e303b1107
vulkan : implement ggml_roll (ggml/1290)
|
6 miesięcy temu |
Aman Gupta
|
11ee0fea2a
Docs: script to auto-generate ggml operations docs (#14598)
|
6 miesięcy temu |
compilade
|
a57d1bcb3c
cuda : support Falcon-H1 state size for SSM_SCAN (#14602)
|
6 miesięcy temu |
Xuan-Son Nguyen
|
98bab638fb
ggml : add ggml_scale_bias (#14417)
|
6 miesięcy temu |
Georgi Gerganov
|
4d0dcd4a06
cuda : fix rope with partial rotation and non-cont src (#14580)
|
6 miesięcy temu |
Jeff Bolz
|
e592be1575
vulkan: fix rms_norm+mul fusion (#14545)
|
6 miesięcy temu |
R0CKSTAR
|
b81510a7b7
test-backend-ops: add support for specifying output format (#14368)
|
6 miesięcy temu |
Johannes Gäßler
|
c8c4495b8d
ggml: backward pass for split swiglu (#14483)
|
6 miesięcy temu |
Georgi Gerganov
|
9067487c44
ggml : fix FA mask dim 2 and 3 (#14505)
|
6 miesięcy temu |