Tarek Dakhran
|
c31e60647d
tests : cover lfm2 cases in test_ssm_conv (#14651)
|
6 meses atrás |
Acly
|
3e303b1107
vulkan : implement ggml_roll (ggml/1290)
|
6 meses atrás |
Aman Gupta
|
11ee0fea2a
Docs: script to auto-generate ggml operations docs (#14598)
|
6 meses atrás |
compilade
|
a57d1bcb3c
cuda : support Falcon-H1 state size for SSM_SCAN (#14602)
|
6 meses atrás |
Xuan-Son Nguyen
|
98bab638fb
ggml : add ggml_scale_bias (#14417)
|
6 meses atrás |
Georgi Gerganov
|
4d0dcd4a06
cuda : fix rope with partial rotation and non-cont src (#14580)
|
6 meses atrás |
Jeff Bolz
|
e592be1575
vulkan: fix rms_norm+mul fusion (#14545)
|
6 meses atrás |
R0CKSTAR
|
b81510a7b7
test-backend-ops: add support for specifying output format (#14368)
|
6 meses atrás |
Johannes Gäßler
|
c8c4495b8d
ggml: backward pass for split swiglu (#14483)
|
6 meses atrás |
Georgi Gerganov
|
9067487c44
ggml : fix FA mask dim 2 and 3 (#14505)
|
6 meses atrás |
Aman Gupta
|
55c2646b45
CUDA: add dynamic shared mem to softmax, refactor general usage (#14497)
|
6 meses atrás |
compilade
|
5d46babdc2
llama : initial Mamba-2 support (#9126)
|
6 meses atrás |
Georgi Gerganov
|
ec68e84c32
ggml : support bcast ggml_soft_max_ext, ggml_flash_attn_ext (#14435)
|
6 meses atrás |
Jeff Bolz
|
6a746cf9c4
vulkan: Split large mul_mat_id to fit in shared memory (#14451)
|
6 meses atrás |
Acly
|
431b2c24f3
ggml-cpu : "align corners" for bilinear upscale/downscale (ggml/1285)
|
6 meses atrás |
Diego Devesa
|
eb3fa2913e
test-backend-ops : disable llama test (#14461)
|
6 meses atrás |
Sigbjørn Skjæret
|
a0535ffa0d
ggml : implement REGLU/GEGLU/SWIGLU ops (#14158)
|
6 meses atrás |
Jeff Bolz
|
bd9c981d72
vulkan: Add fusion support for RMS_NORM+MUL (#14366)
|
6 meses atrás |
Aman Gupta
|
27208bf657
CUDA: add bf16 and f32 support to cublas_mul_mat_batched (#14361)
|
6 meses atrás |
Radoslav Gerganov
|
8d94219a4a
ggml : add ggml_set_rows (#14274)
|
6 meses atrás |
Georgi Gerganov
|
e8215dbb96
metal : add special-case mat-vec mul for ne00 == 4 (#14385)
|
6 meses atrás |
Aman Gupta
|
aa064b2eb7
CUDA: add mean operation (#14313)
|
6 meses atrás |
Aman Gupta
|
c959f462a0
CUDA: add conv_2d_transpose (#14287)
|
7 meses atrás |
Ervin Áron Tasnádi
|
0d3984424f
ggml-vulkan: adds support for op CONV_TRANSPOSE_1D (#13813)
|
7 meses atrás |
Johannes Gäßler
|
10d2af0eaa
llama/ggml: add LLM training support (#10544)
|
8 meses atrás |
Georgi Gerganov
|
b34443923c
sync : ggml (#13268)
|
8 meses atrás |
Johannes Gäßler
|
b0ecbd434b
test: non-cont. b in test-backend-ops -o MUL_MAT (#13187)
|
8 meses atrás |
Johannes Gäßler
|
e1e8e0991f
CUDA: batched+noncont MMQ, refactor bs>1 MoE code (#13199)
|
8 meses atrás |
Xuan-Son Nguyen
|
edb18b6e8f
clip : fix pixtral on some GPU backends (#13097)
|
8 meses atrás |
Johannes Gäßler
|
658987cfc9
CUDA: noncont MMVQ + batched bs1 MUL_MAT_ID (#13014)
|
8 meses atrás |