Aman Gupta
|
4926419c4d
ggml: add ggml_can_fuse_subgraph (#16662)
|
2 月之前 |
lhez
|
6ea37f5739
opencl: fix warnings and clean up profiling (#16688)
|
2 月之前 |
Jeff Bolz
|
fb349848f3
vulkan: Handle FA with all -inf mask values (#16447)
|
2 月之前 |
YehuditE
|
6de8ed7519
sycl : add PAD_REFLECT_D1 operator support (#16145)
|
2 月之前 |
Sigbjørn Skjæret
|
84bf3c6778
model : add BailingMoeV2 support (#16063)
|
2 月之前 |
Aleksander Grygier
|
c9c1972e2c
Handle legacy 'context' attachments (#16687)
|
2 月之前 |
Diego Devesa
|
b617cfd289
ggml-alloc : fix leak when reusing a tensor with a larger size (#16679)
|
2 月之前 |
Aleksander Grygier
|
79068501fa
Prevent premature submission on IME input (#16673)
|
2 月之前 |
Aleksander Grygier
|
0e4a0cf2fa
Import/Export UX improvements (#16619)
|
2 月之前 |
Aleksander Grygier
|
13f2cfad41
Enable per-conversation loading states to allow having parallel conversations (#16327)
|
2 月之前 |
takuya kodama
|
06332e2867
llama-batch: fix build fails with `-Werror=missing-braces` (#16614)
|
2 月之前 |
Ron Evans
|
72d53e6c4d
readme: update bindings (#16651)
|
2 月之前 |
safranowith
|
2330de7b84
SYCL: Add support for FLOOR,CEIL,ROUND and TRUNC unary operators (#16613)
|
2 月之前 |
takuya kodama
|
7062dd8460
llama-context: only warn on pooling_type when user specified (#16674)
|
2 月之前 |
Giuseppe Scrivano
|
0398752dd4
model : add Granite Hybrid types (#16635)
|
2 月之前 |
Aaron Teo
|
4f73d0a951
ci : fix binaries release failure for s390x (binaries may not work yet) (#16664)
|
2 月之前 |
Sigbjørn Skjæret
|
cec5edbcae
ci : avoid manual updates of docs/ops.md (#16663)
|
3 月之前 |
Aaron Teo
|
fcb235b466
ci: include s390x release binaries (#16648)
|
3 月之前 |
Aman Gupta
|
55754bebd5
CODEOWNERS: update for ggml-cuda/mmf (#16660)
|
3 月之前 |
Johannes Gäßler
|
ee09828cb0
HIP: fix GPU_TARGETS (#16642)
|
3 月之前 |
Jeff Bolz
|
e56abd2098
vulkan: Implement topk_moe fused shader, ported from CUDA (#16641)
|
3 月之前 |
Aman Gupta
|
38355c6c8e
CUDA: use registers instead of smem in topk-moe (#16647)
|
3 月之前 |
Shawn Gu
|
81387858f1
opencl: transposed gemm/gemv moe kernel with mxfp4,f32 (#16602)
|
3 月之前 |
Johannes Gäßler
|
66b0dbcb2d
llama-model: fix insonsistent ctxs <-> bufs order (#16581)
|
3 月之前 |
Radoslav Gerganov
|
41386cf365
rpc : report actual free memory (#16616)
|
3 月之前 |
Giuseppe Scrivano
|
3d4e86bbeb
vulkan: Add State Space Model (SSM) Operations Support (#16463)
|
3 月之前 |
muggle-stack
|
342c728d03
ggml : fix SpaceMit IME array out-of-bounds in task assignment (#16629)
|
3 月之前 |
Pascal
|
ababae7e1e
webui: reorganize settings layout (#16607)
|
3 月之前 |
Jeff Bolz
|
b19491599d
vulkan: fix debug build (add_rms_len/data not found) (#16624)
|
3 月之前 |
Ilia Ilmer
|
9ad4f1931e
metal : add `CONV_TRANSPOSE_2D` (#16542)
|
3 月之前 |