Ruben Ortlam
|
4dca015b7e
vulkan: Replace 16-bit unpack8 calls to work around legacy Windows AMD driver bug (#17285)
|
2 months ago |
Sigbjørn Skjæret
|
9a8860cf5d
convert : use all parts in safetensors index (#17286)
|
2 months ago |
Sigbjørn Skjæret
|
9d3ef4809f
convert : set expert gating func in base class (#17279)
|
2 months ago |
Ankur Verma
|
c7b7db0445
mtmd-cli: Avoid logging to stdout for model loading messages in mtmd-cli (#17277)
|
2 months ago |
Giuseppe Scrivano
|
1568d13c2c
vulkan: implement ABS and NEG (#17245)
|
2 months ago |
Jeff Bolz
|
439342ea0b
vulkan: Use ggml_vk_tensor_subbuffer in mul_mat_vec(id) paths (#17244)
|
2 months ago |
Jeff Bolz
|
234ae7d7bd
vulkan: skip all-negative-inf blocks in FA (#17186)
|
2 months ago |
Jeff Bolz
|
38eaf32af1
vulkan: change graph_compute to be async and enable get_tensor_async (#17158)
|
2 months ago |
Xuan-Son Nguyen
|
9b17d74ab7
mtmd: add mtmd_log_set (#17268)
|
2 months ago |
Bartowski
|
e1fcf8b09b
model : add AfmoeForCausalLM support (#16477)
|
2 months ago |
Marek Hradil jr.
|
6cd0cf72ce
fix : Dangling pointer for non-empty trigger words in lazy grammar construction (#17048)
|
2 months ago |
Georgi Gerganov
|
d396b43748
server : fix "can batch with" bug (#17263)
|
2 months ago |
Georgi Gerganov
|
45c6ef7307
metal : support argsort for ne00 > 1024 (#17247)
|
2 months ago |
Georgi Gerganov
|
2606b0adab
metal : make the FA extra sizes consistent (#17143)
|
2 months ago |
ixgbe
|
307772fcda
readme : add RVV,ZVFH,ZFH,ZICBOP support for RISC-V (#17259)
|
2 months ago |
Aleksander Grygier
|
f1bad23f88
Better UX for handling multiple attachments in WebUI (#17246)
|
2 months ago |
Alberto Cabrera Pérez
|
becc4816dd
ggml-cpu: handle 3d tensors in repack mat_mul (#17241)
|
2 months ago |
Xuan-Son Nguyen
|
c4abcb2457
server: fixing naming conflict res_error (#17243)
|
2 months ago |
Piotr Wilkin (ilintar)
|
389ac78b26
ggml : add ops SOFTPLUS, EXPM1, TRI, SOLVE_TRI, CUMSUM (#17063)
|
2 months ago |
Ruben Ortlam
|
a19bd6f7ce
vulkan: remove shell call from vulkan-shaders-gen tool, revert file check (#17219)
|
2 months ago |
Diego Devesa
|
dd091e52f8
sched : fix reserve ignoring user tensor assignments (#17232)
|
2 months ago |
ixgbe
|
1215dde7b0
ggml-cpu : add RISC-V vector intrinsic support for silu and cvar operations (#17227)
|
2 months ago |
bagheera
|
0cfb19166b
metal: accelerated conv2d (#17175)
|
2 months ago |
Georgi Gerganov
|
2776db6c81
Revert "ggml-cpu: handle 3d tensors in repack mat_mul (#17030)" (#17233)
|
2 months ago |
Diego Devesa
|
879dec341a
ggml-cpu : use template for argsort (#17222)
|
2 months ago |
TecJesh
|
97d5117217
CANN: Add cross_entropy_loss op support (#16886)
|
2 months ago |
Aman Gupta
|
a90eb94ca9
CUDA: fuse rope + set_rows (#16884)
|
2 months ago |
Neo Zhang Jianyu
|
07751f8d44
update SYCL support OPs (#17208)
|
2 months ago |
o7si
|
ffb6f3d921
vocab : correct bounds check for UGM XCDA array access (#17215)
|
2 months ago |
Johannes Gäßler
|
5d6838b74f
CUDA: static assert to prevent misuse of memcpy_1 (#17198)
|
2 months ago |