Georgi Gerganov
|
1e15bfd42c
graph : fix stack-use-after-return (#14960)
|
6 months ago |
Douglas Hanley
|
a118d80233
embeddings: fix extraction of CLS pooling results (#14927)
|
6 months ago |
Xinpeng Dou
|
61550f8231
CANN: update ops docs (#14935)
|
6 months ago |
uvos
|
aa79524c51
HIP: remove the use of __HIP_PLATFORM_AMD__, explicitly support only AMD targets (#14945)
|
6 months ago |
uvos
|
b77d11179d
HIP: add GGML_HIP_MMQ_MFMA option to allow disableing the MFMA path. (#14930)
|
6 months ago |
uvos
|
c7aa1364fd
HIP: Ignore unsupported unroll transformation in fattn-vec (#14931)
|
6 months ago |
kallewoof
|
1a67fcc306
common : avoid logging partial messages (which can contain broken UTF-8 sequences) (#14937)
|
6 months ago |
hipudding
|
204f2cf168
CANN: Add ggml_set_rows (#14943)
|
6 months ago |
Sigbjørn Skjæret
|
138b288b59
cuda : add softcap fusion (#14907)
|
6 months ago |
Johannes Gäßler
|
bbd0f91779
server-bench: make seed choice configurable (#14929)
|
6 months ago |
Aman Gupta
|
0a5036bee9
CUDA: add roll (#14919)
|
6 months ago |
lhez
|
8ad7b3e65b
opencl : add ops docs (#14910)
|
6 months ago |
Leonard Mosescu
|
bda62193b2
test-backend-ops : extend test case filtering (#14865)
|
6 months ago |
Radoslav Gerganov
|
c556418b60
llama-bench : use local GPUs along with RPC servers (#14917)
|
6 months ago |
xctan
|
db16e2831c
ggml-cpu : deduplicate scalar implementations (#14897)
|
6 months ago |
Akarshan Biswas
|
cd1fce6d4f
SYCL: Add set_rows support for quantized types (#14883)
|
6 months ago |
Xuan-Son Nguyen
|
00fa15fedc
mtmd : add support for Voxtral (#14862)
|
6 months ago |
Johannes Gäßler
|
946b1f6859
CUDA: fix pointer incrementation in FA (#14916)
|
6 months ago |
Dongliang Wei
|
6c6e397aff
model : add support for SmallThinker series (#14898)
|
6 months ago |
Alberto Cabrera Pérez
|
afc0e89698
sycl: refactor quantization to q8_1 (#14815)
|
6 months ago |
Georgi Gerganov
|
a5771c9eea
ops : update BLAS (#14914)
|
6 months ago |
Georgi Gerganov
|
c35f9eaf09
ops : update Metal (#14912)
|
6 months ago |
Georgi Gerganov
|
1f45f2890e
sync : ggml
|
6 months ago |
Kai Pastor
|
613c5095c3
cmake : Indent ggml-config.cmake (ggml/1310)
|
6 months ago |
Ed Addario
|
7f97599581
quantize : update README.md (#14905)
|
6 months ago |
Ruben Ortlam
|
bf78f5439e
vulkan: add ops docs (#14900)
|
6 months ago |
Akarshan Biswas
|
bbfc849274
SYCL: add ops doc (#14901)
|
6 months ago |
Daniel Bevenius
|
ca0ef2dddb
llama : clarify comment about pp and tg graphs [no ci] (#14895)
|
6 months ago |
Erik Scholz
|
89d1029559
vulkan : add fp16 support for the conv_2d kernel (#14872)
|
6 months ago |
Jeff Bolz
|
f1a4e72de5
vulkan: skip empty set_rows to avoid invalid API usage (#14860)
|
6 months ago |