Neo Zhang Jianyu
|
de17e3f745
fix memcpy() crash, add missed cmd in guide, fix softmax (#6622)
|
1 year ago |
Abhilash Majumder
|
87fb5b4234
remove row=1 cond (#6532)
|
1 year ago |
Neo Zhang Jianyu
|
d4f220a5cc
support/fix OPs GGML_TYPE_IQ4_NL, GGML_TYPE_IQ4_XS, GGML_TYPE_IQ3_XXS, GGML_TYPE_IQ3_S, GGML_TYPE_IQ2_XXS, GGML_TYPE_IQ2_XS, GGML_TYPE_IQ2_S, GGML_TYPE_IQ1_S, GGML_TYPE_IQ1_M (#6521)
|
1 year ago |
Ouadie EL FAROUKI
|
1b496a745c
[SYCL] Fixed minor bug when enabling FP16 for non intel targets (#6464)
|
1 year ago |
Meng, Hengyu
|
52604860f9
[SYCL] Disable iqx on windows as WA (#6435)
|
1 year ago |
Neo Zhang Jianyu
|
25f4a613c4
[SYCL] fix set main gpu crash (#6339)
|
1 year ago |
AidanBeltonS
|
e82f9e2b83
[SYCL] Fix batched impl for NVidia GPU (#6164)
|
1 year ago |
compilade
|
557410b8f0
llama : greatly reduce output buffer memory usage (#6122)
|
1 year ago |
Meng, Hengyu
|
ddf6568510
[SYCL] offload op (#6217)
|
1 year ago |
AidanBeltonS
|
c5b8595e3f
Add nvidia and amd backends (#6157)
|
1 year ago |
slaren
|
2bf8d0f7c4
backend : offload large batches to GPU (#6083)
|
1 year ago |
Neo Zhang Jianyu
|
46acb36767
fix set main gpu error (#6073)
|
1 year ago |
AidanBeltonS
|
753e36f650
[SYCL] Fix non-intel device selection (#6042)
|
1 year ago |
slaren
|
f30ea47a87
llama : add pipeline parallelism support (#6017)
|
1 year ago |
AidanBeltonS
|
b3d978600f
Update get version (#6025)
|
1 year ago |
Georgi Gerganov
|
8030da7afe
ggml : reuse quantum structs across backends (#5943)
|
1 year ago |
Georgi Gerganov
|
48358b2e5b
sycl : update IQ1_S kernels (WIP - not working!) (#5995)
|
1 year ago |
Abhilash Majumder
|
ef3ced26a3
[SYCL] Add q3_s and q1_s (#5886)
|
1 year ago |
Georgi Gerganov
|
8a3012a4ad
ggml : add ggml-common.h to deduplicate shared code (#5940)
|
1 year ago |
Neo Zhang Jianyu
|
89fb735fcf
Revert "[SYCL] fix error when set main gpu to non-zero (#5901)" (#5918)
|
1 year ago |
Neo Zhang Jianyu
|
ceca1aef07
[SYCL] fix error when set main gpu to non-zero (#5901)
|
1 year ago |
Neo Zhang Jianyu
|
8ced9f7e32
add wait() to make code stable (#5895)
|
1 year ago |
Neo Zhang Jianyu
|
21b0867433
[SYCL] fix mul_mat fault in CI/unit-test (#5862)
|
1 year ago |
Michael Podvitskiy
|
9fa2627347
ggml : introduce ggml_status (ggml/750)
|
1 year ago |
Neo Zhang Jianyu
|
715641391d
Support multiple GPUs (split mode) on SYCL backend (#5806)
|
1 year ago |
AidanBeltonS
|
38d1521608
[SYCL] Use batched mul_mat pathway (#5591)
|
1 year ago |
UEXTM.com
|
5f70671856
Introduce backend GUIDs (ggml/743)
|
1 year ago |
AidanBeltonS
|
e849078c6e
[SYCL] Add support for soft_max ALiBi (#5639)
|
1 year ago |
Georgi Gerganov
|
ab336a9d5e
code : normalize enum names (#5697)
|
1 year ago |
Meng, Hengyu
|
88c46cbdac
[SYCL] conext add name (#5624)
|
1 year ago |