Charles Xu
|
c5d91a7400
ggml-cpu: Add CPU backend support for KleidiAI library (#11390)
|
11 months ago |
Weizhao Ouyang
|
198b1ec611
ggml-cpu: Fix duplicate MATMUL_INT8 (#11817)
|
11 months ago |
Sheldon Robinson
|
90e4dba461
Fix #11802: Compile bug - RegQueryValueExA changed to RegQueryValueEx (#11803)
|
11 months ago |
Johannes Gäßler
|
8137b4bb2b
CPU/CUDA: fix (GQA) mul mat back, add CUDA support (#11380)
|
1 year ago |
Johannes Gäßler
|
9c8dcefe17
CUDA: backwards pass for misc. ops, add tests (#11257)
|
1 year ago |
Johannes Gäßler
|
432df2d5f9
RoPE: fix back, CUDA support for back + noncont. (#11240)
|
1 year ago |
Diego Devesa
|
9177484f58
ggml : fix arm build (#10890)
|
1 year ago |
Georgi Gerganov
|
0006f5a74a
ggml : update ggml_backend_cpu_device_supports_op (#10867)
|
1 year ago |
Djip007
|
19d8762ab6
ggml : refactor online repacking (#10446)
|
1 year ago |
Diego Devesa
|
59f4db1088
ggml : add predefined list of CPU backend variants to build (#10626)
|
1 year ago |
Diego Devesa
|
7cc2d2c889
ggml : move AMX to the CPU backend (#10570)
|
1 year ago |
Shupei Fan
|
c202cef168
ggml-cpu: support IQ4_NL_4_4 by runtime repack (#10541)
|
1 year ago |
Diego Devesa
|
5931c1f233
ggml : add support for dynamic loading of backends (#10469)
|
1 year ago |
Charles Xu
|
1607a5e5b0
backend cpu: add online flow for aarch64 Q4_0 GEMV/GEMM kernels (#9921)
|
1 year ago |
Diego Devesa
|
ae8de6d50a
ggml : build backends as libraries (#10256)
|
1 year ago |