Aaron Teo
|
186415d595
ggml-cpu: drop support for nnpa intrinsics (#15821)
|
4 months ago |
AN Long
|
cd6983d56d
ggml : fix field name when new ggml_backend (#14944)
|
5 months ago |
Diego Devesa
|
0d8831543c
ggml : fix fallback to CPU for ununsupported ops (#15118)
|
5 months ago |
Radoslav Gerganov
|
8d94219a4a
ggml : add ggml_set_rows (#14274)
|
7 months ago |
Aaron Teo
|
60ef23d6c1
ggml-cpu: enable IBM NNPA Vector Intrinsics (#14317)
|
7 months ago |
xctan
|
f470bc36be
ggml-cpu : split arch-specific implementations (#13892)
|
7 months ago |
Diego Devesa
|
9fdfcdaedd
rpc : use backend registry, support dl backends (#13304)
|
8 months ago |
cmdr2
|
a25355e264
cpu: fix cpu backend's supports-op for GET_ROWS_BACK. fixes a fatal when running test-backend-ops with only the CPU backend (ggml/1190)
|
9 months ago |
Rémy O
|
07d1572347
ggml-cpu: Faster IQ1 mul_mat_vec on AVX2 using BMI2 instructions (#12154)
|
10 months ago |
Aaron Teo
|
af7747c95a
ggml-cpu: Support s390x SIMD Instruction Set (#12019)
|
11 months ago |
Charles Xu
|
c5d91a7400
ggml-cpu: Add CPU backend support for KleidiAI library (#11390)
|
11 months ago |
Weizhao Ouyang
|
198b1ec611
ggml-cpu: Fix duplicate MATMUL_INT8 (#11817)
|
11 months ago |
Sheldon Robinson
|
90e4dba461
Fix #11802: Compile bug - RegQueryValueExA changed to RegQueryValueEx (#11803)
|
11 months ago |
Johannes Gäßler
|
8137b4bb2b
CPU/CUDA: fix (GQA) mul mat back, add CUDA support (#11380)
|
1 year ago |
Johannes Gäßler
|
9c8dcefe17
CUDA: backwards pass for misc. ops, add tests (#11257)
|
1 year ago |
Johannes Gäßler
|
432df2d5f9
RoPE: fix back, CUDA support for back + noncont. (#11240)
|
1 year ago |
Diego Devesa
|
9177484f58
ggml : fix arm build (#10890)
|
1 year ago |
Georgi Gerganov
|
0006f5a74a
ggml : update ggml_backend_cpu_device_supports_op (#10867)
|
1 year ago |
Djip007
|
19d8762ab6
ggml : refactor online repacking (#10446)
|
1 year ago |
Diego Devesa
|
59f4db1088
ggml : add predefined list of CPU backend variants to build (#10626)
|
1 year ago |
Diego Devesa
|
7cc2d2c889
ggml : move AMX to the CPU backend (#10570)
|
1 year ago |
Shupei Fan
|
c202cef168
ggml-cpu: support IQ4_NL_4_4 by runtime repack (#10541)
|
1 year ago |
Diego Devesa
|
5931c1f233
ggml : add support for dynamic loading of backends (#10469)
|
1 year ago |
Charles Xu
|
1607a5e5b0
backend cpu: add online flow for aarch64 Q4_0 GEMV/GEMM kernels (#9921)
|
1 year ago |
Diego Devesa
|
ae8de6d50a
ggml : build backends as libraries (#10256)
|
1 year ago |