Georgi Gerganov
|
2776db6c81
Revert "ggml-cpu: handle 3d tensors in repack mat_mul (#17030)" (#17233)
|
2 months ago |
Alberto Cabrera Pérez
|
1c398dc9ec
ggml-cpu: handle 3d tensors in repack mat_mul (#17030)
|
2 months ago |
Noah
|
1f5accb8d0
Fix garbled output with REPACK at high thread counts (#16956)
|
2 months ago |
Max Krasnyansky
|
517b7170e1
cpu: introduce chunking for repack matmuls and enable matmul-id chunking on ARM64 (#16833)
|
2 months ago |
Georgi Gerganov
|
00f35d509e
ggml : repack block_iq4_nlx8 (#14904)
|
5 months ago |
Srihari-mcw
|
baad94885d
ggml : Q2k interleaving implementation - x86/x64 SIMD (#14373)
|
5 months ago |
Daniel Bevenius
|
5592f278b6
ggml-cpu : remove stdlib include from repack.cpp (ggml/1276)
|
6 months ago |
Aaron Teo
|
60ef23d6c1
ggml-cpu: enable IBM NNPA Vector Intrinsics (#14317)
|
7 months ago |
Christian Kastner
|
6369be0735
Implement GGML_CPU_ALL_VARIANTS for PowerPC (#14286)
|
7 months ago |
Georgi Gerganov
|
d27b3ca175
ggml : fix repack work size for mul_mat_id (#14292)
|
7 months ago |
xctan
|
860a9e4eef
ggml-cpu : remove the weak alias trick (#14221)
|
7 months ago |
xctan
|
3555b3004b
ggml-cpu : rework weak alias on apple targets (#14146)
|
7 months ago |
xctan
|
f470bc36be
ggml-cpu : split arch-specific implementations (#13892)
|
7 months ago |