cturan/llama.cpp

Author	SHA1 Message	Date
Georgi Gerganov	2776db6c81 Revert "ggml-cpu: handle 3d tensors in repack mat_mul (#17030)" (#17233)	2 months ago
Alberto Cabrera Pérez	1c398dc9ec ggml-cpu: handle 3d tensors in repack mat_mul (#17030)	2 months ago
Noah	1f5accb8d0 Fix garbled output with REPACK at high thread counts (#16956)	2 months ago
Max Krasnyansky	517b7170e1 cpu: introduce chunking for repack matmuls and enable matmul-id chunking on ARM64 (#16833)	2 months ago
Georgi Gerganov	00f35d509e ggml : repack block_iq4_nlx8 (#14904)	5 months ago
Srihari-mcw	baad94885d ggml : Q2k interleaving implementation - x86/x64 SIMD (#14373)	5 months ago
Daniel Bevenius	5592f278b6 ggml-cpu : remove stdlib include from repack.cpp (ggml/1276)	6 months ago
Aaron Teo	60ef23d6c1 ggml-cpu: enable IBM NNPA Vector Intrinsics (#14317)	7 months ago
Christian Kastner	6369be0735 Implement GGML_CPU_ALL_VARIANTS for PowerPC (#14286)	7 months ago
Georgi Gerganov	d27b3ca175 ggml : fix repack work size for mul_mat_id (#14292)	7 months ago
xctan	860a9e4eef ggml-cpu : remove the weak alias trick (#14221)	7 months ago
xctan	3555b3004b ggml-cpu : rework weak alias on apple targets (#14146)	7 months ago
xctan	f470bc36be ggml-cpu : split arch-specific implementations (#13892)	7 months ago