Commit History

Author SHA1 Message Date
  Georgi Gerganov 2776db6c81 Revert "ggml-cpu: handle 3d tensors in repack mat_mul (#17030)" (#17233) 2 months ago
  Alberto Cabrera Pérez 1c398dc9ec ggml-cpu: handle 3d tensors in repack mat_mul (#17030) 2 months ago
  Noah 1f5accb8d0 Fix garbled output with REPACK at high thread counts (#16956) 2 months ago
  Max Krasnyansky 517b7170e1 cpu: introduce chunking for repack matmuls and enable matmul-id chunking on ARM64 (#16833) 2 months ago
  Georgi Gerganov 00f35d509e ggml : repack block_iq4_nlx8 (#14904) 5 months ago
  Srihari-mcw baad94885d ggml : Q2k interleaving implementation - x86/x64 SIMD (#14373) 5 months ago
  Daniel Bevenius 5592f278b6 ggml-cpu : remove stdlib include from repack.cpp (ggml/1276) 6 months ago
  Aaron Teo 60ef23d6c1 ggml-cpu: enable IBM NNPA Vector Intrinsics (#14317) 7 months ago
  Christian Kastner 6369be0735 Implement GGML_CPU_ALL_VARIANTS for PowerPC (#14286) 7 months ago
  Georgi Gerganov d27b3ca175 ggml : fix repack work size for mul_mat_id (#14292) 7 months ago
  xctan 860a9e4eef ggml-cpu : remove the weak alias trick (#14221) 7 months ago
  xctan 3555b3004b ggml-cpu : rework weak alias on apple targets (#14146) 7 months ago
  xctan f470bc36be ggml-cpu : split arch-specific implementations (#13892) 7 months ago