Commit History

Author | SHA1 | Message | Date
Georgi Gerganov | 4bfcc855ab | metal : parallel command buffer encoding (#1860) | 2 years ago
Kawrakow | 74a6d922f1 | Metal implementation for all k_quants (#1807) | 2 years ago
Kawrakow | 8c0a10e64d | metal : fix failure to load model (#1817) | 2 years ago
Andrei | 303f5809f1 | metal : fix issue with ggml-metal.metal path. Closes #1769 (#1782) | 2 years ago
Kawrakow | e9b66ee982 | metal : add Q4_1 implementation (#1785) | 2 years ago
AT | 92f44ff7f7 | metal : add GELU implementation (#1770) | 2 years ago
Kawrakow | 245fc3c37d | metal : faster q4_0 (#1775) | 2 years ago
Kawrakow | 72ff5282bf | metal : add Q2_K implementation (#1762) | 2 years ago
Kawrakow | 0f291e1f65 | metal : Q6_K implementation (#1752) | 2 years ago
Kawrakow | 4161bdc04d | metal : add Q4_K implementation (#1733) | 2 years ago
Georgi Gerganov | 44f906e853 | metal : add f16 support | 2 years ago
Spencer Sutton | 590250f7a9 | metal : add checks for buffer size (#1706) | 2 years ago
kiltyj | 9d0693bce3 | metal : use shared buffers between CPU and GPU (#1696) | 2 years ago
Georgi Gerganov | ecb217db4f | llama : Metal inference (#1642) | 2 years ago