Commit History

| Author | SHA1 | Message | Date |
| --- | --- | --- | --- |
| Rickard Hallerbäck | dc6897404e | metal : reusing llama.cpp logging (#3152) | 2 years ago |
| Georgi Gerganov | 8c00b7a6ff | sync : ggml (Metal F32 support + reduce ggml-alloc size) (#3192) | 2 years ago |
| Georgi Gerganov | a51b687657 | metal : relax conditions on fast matrix multiplication kernel (#3168) | 2 years ago |
| Kawrakow | f31b6f4e2d | metal : PP speedup (#3084) | 2 years ago |
| kchro3 | 21ac3a1503 | metal : support for Swift (#3078) | 2 years ago |
| Jhen-Jie Hong | 4fd5477955 | metal : support build for iOS/tvOS (#3089) | 2 years ago |
| Kawrakow | be8c9c245b | metal : parallel RoPE on Metal (#3024) | 2 years ago |
| Przemysław Pawełczyk | fec2fb19e4 | ggml : posixify madvise and pagesize (#3037) | 2 years ago |
| Kawrakow | ca82cf7bac | metal : more optimizations (#2959) | 2 years ago |
| Karsten Weiss | 8b56b4f2c3 | metal : show all Metal device instances in the system (#2952) | 2 years ago |
| Georgi Gerganov | 13268c5331 | metal : slight speed-up for add and mul kernels (#2917) | 2 years ago |
| Kawrakow | e8d9158925 | metal: somewhat faster f16 x f32 matrix multiply kernel (#2951) | 2 years ago |
| Georgi Gerganov | 3a007648f2 | metal : add option to disable debug logs (close #2764) | 2 years ago |
| Georgi Gerganov | f55538c3cc | metal : fix memory leak (#2762) | 2 years ago |
| Georgi Gerganov | d67777c202 | metal : add Q8_0 support (#2763) | 2 years ago |
| Georgi Gerganov | cf658adc83 | llm : add Falcon support (#2717) | 2 years ago |
| Georgi Gerganov | 6381d4e110 | gguf : new file format with flexible meta data (beta) (#2398) | 2 years ago |
| Jhen-Jie Hong | ed53db86c3 | metal : print error of load pipeline state (#2564) | 2 years ago |
| Shouzheng Liu | fc8ef549e5 | metal : enable ggml-alloc (#2627) | 2 years ago |
| Shouzheng Liu | bf83bff674 | metal : matrix-matrix multiplication kernel (#2615) | 2 years ago |
| Jhen-Jie Hong | d783f7982e | metal : return null instead of exit(1) (#2573) | 2 years ago |
| Georgi Gerganov | f6f9896ac3 | metal : fix out-of-bounds access + inc concurrency nodes (#2416) | 2 years ago |
| Matteo Boschini | 1873ff586b | metal : add gqa8 kernel to allow llama-2-70B on metal (#2459) | 2 years ago |
| Shouzheng Liu | 1aa18ef994 | metal : concurrently dispatch commands (#2358) | 2 years ago |
| slaren | 41c674161f | make rms_norm_eps a parameter (#2374) | 2 years ago |
| Georgi Gerganov | 5b2b2dc6ae | ggml : sync (unary ops refactor, static-correctness) (#2370) | 2 years ago |
| slaren | 95a6c595e7 | ggml: move op parameters from tensors to ggml_tensor::op_params (#2333) | 2 years ago |
| Jiahao Li | 83a00ce69b | metal : support bcast add & dup & cont op (#2323) | 2 years ago |
| Kawrakow | 4d76a5f49b | Faster Q3_K implementation on Metal (#2307) | 2 years ago |
| Kawrakow | e68c96f7fe | Faster Q2_K on Metal (#2297) | 2 years ago |