Commit History

Author SHA1 Message Date
  Kawrakow bd2d4e393b 1.5 bit quantization (#5453) 1 year ago
  Georgi Gerganov 8f1be0d42f ggml : add ALiBi support for ggml_soft_max_ext (#5488) 1 year ago
  Ananta Bastola 6e4e973b26 ci : add an option to fail on compile warning (#3952) 1 year ago
  Ian Bull f026f8120f metal : use autoreleasepool to avoid memory leaks (#5437) 2 years ago
  Georgi Gerganov efb7bdbbd0 metal : add im2col F32 dst support (#5132) 2 years ago
  Georgi Gerganov 549a1e6cd5 ci : fix yolo URLs + fix metal capture (ggml/712) 2 years ago
  Jack Mousseau 5f14ee0b0c metal : add debug capture backend function (ggml/694) 2 years ago
  Kawrakow f4d7e54974 SOTA 3-bit quants (#5196) 2 years ago
  slaren fbe7dfa53c ggml : add max buffer sizes to opencl and metal backends (#5181) 2 years ago
  Paul Tsochantaris d2f650cb5b metal : free metal objects (#5161) 2 years ago
  0cc4m 2307523d32 ggml : add Vulkan backend (#2059) 2 years ago
  Paul Tsochantaris 6dd3c28c9c metal : remove unused `n_buffers` and `buffers` (#5129) 2 years ago
  Georgi Gerganov ddc5a5033f metal : show compile log messages 2 years ago
  Georgi Gerganov 26d607608d metal : disable support for MUL_MAT F32 x F16 2 years ago
  Paul Tsochantaris 1e605f4102 metal : fix memory leak, dangling pointer and unused autorel (#5007) 2 years ago
  Georgi Gerganov c918fe8dca metal : create autorelease pool during library build (#4970) 2 years ago
  Paul Tsochantaris 7563293665 metal : remove unnecessary nil check (#4986) 2 years ago
  Paul Tsochantaris 158f8c9e21 metal : localized logic in `ggml_metal_graph_compute` (#4924) 2 years ago
  Alex Azarov 3a48d558a6 metal : replace loop of dispatch_async with dispatch_apply (#4934) 2 years ago
  Alex Azarov 7c8d3abd1a metal : log `recommendedMaxWorkingSetSize` on iOS 16+ (#4936) 2 years ago
  Justine Tunney a0b3ac8c48 ggml : introduce GGML_CALL function annotation (#4850) 2 years ago
  Alex Azarov 5f5fe1bd60 metal : correctly set SIMD support flags on iOS (#4923) 2 years ago
  Georgi Gerganov 4be5ef556d metal : remove old API (#4919) 2 years ago
  Georgi Gerganov 2d57de5255 metal : disable log for loaded kernels (#4794) 2 years ago
  Georgi Gerganov b38b5e93ae metal : refactor kernel loading code (#4794) 2 years ago
  slaren e7e4df031b llama : ggml-backend integration (#4766) 2 years ago
  Kawrakow 49662cbed3 ggml : SOTA 2-bit quants (add IQ2_XS) (#4856) 2 years ago
  Paul Tsochantaris 2a7c94db5f metal : put encoder debug group behind a define (#4873) 2 years ago
  Georgi Gerganov 3267c2abc7 metal : fix deprecation warning (ggml/690) 2 years ago
  Jack Mousseau 5362e43962 metal : wrap each operation in debug group (ggml/690) 2 years ago