Commit History

Autor SHA1 Mensaxe Data
  Georgi Gerganov 5c64a0952e k-quants : allow to optionally disable at compile time (#1734) %!s(int64=2) %!d(string=hai) anos
  Georgi Gerganov 2d43387daf ggml : fix builds, add ggml-quants-k.o (close #1712, close #1710) %!s(int64=2) %!d(string=hai) anos
  Kawrakow 99009e72f8 ggml : add SOTA 2,3,4,5,6 bit k-quantizations (#1684) %!s(int64=2) %!d(string=hai) anos
  Georgi Gerganov ecb217db4f llama : Metal inference (#1642) %!s(int64=2) %!d(string=hai) anos
  Johannes Gäßler 3b126f654f LLAMA_DEBUG adds debug symbols (#1617) %!s(int64=2) %!d(string=hai) anos
  Kerfuffle 0df7d63e5b Include server in releases + other build system cleanups (#1610) %!s(int64=2) %!d(string=hai) anos
  Johannes Gäßler 1fcdcc28b1 cuda : performance optimizations (#1530) %!s(int64=2) %!d(string=hai) anos
  0cc4m 2e6cd4b025 OpenCL Token Generation Acceleration (#1459) %!s(int64=2) %!d(string=hai) anos
  Stefan Sydow 7780e4f479 make : .PHONY clean (#1553) %!s(int64=2) %!d(string=hai) anos
  Zenix b8ee340abe feature : support blis and other blas implementation (#1536) %!s(int64=2) %!d(string=hai) anos
  Georgi Gerganov ea600071cb Revert "feature : add blis and other BLAS implementation support (#1502)" %!s(int64=2) %!d(string=hai) anos
  Zenix 07e9ace0f9 feature : add blis and other BLAS implementation support (#1502) %!s(int64=2) %!d(string=hai) anos
  sandyiscool 2a5ee023ad Add alternate include path for openblas (#1476) %!s(int64=2) %!d(string=hai) anos
  Georgi Gerganov bda4d7c215 make : fix PERF build with cuBLAS %!s(int64=2) %!d(string=hai) anos
  DaniAndTheWeb 173d0e6419 makefile: automatic Arch Linux detection (#1332) %!s(int64=2) %!d(string=hai) anos
  Ionoclast Laboratories 2d13786e91 Fix for OpenCL / clbast builds on macOS. (#1329) %!s(int64=2) %!d(string=hai) anos
  DannyDaemonic 55bc5f0900 Call sh on build-info.sh (#1294) %!s(int64=2) %!d(string=hai) anos
  DannyDaemonic f4cef87edf Add git-based build information for better issue tracking (#1232) %!s(int64=2) %!d(string=hai) anos
  Pavol Rusnak 6f79699286 build: add armv{6,7,8} support to cmake (#1251) %!s(int64=2) %!d(string=hai) anos
  Stephan Walter f0d70f147d Various fixes to mat_mul benchmark (#1253) %!s(int64=2) %!d(string=hai) anos
  Georgi Gerganov 214b6a3570 ggml : adjust mul_mat_f16 work memory (#1226) %!s(int64=2) %!d(string=hai) anos
  Georgi Gerganov 305eb5afd5 build : fix reference to old llama_util.h %!s(int64=2) %!d(string=hai) anos
  slaren 7fc50c051a cuBLAS: use host pinned memory and dequantize while copying (#1207) %!s(int64=2) %!d(string=hai) anos
  0cc4m 7296c961d9 ggml : add CLBlast support (#1164) %!s(int64=2) %!d(string=hai) anos
  Johannes Gäßler 92a6e13a31 Add Manjaro CUDA include and lib dirs to Makefile (#1212) %!s(int64=2) %!d(string=hai) anos
  slaren e4cf982e0d Fix cuda compilation (#1128) %!s(int64=2) %!d(string=hai) anos
  Georgi Gerganov e4422e299c ggml : better PERF prints + support "LLAMA_PERF=1 make" %!s(int64=2) %!d(string=hai) anos
  Georgi Gerganov 872c365a91 ggml : fix AVX build + update to new Q8_0 format %!s(int64=2) %!d(string=hai) anos
  slaren 50cb666b8a Improve cuBLAS performance by using a memory pool (#1094) %!s(int64=2) %!d(string=hai) anos
  slaren 2005469ea1 Add Q4_3 support to cuBLAS (#1086) %!s(int64=2) %!d(string=hai) anos