Commit History

Author SHA1 Message Date
  Johannes Gäßler b772bba42e CUDA: fixed cmake F16 option (#2471) 2 years ago
  Johannes Gäßler 0728c5a8b9 CUDA: mmq CLI option, fixed mmq build issues (#2453) 2 years ago
  slaren a113689571 ggml : add graph tensor allocator (#2411) 2 years ago
  Johannes Gäßler 11f3ca06b8 CUDA: Quantized matrix matrix multiplication (#2160) 2 years ago
  Cebtenzzre 6df1f5940f make : build with -Wmissing-prototypes (#2394) 2 years ago
  wzy 78a3d13424 flake : remove intel mkl from flake.nix due to missing files (#2277) 2 years ago
  wzy 45a1b07e9b flake : update flake.nix (#2270) 2 years ago
  wzy b1f4290953 cmake : install targets (#2256) 2 years ago
  Howard Su 4e7464ef88 FP16 is supported in CM=6.0 (#2177) 2 years ago
  Evan Miller 5656d10599 mpi : add support for distributed inference via MPI (#2099) 2 years ago
  clyang 3bbc1a11f0 ggml : fix buidling with Intel MKL but ask for "cblas.h" issue (#2104) (#2115) 2 years ago
  Johannes Gäßler 924dd22fd3 Quantized dot products for CUDA mul mat vec (#2067) 2 years ago
  Tobias Lütke 7ee76e45af Simple webchat for server (#1998) 2 years ago
  Daniel Drake b213227067 cmake : don't force -mcpu=native on aarch64 (#2063) 2 years ago
  Kawrakow 6769e944c7 k-quants : support for super-block size of 64 (#2001) 2 years ago
  Johannes Gäßler bbca06e269 cmake: revert CUDA arch default to 52, 61 if f16 (#1959) 2 years ago
  Georgi Gerganov 23fc5c219a cmake : fix trailing whitespaces 2 years ago
  Howard Su 1e3abfcef0 cmake : fix build shared ggml when CUDA is enabled (#1929) 2 years ago
  Johannes Gäßler 16b9cd1939 Convert vector to f16 for dequantize mul mat vec (#1913) 2 years ago
  Howard Su 57cd69460f cmake : add CUDA_ARCHITECTURES to new target ggml_static (#1917) 2 years ago
  Kerfuffle b4c6f46f17 Allow cmake to build ggml as a library (#1896) 2 years ago
  Zenix 13fe9d2d84 cmake : add auto detection of BLAS_INCLUDE_DIRS (#1886) 2 years ago
  Kawrakow 3d01122610 CUDA : faster k-quant dot kernels (#1862) 2 years ago
  Georgi Gerganov bed9275617 cmake : remove whitespaces 2 years ago
  Igor Okulist 3559433fec cmake : set include path for OpenBlas (#1830) 2 years ago
  Georgi Gerganov 4de0334f5c cmake : fix Metal build (close #1791) 2 years ago
  Andrei 303f5809f1 metal : fix issue with ggml-metal.metal path. Closes #1769 (#1782) 2 years ago
  johnson442 0035858273 k-quants : add missing compile definition to CMakeLists (#1748) 2 years ago
  Georgi Gerganov 5c64a0952e k-quants : allow to optionally disable at compile time (#1734) 2 years ago
  Kawrakow 99009e72f8 ggml : add SOTA 2,3,4,5,6 bit k-quantizations (#1684) 2 years ago