Commit History

Autor SHA1 Mensaxe Data
  Aarni Koskela b3f138d058 Chat UI extras (#2366) %!s(int64=2) %!d(string=hai) anos
  Evan Jones 84e09a7d8b llama : add grammar-based sampling (#1773) %!s(int64=2) %!d(string=hai) anos
  Jose Maldonado 91171b8072 make : fix CLBLAST compile support in FreeBSD (#2331) %!s(int64=2) %!d(string=hai) anos
  Jose Maldonado 73643f5fb1 gitignore : changes for Poetry users + chat examples (#2284) %!s(int64=2) %!d(string=hai) anos
  Georgi Gerganov a814d04f81 make : fix indentation %!s(int64=2) %!d(string=hai) anos
  Sky Yan 42c7c2e2e9 make : support customized LLAMA_CUDA_NVCC and LLAMA_CUDA_CCBIN (#2275) %!s(int64=2) %!d(string=hai) anos
  Jiří Podivín 54e3bc76fe make : add new target for test binaries (#2244) %!s(int64=2) %!d(string=hai) anos
  Przemysław Pawełczyk 9cf022a188 make : fix embdinput library and server examples building on MSYS2 (#2235) %!s(int64=2) %!d(string=hai) anos
  wzy 7dabc66f3c make : use pkg-config for OpenBLAS (#2222) %!s(int64=2) %!d(string=hai) anos
  James Reynolds 229aab351c make : fix combination of LLAMA_METAL and LLAMA_MPI (#2208) %!s(int64=2) %!d(string=hai) anos
  Evan Miller 5656d10599 mpi : add support for distributed inference via MPI (#2099) %!s(int64=2) %!d(string=hai) anos
  dylan 84525e7962 docker : add support for CUDA in docker (#1461) %!s(int64=2) %!d(string=hai) anos
  Johannes Gäßler 924dd22fd3 Quantized dot products for CUDA mul mat vec (#2067) %!s(int64=2) %!d(string=hai) anos
  Henri Vasserman acc111caf9 Allow old Make to build server. (#2098) %!s(int64=2) %!d(string=hai) anos
  ZhouYuChen 23c7c6fc91 Update Makefile: clean simple (#2097) %!s(int64=2) %!d(string=hai) anos
  ningshanwutuobang cfa0750bc9 llama : support input embeddings directly (#1910) %!s(int64=2) %!d(string=hai) anos
  Kawrakow 6769e944c7 k-quants : support for super-block size of 64 (#2001) %!s(int64=2) %!d(string=hai) anos
  Johannes Gäßler 16b9cd1939 Convert vector to f16 for dequantize mul mat vec (#1913) %!s(int64=2) %!d(string=hai) anos
  Georgi Gerganov ce2c7d72e2 metal : handle buffers larger than device's maxBufferLength (#1826) %!s(int64=2) %!d(string=hai) anos
  Georgi Gerganov b2416493ab make : do not print help for simple example %!s(int64=2) %!d(string=hai) anos
  DaniAndTheWeb 86c7571864 make : update for latest Arch (#1701) %!s(int64=2) %!d(string=hai) anos
  Randall Fitzgerald 794db3e7b9 Server Example Refactor and Improvements (#1570) %!s(int64=2) %!d(string=hai) anos
  SuperUserNameMan b41b4cad6f examples : add "simple" (#1840) %!s(int64=2) %!d(string=hai) anos
  Kawrakow 3d01122610 CUDA : faster k-quant dot kernels (#1862) %!s(int64=2) %!d(string=hai) anos
  daboe01 cf267d1c71 make : add train-text-from-scratch (#1850) %!s(int64=2) %!d(string=hai) anos
  sandyiscool 37e257c48e make : clean *.so files (#1857) %!s(int64=2) %!d(string=hai) anos
  Kerfuffle 74d4cfa343 Allow "quantizing" to f16 and f32 (#1787) %!s(int64=2) %!d(string=hai) anos
  rankaiyx 555275a693 make : add SSSE3 compilation use case (#1659) %!s(int64=2) %!d(string=hai) anos
  Georgi Gerganov 5c64a0952e k-quants : allow to optionally disable at compile time (#1734) %!s(int64=2) %!d(string=hai) anos
  Georgi Gerganov 2d43387daf ggml : fix builds, add ggml-quants-k.o (close #1712, close #1710) %!s(int64=2) %!d(string=hai) anos