Commit History

Author SHA1 Message Date
  Judd 36680f6e40 convert : update for baichuan (#2081) 2 years ago
  Johannes Gäßler 924dd22fd3 Quantized dot products for CUDA mul mat vec (#2067) 2 years ago
  Georgi Gerganov b472f3fca5 readme : add link web chat PR 2 years ago
  Judd 471aab6e4c convert : add support of baichuan-7b (#2055) 2 years ago
  Roman Parykin d38e451578 readme : add Scala 3 bindings repo (#2010) 2 years ago
  Gustavo Rocha Dias aa777abbb7 readme : LD_LIBRARY_PATH complement for some Android devices when building with CLBlast inside Termux (#2007) 2 years ago
  Georgi Gerganov 412c60e473 readme : add link to new k-quants for visibility 2 years ago
  Georgi Gerganov 447ccbe8c3 readme : add new roadmap + manifesto 2 years ago
  Georgi Gerganov 66a2555ba6 readme : add Azure CI discussion link 2 years ago
  Georgi Gerganov 11da1a85cd readme : fix whitespaces 2 years ago
  Alberto 235b610d65 readme : fixed termux instructions (#1973) 2 years ago
  eiery d7b7484f74 Add OpenLLaMA instructions to the README (#1954) 2 years ago
  Rahul Vivek Nair fb98254f99 Fix typo in README.md (#1961) 2 years ago
  Georgi Gerganov 049aa16b8c readme : add link to p1 2 years ago
  Xiake Sun 2322ec223a Fix typo (#1949) 2 years ago
  Johannes Gäßler 16b9cd1939 Convert vector to f16 for dequantize mul mat vec (#1913) 2 years ago
  Mike e1886cf4fe readme : update Android build instructions (#1922) 2 years ago
  Johannes Gäßler 2c9380dd2f Only one CUDA stream per device for async compute (#1898) 2 years ago
  Gustavo Rocha Dias bac19927c3 readme : alternative way to build for Android with CLBlast. (#1828) 2 years ago
  Aisuko 059e99066d doc : fix wrong address of BLIS.md (#1772) 2 years ago
  Georgi Gerganov 4dc62c545d readme : add June roadmap 2 years ago
  Yuval Peled f4c55d3bd7 docs : add performance troubleshoot + example benchmark documentation (#1674) 2 years ago
  Foul-Tarnished f1465624c2 readme : fix typo (#1700) 2 years ago
  Georgi Gerganov 827f5eda91 readme : update hot topics 2 years ago
  Georgi Gerganov ecb217db4f llama : Metal inference (#1642) 2 years ago
  Henri Vasserman d8bd0013e8 Add info about CUDA_VISIBLE_DEVICES (#1682) 2 years ago
  Henri Vasserman 97c9b77c4f Add documentation about CLBlast (#1604) 2 years ago
  Evan Jones c31bbe934b readme : add docs for chat-persistent.sh (#1568) 2 years ago
  Zenix b8ee340abe feature : support blis and other blas implementation (#1536) 2 years ago
  Georgi Gerganov ea600071cb Revert "feature : add blis and other BLAS implementation support (#1502)" 2 years ago