Commit History

Author SHA1 Message Date
  Georgi Gerganov 8520fc310e Disable BLAS altogether - the bug is not just for qunatized mat mul 2 years ago
  Georgi Gerganov b3f460e941 Disable BLAS branch in mul_mat - seems there is a bug 2 years ago
  Georgi Gerganov 04c6f5ed6f Immediately start processing the prompt before user input has been provided (#476) 2 years ago
  Georgi Gerganov 7a9b6c3a8b Reduce memory usage and allocate enough memory for largest context (#473) 2 years ago
  Georgi Gerganov 31572d9665 Temporary bump the memory buffer size - hopefully fix issues from 483bab2e 2 years ago
  Gary Mulder f4f5362edb Update README.md (#444) 2 years ago
  rabidcopy 863f65e2e3 fix instruct mode (#445) 2 years ago
  Georgi Gerganov afd220d9c6 Properly free llama_context on failure 2 years ago
  Cameron Kaiser 481044d50c additional optimizations for POWER9 (#454) 2 years ago
  comex 563cdc391d Support calling mlock() on loaded model data on Linux and macOS (#453) 2 years ago
  Luciano 8d4a855c24 Add embedding mode with arg flag. Currently working (#282) 2 years ago
  Georgi Gerganov b6b268d441 Add link to Roadmap discussion 2 years ago
  Georgi Gerganov 3cd8dde0d1 Revert "Fix memory allocation issues and seg faults" 2 years ago
  Georgi Gerganov 4870e455b3 Fix memory allocation issues and seg faults 2 years ago
  Georgi Gerganov 483bab2e3d Avoid the transposed X branch in the Z = X * Y matrix multiplication (#439) 2 years ago
  Jed Fox 404e1da38e Fix quantize script not finding models in parent directory (#428) 2 years ago
  Georgi Gerganov 4cc053b6d5 Remove oboslete command from Docker script 2 years ago
  Georgi Gerganov 0ba5a3a9a5 Obsolete 2 years ago
  rabidcopy 2e17dfd80a Replace EOS with newline to prevent context/memory being flushed by EOS in interactive mode (#333) 2 years ago
  Timmy Knight 20a1a4e09c Fix GPTQ converter (#423) 2 years ago
  nusu-github ad072fc5ad Generate library with CMake (#430) 2 years ago
  anzz1 ea10d3ded2 Command line args bounds checking (#424) 2 years ago
  Ben Siraphob a18c19259a Fix Nix build 2 years ago
  Stephan Walter a50e39c6fe Revert "Delete SHA256SUMS for now" (#429) 2 years ago
  Kerfuffle a140219e81 Fix Makefile echo escape codes (by removing them). (#418) 2 years ago
  Gary Mulder 8a3e5ef801 Move model section from issue template to README.md (#421) 2 years ago
  anzz1 8eea5ae0e5 Delete SHA256SUMS for now (#416) 2 years ago
  Georgi Gerganov 93208cfb92 Adjust repetition penalty .. 2 years ago
  Georgi Gerganov 03ace14cfd Add link to recent podcast about whisper.cpp and llama.cpp 2 years ago
  anzz1 e4412b45e3 CI: CMake: Separate build and test steps (#376) 2 years ago