Commit History

Autor SHA1 Mensaxe Data
  anzz1 2f7bf7dd7c CMake / CI additions (#497) %!s(int64=2) %!d(string=hai) anos
  anzz1 34ab526843 (Windows) Set console to UTF-8 on init (#420) %!s(int64=2) %!d(string=hai) anos
  Georgi Gerganov c2b25b6912 Fix colors enabling on WIN32 %!s(int64=2) %!d(string=hai) anos
  Georgi Gerganov 79b2b266db If n_predict == -1, generate forever %!s(int64=2) %!d(string=hai) anos
  Georgi Gerganov e2d490dafd Inifinite generation via context swapping (#71) %!s(int64=2) %!d(string=hai) anos
  Georgi Gerganov 03f7e33560 Cleanup STL headers + fix embedding examples + minor stuff %!s(int64=2) %!d(string=hai) anos
  Georgi Gerganov 55ad42af84 Move chat scripts into "./examples" %!s(int64=2) %!d(string=hai) anos
  slaren 459e93cce0 Add AVX2 implementation of dequantize_row_q4_1 (#505) %!s(int64=2) %!d(string=hai) anos
  Georgi Gerganov a316a425d0 Overhaul the examples structure %!s(int64=2) %!d(string=hai) anos
  Georgi Gerganov ecbe466a36 Retire the ggml_mul_mat() branch for transposed src0 (#500) %!s(int64=2) %!d(string=hai) anos
  Georgi Gerganov 502a400192 Disable prompt verbosity by default and add option to enable (#480) %!s(int64=2) %!d(string=hai) anos
  slaren 09aecbf628 Add AVX2 implementation of dequantize_row_q4_0 (#467) %!s(int64=2) %!d(string=hai) anos
  Georgi Gerganov 4640eff23d Don't interefe with BLAS for large prompts by running only 1 thread %!s(int64=2) %!d(string=hai) anos
  Georgi Gerganov ab77d76312 Add longer DAN prompt for testing big batch numbers %!s(int64=2) %!d(string=hai) anos
  slaren 29b7baab67 Add timings for the prompt evaluation (#478) %!s(int64=2) %!d(string=hai) anos
  Georgi Gerganov 4a7129acd2 Remove obsolete information from README %!s(int64=2) %!d(string=hai) anos
  Georgi Gerganov 6b6dbc8910 Remove obsolete assert and fix compiler warning %!s(int64=2) %!d(string=hai) anos
  Georgi Gerganov 2a2e63ce05 Fix nasty bug in ggml_compute_forward_mul_mat_f32() and reenable BLAS %!s(int64=2) %!d(string=hai) anos
  anzz1 e899bf54b2 bounds checking for input prefix (#492) %!s(int64=2) %!d(string=hai) anos
  anzz1 fbd4d38c64 feat: '--in-prefix STRING' option (#426) %!s(int64=2) %!d(string=hai) anos
  Jed Fox 58e6c9f36f Add support for file load progress reporting callbacks (#434) %!s(int64=2) %!d(string=hai) anos
  Doomsdayrs 36d07532ef Add missing struct annotation (#483) %!s(int64=2) %!d(string=hai) anos
  Chris Kuehl 6f1ee4b640 Fix crash for 65B model with pre-allocated memory (#485) %!s(int64=2) %!d(string=hai) anos
  Georgi Gerganov 8520fc310e Disable BLAS altogether - the bug is not just for qunatized mat mul %!s(int64=2) %!d(string=hai) anos
  Georgi Gerganov b3f460e941 Disable BLAS branch in mul_mat - seems there is a bug %!s(int64=2) %!d(string=hai) anos
  Georgi Gerganov 04c6f5ed6f Immediately start processing the prompt before user input has been provided (#476) %!s(int64=2) %!d(string=hai) anos
  Georgi Gerganov 7a9b6c3a8b Reduce memory usage and allocate enough memory for largest context (#473) %!s(int64=2) %!d(string=hai) anos
  Georgi Gerganov 31572d9665 Temporary bump the memory buffer size - hopefully fix issues from 483bab2e %!s(int64=2) %!d(string=hai) anos
  Gary Mulder f4f5362edb Update README.md (#444) %!s(int64=2) %!d(string=hai) anos
  rabidcopy 863f65e2e3 fix instruct mode (#445) %!s(int64=2) %!d(string=hai) anos