Commit History

Autor SHA1 Mensaxe Data
  Xuan Son Nguyen bfe76d4a17 common : move arg parser code to `arg.cpp` (#9388) hai 1 ano
  slaren 5fb5e24811 llama : minor sampling refactor (2) (#9386) hai 1 ano
  Xuan Son Nguyen 1b9ae5189c common : refactor arg parser (#9308) hai 1 ano
  Georgi Gerganov df270ef745 llama : refactor sampling v2 (#9294) hai 1 ano
  Liu Jia 0a4ce78681 common : Changed tuple to struct (TODO fix) (#8823) hai 1 ano
  compilade 4c676c85e5 llama : refactor session file management (#8699) hai 1 ano
  Georgi Gerganov 1442677f92 common : refactor cli arg parsing (#7675) hai 1 ano
  Jan Boon beea6e1b16 llama : save and restore kv cache for single seq id (#6341) hai 1 ano
  David Friehs df845cc982 llama : minimize size used for state save/load (#4820) %!s(int64=2) %!d(string=hai) anos
  cebtenzzre b12fa0d1c1 build : link against build info instead of compiling against it (#3879) %!s(int64=2) %!d(string=hai) anos
  Georgi Gerganov 1142013da4 save-load-state : fix example + add ci test (#3655) %!s(int64=2) %!d(string=hai) anos
  Kerfuffle 70c29da118 common : fix mirostat state when using multiple sequences (#3543) %!s(int64=2) %!d(string=hai) anos
  slaren 16bc66d947 llama.cpp : split llama_context_params into model and context params (#3301) %!s(int64=2) %!d(string=hai) anos
  Georgi Gerganov ec893798b7 llama : custom attention mask + parallel decoding + no context swaps (#3228) %!s(int64=2) %!d(string=hai) anos
  Cebtenzzre 8781013ef6 make : restore build-info.h dependency for several targets (#3205) %!s(int64=2) %!d(string=hai) anos
  Cebtenzzre e6616cf0db examples : add compiler version and target to build info (#2998) %!s(int64=2) %!d(string=hai) anos
  Cebtenzzre 00d62adb79 fix some warnings from gcc and clang-tidy (#3038) %!s(int64=2) %!d(string=hai) anos
  Georgi Gerganov edd4c14817 llama : more tokenizer fixes (#2810) %!s(int64=2) %!d(string=hai) anos
  Georgi Gerganov 6381d4e110 gguf : new file format with flexible meta data (beta) (#2398) %!s(int64=2) %!d(string=hai) anos
  Rand Xie 65cdf34bdc llama : use n_embd_gqa instead of n_embd to handle llama-2 70B (#2433) %!s(int64=2) %!d(string=hai) anos
  Didzis Gosko 527b6fba1d llama : make model stateless and context stateful (llama_state) (#1797) %!s(int64=2) %!d(string=hai) anos
  Borislav Stanimirov 9cbf50c041 build : fix and ignore MSVC warnings (#1889) %!s(int64=2) %!d(string=hai) anos
  Stephan Walter dc271c52ed Remove unused n_parts parameter (#1509) %!s(int64=2) %!d(string=hai) anos
  András Salamon 9560655409 define default model path once, sync path with readme (#1366) %!s(int64=2) %!d(string=hai) anos
  DannyDaemonic f4cef87edf Add git-based build information for better issue tracking (#1232) %!s(int64=2) %!d(string=hai) anos
  Georgi Gerganov 305eb5afd5 build : fix reference to old llama_util.h %!s(int64=2) %!d(string=hai) anos
  Georgi Gerganov 84ca9c2ecf examples : fix save-load-state + rename llama-util.h %!s(int64=2) %!d(string=hai) anos
  Ivan Stepanov dd7eff57d8 llama : new sampling algorithms (#1126) %!s(int64=2) %!d(string=hai) anos
  xaedes 0c5692345d examples : add save_load_state example (#1150) %!s(int64=2) %!d(string=hai) anos