Commit History

Author SHA1 Message Date
  Georgi Gerganov 00131d6eaf tests : update for LLAMA_SET_ROWS=1 (#14961) 5 months ago
  Georgi Gerganov 745aa5319b llama : deprecate llama_kv_self_ API (#14030) 7 months ago
  Georgi Gerganov e0dbec0bc6 llama : refactor llama_context, llama_kv_cache, llm_build_context (#12181) 10 months ago
  Georgi Gerganov afa8a9ec9b llama : add `llama_vocab`, functions -> methods, naming (#11110) 1 year ago
  Georgi Gerganov f66f582927 llama : refactor `src/llama.cpp` (#10902) 1 year ago
  Georgi Gerganov d9d54e498d speculative : refactor and add a simpler example (#10362) 1 year ago
  Georgi Gerganov 55e47786e3 llama : default sampling changes + greedy update (#9897) 1 year ago
  Xuan Son Nguyen cda0e4b648 llama : remove all_pos_0, all_pos_1, all_seq_id from llama_batch (#9745) 1 year ago
  Diego Devesa 7eee341bee common : use common_ prefix for common library functions (#9805) 1 year ago
  Xuan Son Nguyen bfe76d4a17 common : move arg parser code to `arg.cpp` (#9388) 1 year ago
  slaren 5fb5e24811 llama : minor sampling refactor (2) (#9386) 1 year ago
  Xuan Son Nguyen 1b9ae5189c common : refactor arg parser (#9308) 1 year ago
  Georgi Gerganov df270ef745 llama : refactor sampling v2 (#9294) 1 year ago
  Liu Jia 0a4ce78681 common : Changed tuple to struct (TODO fix) (#8823) 1 year ago
  compilade 4c676c85e5 llama : refactor session file management (#8699) 1 year ago
  Georgi Gerganov 1442677f92 common : refactor cli arg parsing (#7675) 1 year ago
  Jan Boon beea6e1b16 llama : save and restore kv cache for single seq id (#6341) 1 year ago
  David Friehs df845cc982 llama : minimize size used for state save/load (#4820) 2 years ago
  cebtenzzre b12fa0d1c1 build : link against build info instead of compiling against it (#3879) 2 years ago
  Georgi Gerganov 1142013da4 save-load-state : fix example + add ci test (#3655) 2 years ago
  Kerfuffle 70c29da118 common : fix mirostat state when using multiple sequences (#3543) 2 years ago
  slaren 16bc66d947 llama.cpp : split llama_context_params into model and context params (#3301) 2 years ago
  Georgi Gerganov ec893798b7 llama : custom attention mask + parallel decoding + no context swaps (#3228) 2 years ago
  Cebtenzzre 8781013ef6 make : restore build-info.h dependency for several targets (#3205) 2 years ago
  Cebtenzzre e6616cf0db examples : add compiler version and target to build info (#2998) 2 years ago
  Cebtenzzre 00d62adb79 fix some warnings from gcc and clang-tidy (#3038) 2 years ago
  Georgi Gerganov edd4c14817 llama : more tokenizer fixes (#2810) 2 years ago
  Georgi Gerganov 6381d4e110 gguf : new file format with flexible meta data (beta) (#2398) 2 years ago
  Rand Xie 65cdf34bdc llama : use n_embd_gqa instead of n_embd to handle llama-2 70B (#2433) 2 years ago
  Didzis Gosko 527b6fba1d llama : make model stateless and context stateful (llama_state) (#1797) 2 years ago