Commit History

| Author | SHA1 | Message | Date |
|---|---|---|---|
| Georgi Gerganov | 81713121ee | kv-cells : track min/max used cells and per-sequence positions (#13808) | 8 months ago |
| Georgi Gerganov | de2ef53a4b | kv-cache : rework kv_cell (#13706) | 8 months ago |
| Georgi Gerganov | 797f2ac062 | kv-cache : simplify the interface (#13660) | 8 months ago |
| Georgi Gerganov | a4090d1174 | llama : remove llama_kv_cache_view API + remove deprecated (#13653) | 8 months ago |
| Georgi Gerganov | e298d2fbd0 | kv-cache : add SWA support (#13194) | 8 months ago |
| Georgi Gerganov | e3a9421b78 | kv-cache : fix out-of-bounds view during reserve graph (#13547) | 8 months ago |
| Georgi Gerganov | c642bc014c | kv-cache : separate recurrent vs non-recurrent impl (#12799) | 8 months ago |
| Juk Armstrong | daa422881a | llama : DeepSeek V2/V3 MLA implementation (#12801) | 9 months ago |
| Georgi Gerganov | 3e1d29348b | kv-cache : simplify + fix warning for recurrent models (#12756) | 9 months ago |
| Georgi Gerganov | a10b36c91a | llama : refactor kv cache guard (#12695) | 9 months ago |
| Georgi Gerganov | e0dbec0bc6 | llama : refactor llama_context, llama_kv_cache, llm_build_context (#12181) | 10 months ago |
| Georgi Gerganov | afa8a9ec9b | llama : add `llama_vocab`, functions -> methods, naming (#11110) | 1 year ago |
| Daniel Bevenius | 6369f867a4 | llama : rename missed batch params/vars to ubatch (#10059) | 1 year ago |
| Georgi Gerganov | f66f582927 | llama : refactor `src/llama.cpp` (#10902) | 1 year ago |