Commit History

Author SHA1 Message Date
  Georgi Gerganov 7b50d589a8 kv-cells : fix tracking of seq_pos (#14339) 7 months ago
  Georgi Gerganov 4c9fdfbe15 ubatch : new splitting logic (#14217) 7 months ago
  Georgi Gerganov c311ac664d cparams : rename LLAMA_MAX_PARALLEL_SEQUENCES to LLAMA_MAX_SEQ (#14188) 7 months ago
  Georgi Gerganov 40cbf571c9 kv-cache : fix shift and defrag logic (#14081) 7 months ago
  Georgi Gerganov 12d0188c0d kv-cache : refactor + add llama_memory_state_i (#13746) 8 months ago
  Georgi Gerganov 81713121ee kv-cells : track min/max used cells and per-sequence positions (#13808) 8 months ago
  Georgi Gerganov de2ef53a4b kv-cache : rework kv_cell (#13706) 8 months ago