Commit History

Author SHA1 Message Date
  Georgi Gerganov c610b6c11b kv-cache : fix SWA checks + disable cacheless iSWA (#15811) 4 months ago
  Daniel Bevenius fb15d649ed llama : add support for EmbeddingGemma 300m (#15798) 4 months ago
  Georgi Gerganov b730706a49 kv-cache : support layer reuse (#15504) 5 months ago
  Georgi Gerganov 715a6db02c kv-cache : drop the "unified" prefix (#15467) 5 months ago
  Georgi Gerganov d32e03f449 server : add SWA checkpoints (#15293) 5 months ago
  compilade 11a3811164 memory : handle kv_unified for hybrid models (#15050) 5 months ago
  Diner Burger 496957e1cb llama : fix parameter order for hybrid memory initialization (#14725) 6 months ago
  Georgi Gerganov 225e7a1438 llama : add high-throughput mode (#14363) 6 months ago
  Georgi Gerganov 67d1ef23c6 batch : add optional for sequential equal split (#14511) 6 months ago
  Georgi Gerganov c79184d2d1 batch : add n_used count (#14512) 6 months ago
  Georgi Gerganov a70c8a0c4b kv-cache : use ggml_set_rows (#14285) 6 months ago
  Georgi Gerganov 745f11fed0 memory : correctly handle failure in apply() (#14438) 6 months ago
  Georgi Gerganov 692e3cdd0a memory : rename interface to llama_memory_context_i (#14296) 7 months ago
  Georgi Gerganov 4c9fdfbe15 ubatch : new splitting logic (#14217) 7 months ago
  Gabe Goodhart edc4a29eff memory : Hybrid recurrent cache (#13979) 7 months ago