Georgi Gerganov
|
b730706a49
kv-cache : support layer reuse (#15504)
|
5 months ago |
Georgi Gerganov
|
715a6db02c
kv-cache : drop the "unified" prefix (#15467)
|
5 months ago |
Georgi Gerganov
|
d32e03f449
server : add SWA checkpoints (#15293)
|
5 months ago |
Georgi Gerganov
|
692e3cdd0a
memory : rename interface to llama_memory_context_i (#14296)
|
7 months ago |
Georgi Gerganov
|
4c9fdfbe15
ubatch : new splitting logic (#14217)
|
7 months ago |
Gabe Goodhart
|
edc4a29eff
memory : Hybrid recurrent cache (#13979)
|
7 months ago |