Johannes Gäßler
|
e789095502
llama: print memory breakdown on exit (#15860)
|
пре 3 месеци |
Georgi Gerganov
|
c610b6c11b
kv-cache : fix SWA checks + disable cacheless iSWA (#15811)
|
пре 4 месеци |
Daniel Bevenius
|
fb15d649ed
llama : add support for EmbeddingGemma 300m (#15798)
|
пре 4 месеци |
Georgi Gerganov
|
b730706a49
kv-cache : support layer reuse (#15504)
|
пре 4 месеци |
Georgi Gerganov
|
715a6db02c
kv-cache : drop the "unified" prefix (#15467)
|
пре 5 месеци |
Georgi Gerganov
|
d32e03f449
server : add SWA checkpoints (#15293)
|
пре 5 месеци |
compilade
|
11a3811164
memory : handle kv_unified for hybrid models (#15050)
|
пре 5 месеци |
Diner Burger
|
496957e1cb
llama : fix parameter order for hybrid memory initialization (#14725)
|
пре 6 месеци |
Georgi Gerganov
|
225e7a1438
llama : add high-throughput mode (#14363)
|
пре 6 месеци |
Georgi Gerganov
|
67d1ef23c6
batch : add optional for sequential equal split (#14511)
|
пре 6 месеци |
Georgi Gerganov
|
c79184d2d1
batch : add n_used count (#14512)
|
пре 6 месеци |
Georgi Gerganov
|
a70c8a0c4b
kv-cache : use ggml_set_rows (#14285)
|
пре 6 месеци |
Georgi Gerganov
|
745f11fed0
memory : correctly handle failure in apply() (#14438)
|
пре 6 месеци |
Georgi Gerganov
|
692e3cdd0a
memory : rename interface to llama_memory_context_i (#14296)
|
пре 7 месеци |
Georgi Gerganov
|
4c9fdfbe15
ubatch : new splitting logic (#14217)
|
пре 7 месеци |
Gabe Goodhart
|
edc4a29eff
memory : Hybrid recurrent cache (#13979)
|
пре 7 месеци |