Georgi Gerganov
|
d32e03f449
server : add SWA checkpoints (#15293)
|
5 месяцев назад |
l3utterfly
|
7233358d29
memory : handle saving/loading null layers in recurrent memory (#14675)
|
5 месяцев назад |
Georgi Gerganov
|
01612b7409
llama : reuse compute graphs (#14482)
|
6 месяцев назад |
compilade
|
4a5686da22
llama : support Jamba hybrid Transformer-Mamba models (#7531)
|
6 месяцев назад |
compilade
|
bb4f7a9e4e
memory : fix broken batch splits for recurrent cache (#14575)
|
6 месяцев назад |
Georgi Gerganov
|
67d1ef23c6
batch : add optional for sequential equal split (#14511)
|
6 месяцев назад |
Georgi Gerganov
|
c79184d2d1
batch : add n_used count (#14512)
|
6 месяцев назад |
Georgi Gerganov
|
745f11fed0
memory : correctly handle failure in apply() (#14438)
|
6 месяцев назад |
Georgi Gerganov
|
43678060c1
recurrent : call balloc split_reset() in init_batch() (#14414)
|
6 месяцев назад |
Georgi Gerganov
|
692e3cdd0a
memory : rename interface to llama_memory_context_i (#14296)
|
7 месяцев назад |
Georgi Gerganov
|
4c9fdfbe15
ubatch : new splitting logic (#14217)
|
7 месяцев назад |
Gabe Goodhart
|
edc4a29eff
memory : Hybrid recurrent cache (#13979)
|
7 месяцев назад |