Georgi Gerganov
|
7b50d589a8
kv-cells : fix tracking of seq_pos (#14339)
|
7 months ago |
Georgi Gerganov
|
4c9fdfbe15
ubatch : new splitting logic (#14217)
|
7 months ago |
Georgi Gerganov
|
c311ac664d
cparams : rename LLAMA_MAX_PARALLEL_SEQUENCES to LLAMA_MAX_SEQ (#14188)
|
7 months ago |
Georgi Gerganov
|
40cbf571c9
kv-cache : fix shift and defrag logic (#14081)
|
7 months ago |
Georgi Gerganov
|
12d0188c0d
kv-cache : refactor + add llama_memory_state_i (#13746)
|
8 months ago |
Georgi Gerganov
|
81713121ee
kv-cells : track min/max used cells and per-sequence positions (#13808)
|
8 months ago |
Georgi Gerganov
|
de2ef53a4b
kv-cache : rework kv_cell (#13706)
|
8 months ago |