Georgi Gerganov
|
d3e64b9f49
llama : rework embeddings logic (#14208)
|
7 months ago |
Georgi Gerganov
|
c3ee46fab4
batch : remove logits_all flag (#14141)
|
7 months ago |
compilade
|
dad5c44398
kv-cache : avoid modifying recurrent cells when setting inputs (#13834)
|
7 months ago |
Georgi Gerganov
|
745aa5319b
llama : deprecate llama_kv_self_ API (#14030)
|
7 months ago |
Georgi Gerganov
|
3e63a58ef7
kv-cache : refactor the update/defrag mechanism (#13988)
|
7 months ago |
Georgi Gerganov
|
0fc16b42e8
kv-cache : split implementation in separate sources (#13920)
|
7 months ago |