Commit History

Author           SHA1        Message                                                                  Date
---------------  ----------  -----------------------------------------------------------------------  ------------
Georgi Gerganov  d3e64b9f49  llama : rework embeddings logic (#14208)                                 7 months ago
Georgi Gerganov  c3ee46fab4  batch : remove logits_all flag (#14141)                                  7 months ago
compilade        dad5c44398  kv-cache : avoid modifying recurrent cells when setting inputs (#13834)  7 months ago
Georgi Gerganov  745aa5319b  llama : deprecate llama_kv_self_ API (#14030)                            7 months ago
Georgi Gerganov  3e63a58ef7  kv-cache : refactor the update/defrag mechanism (#13988)                 7 months ago
Georgi Gerganov  0fc16b42e8  kv-cache : split implementation in separate sources (#13920)             7 months ago