Commit History

Author           SHA1        Message                                                     Date
Johannes Gäßler  1613ef8d8e  CUDA: CUDART < 11.7 workaround for __hmax, __hmax2 (#7019)  1 year ago
Georgi Gerganov  9c67c2773d  ggml : add Flash Attention (#5021)                          1 year ago
Carolinabanana   5dc9dd7152  llama : add Command R Plus support (#6491)                  1 year ago
Georgi Gerganov  d48ccf3ad4  sync : ggml (#6351)                                         1 year ago
slaren           ae1f211ce2  cuda : refactor into multiple files (#6269)                 1 year ago