Commit History

Author SHA1 Message Date
  Pierrick Hymbert 24ecb58168 Revert "server bench: fix bench not waiting for model load (#7284)" (#7334) 1 year ago
  Johannes Gäßler 583fd6b000 server bench: fix bench not waiting for model load (#7284) 1 year ago
  Georgi Gerganov 9c67c2773d ggml : add Flash Attention (#5021) 1 year ago
  Pierrick Hymbert 75cd4c7729 ci: bench: support sse and fix prompt processing time / server: add tokens usage in stream OAI response (#6495) 1 year ago
  Pierrick Hymbert 7a2c92637a ci: bench: add more ftype, fix triggers and bot comment (#6466) 1 year ago
  Pierrick Hymbert a016026a3a server: continuous performance monitoring and PR comment (#6283) 1 year ago