Pierrick Hymbert
|
24ecb58168
Revert "server bench: fix bench not waiting for model load (#7284)" (#7334)
|
1 year ago |
Johannes Gäßler
|
583fd6b000
server bench: fix bench not waiting for model load (#7284)
|
1 year ago |
Georgi Gerganov
|
9c67c2773d
ggml : add Flash Attention (#5021)
|
1 year ago |
Pierrick Hymbert
|
75cd4c7729
ci: bench: support sse and fix prompt processing time / server: add tokens usage in stream OAI response (#6495)
|
1 year ago |
Pierrick Hymbert
|
7a2c92637a
ci: bench: add more ftype, fix triggers and bot comment (#6466)
|
1 year ago |
Pierrick Hymbert
|
a016026a3a
server: continuous performance monitoring and PR comment (#6283)
|
1 year ago |