Georgi Gerganov
|
6262d13e0b
common : reimplement logging (#9418)
|
преди 1 година |
compilade
|
3fd62a6b1c
py : type-check all Python scripts with Pyright (#8341)
|
преди 1 година |
Olivier Chafik
|
1c641e6aac
`build`: rename main → llama-cli, server → llama-server, llava-cli → llama-llava-cli, etc... (#7809)
|
преди 1 година |
Pierrick Hymbert
|
24ecb58168
Revert "server bench: fix bench not waiting for model load (#7284)" (#7334)
|
преди 1 година |
Johannes Gäßler
|
583fd6b000
server bench: fix bench not waiting for model load (#7284)
|
преди 1 година |
Georgi Gerganov
|
9c67c2773d
ggml : add Flash Attention (#5021)
|
преди 1 година |
Pierrick Hymbert
|
75cd4c7729
ci: bench: support sse and fix prompt processing time / server: add tokens usage in stream OAI response (#6495)
|
преди 1 година |
Pierrick Hymbert
|
7a2c92637a
ci: bench: add more ftype, fix triggers and bot comment (#6466)
|
преди 1 година |
Pierrick Hymbert
|
a016026a3a
server: continuous performance monitoring and PR comment (#6283)
|
преди 1 година |