Xuan-Son Nguyen
|
f896d2c34f
server: improve speed of speculative decoding (#17808)
|
1 month ago |
Xuan-Son Nguyen
|
c42712b056
server: support multiple generations from one prompt (OAI "n" option) (#17775)
|
1 month ago |
Xuan-Son Nguyen
|
13628d8bdb
server: add --media-path for local media files (#17697)
|
1 month ago |
Xuan-Son Nguyen
|
5d6bd842ea
server: remove default "gpt-3.5-turbo" model name (#17668)
|
1 month ago |
Fredrik Hultin
|
ddf9f94389
server : add Anthropic Messages API support (#17570)
|
1 month ago |
Xuan-Son Nguyen
|
b8372eecd9
server: split server.cpp code into server/common/task/queue (#17362)
|
1 month ago |