Xuan-Son Nguyen | 6df686bee6 | server : refactor oai_parser_opt, move it to server_chat_params (#18937) | 1 week ago
Xuan-Son Nguyen | a04c2b06a3 | server: improve slots scheduling for n_cmpl (#18789) | 1 week ago
Georgi Gerganov | 39173bcacb | context : reserve new scheduler when graph topology changes (#18547) | 1 week ago
Xuan-Son Nguyen | 9ac2693a30 | server: fix n_cmpl not skipping processing prompt (#18663) | 2 weeks ago
R | 3d26a09dc7 | server : add thinking content blocks to Anthropic Messages API (#18551) | 3 weeks ago
Xuan-Son Nguyen | 6ce863c803 | server: prevent data race from HTTP threads (#18263) | 1 month ago
Xuan-Son Nguyen | 6c2131773c | cli: new CLI experience (#17824) | 1 month ago
Xuan-Son Nguyen | 951520ddb0 | server: delegate result_state creation to server_task (#17835) | 1 month ago
Georgi Gerganov | 2bc96931d2 | server : make cache_reuse configurable per request (#17858) | 1 month ago
Xuan-Son Nguyen | c42712b056 | server: support multiple generations from one prompt (OAI "n" option) (#17775) | 1 month ago
Xuan-Son Nguyen | c4c10bfb86 | server: move msg diffs tracking to HTTP thread (#17740) | 1 month ago
Fredrik Hultin | ddf9f94389 | server : add Anthropic Messages API support (#17570) | 2 months ago
Xuan-Son Nguyen | b8372eecd9 | server: split server.cpp code into server/common/task/queue (#17362) | 2 months ago