Xuan Son Nguyen
|
9b2c24c099
server : simplify state machine for slot (#9283)
|
1 год назад |
Xuan Son Nguyen
|
48baa61ecc
server : test script : add timeout for all requests (#9282)
|
1 год назад |
Xuan Son Nguyen
|
6e7d133a5f
server : refactor multitask handling (#9274)
|
1 год назад |
Pierrick Hymbert
|
f482bb2e49
common: llama_load_model_from_url split support (#6192)
|
1 год назад |
Pierrick Hymbert
|
fd72d2d2a5
server: tests: add truncated prompt tests, better kv cache size (#5933)
|
1 год назад |
Georgi Gerganov
|
2002bc96bf
server : refactor (#5882)
|
1 год назад |
Pierrick Hymbert
|
9731134296
server: tests: passkey challenge / self-extend with context shift demo (#5832)
|
1 год назад |
Jorge A
|
efc72253f7
server : add "/chat/completions" alias for "/v1/...` (#5722)
|
1 год назад |
Pierrick Hymbert
|
9e359a4f47
server: continue to update other slots on embedding concurrent request (#5699)
|
1 год назад |
Pierrick Hymbert
|
525213d2f5
server: init functional tests (#5566)
|
1 год назад |