Pierrick Hymbert
|
f482bb2e49
common: llama_load_model_from_url split support (#6192)
|
1 год назад |
Pierrick Hymbert
|
fd72d2d2a5
server: tests: add truncated prompt tests, better kv cache size (#5933)
|
1 год назад |
Georgi Gerganov
|
2002bc96bf
server : refactor (#5882)
|
1 год назад |
Pierrick Hymbert
|
9731134296
server: tests: passkey challenge / self-extend with context shift demo (#5832)
|
1 год назад |
Jorge A
|
efc72253f7
server : add "/chat/completions" alias for "/v1/...` (#5722)
|
1 год назад |
Pierrick Hymbert
|
9e359a4f47
server: continue to update other slots on embedding concurrent request (#5699)
|
1 год назад |
Pierrick Hymbert
|
525213d2f5
server: init functional tests (#5566)
|
1 год назад |