Xuan Son Nguyen
|
0da5d86026
server : allow using LoRA adapters per-request (#10994)
|
1 year ago |
Xuan Son Nguyen
|
6c59567689
server : (tests) don't use thread for capturing stdout/stderr, bump openai client library (#10568)
|
1 year ago |
Xuan Son Nguyen
|
45abe0f74e
server : replace behave with pytest (#10416)
|
1 year ago |
vb
|
08a43d05b6
py : update transfomers version (#9694)
|
1 year ago |
Xuan Son Nguyen
|
1e6f6554aa
server : add lora hotswap endpoint (WIP) (#8857)
|
1 year ago |
compilade
|
3fd62a6b1c
py : type-check all Python scripts with Pyright (#8341)
|
1 year ago |
Jared Van Bortel
|
bd60d82d0c
server tests : more pythonic process management; fix bare `except:` (#6146)
|
1 year ago |
Pierrick Hymbert
|
d01b3c4c32
common: llama_load_model_from_url using --model-url (#6098)
|
1 year ago |
Georgi Gerganov
|
2002bc96bf
server : refactor (#5882)
|
1 year ago |
Pierrick Hymbert
|
9731134296
server: tests: passkey challenge / self-extend with context shift demo (#5832)
|
1 year ago |
Pierrick Hymbert
|
d52d7819b8
server: concurrency fix + monitoring - add /metrics prometheus compatible endpoint (#5708)
|
1 year ago |
Pierrick Hymbert
|
525213d2f5
server: init functional tests (#5566)
|
1 year ago |