Xuan Son Nguyen
|
f30f099228
server : implement cancellable request (#11285)
|
1 year ago |
Xuan Son Nguyen
|
0da5d86026
server : allow using LoRA adapters per-request (#10994)
|
1 year ago |
Xuan Son Nguyen
|
45095a61bf
server : clean up built-in template detection (#11026)
|
1 year ago |
Georgi Gerganov
|
152610eda9
server : output embeddings for all tokens when pooling = none (#10861)
|
1 year ago |
Yüg
|
a86ad841f1
server : add flag to disable the web-ui (#10762) (#10751)
|
1 year ago |
Xuan Son Nguyen
|
ce8784bdb1
server : fix format_infill (#10724)
|
1 year ago |
Xuan Son Nguyen
|
3573fa8e7b
server : (refactor) no more json in server_task input (#10691)
|
1 year ago |
Xuan Son Nguyen
|
b782e5c7d4
server : add more test cases (#10569)
|
1 year ago |
Xuan Son Nguyen
|
6c59567689
server : (tests) don't use thread for capturing stdout/stderr, bump openai client library (#10568)
|
1 year ago |
Xuan Son Nguyen
|
9f912511bc
common : fix duplicated file name with hf_repo and hf_file (#10550)
|
1 year ago |
Xuan Son Nguyen
|
45abe0f74e
server : replace behave with pytest (#10416)
|
1 year ago |