Georgi Gerganov 152610eda9 server : output embeddings for all tokens when pooling = none (#10861) 1 year ago
test_basic.py a86ad841f1 server : add flag to disable the web-ui (#10762) (#10751) 1 year ago
test_chat_completion.py 3573fa8e7b server : (refactor) no more json in server_task input (#10691) 1 year ago
test_completion.py 0e70ba686e server : add "tokens" output (#10853) 1 year ago
test_ctx_shift.py 45abe0f74e server : replace behave with pytest (#10416) 1 year ago
test_embedding.py 152610eda9 server : output embeddings for all tokens when pooling = none (#10861) 1 year ago
test_infill.py ce8784bdb1 server : fix format_infill (#10724) 1 year ago
test_lora.py 45abe0f74e server : replace behave with pytest (#10416) 1 year ago
test_rerank.py 05c3a444b8 server : fill usage info in embeddings and rerank responses (#10852) 1 year ago
test_security.py 45abe0f74e server : replace behave with pytest (#10416) 1 year ago
test_slot_save.py 45abe0f74e server : replace behave with pytest (#10416) 1 year ago
test_speculative.py 1da7b76569 server : fix speculative decoding with context shift (#10641) 1 year ago
test_tokenize.py 45abe0f74e server : replace behave with pytest (#10416) 1 year ago