Georgi Gerganov 1da7b76569 server : fix speculative decoding with context shift (#10641) 1 year ago
..
test_basic.py b782e5c7d4 server : add more test cases (#10569) 1 year ago
test_chat_completion.py 64ed2091b2 server: Add "tokens per second" information in the backend (#10548) 1 year ago
test_completion.py 45abe0f74e server : replace behave with pytest (#10416) 1 year ago
test_ctx_shift.py 45abe0f74e server : replace behave with pytest (#10416) 1 year ago
test_embedding.py 45abe0f74e server : replace behave with pytest (#10416) 1 year ago
test_infill.py b782e5c7d4 server : add more test cases (#10569) 1 year ago
test_lora.py 45abe0f74e server : replace behave with pytest (#10416) 1 year ago
test_rerank.py b782e5c7d4 server : add more test cases (#10569) 1 year ago
test_security.py 45abe0f74e server : replace behave with pytest (#10416) 1 year ago
test_slot_save.py 45abe0f74e server : replace behave with pytest (#10416) 1 year ago
test_speculative.py 1da7b76569 server : fix speculative decoding with context shift (#10641) 1 year ago
test_tokenize.py 45abe0f74e server : replace behave with pytest (#10416) 1 year ago