Yann Follet 31d0ff1869 server / ranking : add sorting and management of top_n (#16403) 3 months ago
test_basic.py d00cbea63c server : host-memory prompt caching (#16391) 3 months ago
test_chat_completion.py 68ee98ae18 server : return HTTP 400 if prompt exceeds context length (#16486) 3 months ago
test_completion.py d00cbea63c server : host-memory prompt caching (#16391) 3 months ago
test_ctx_shift.py d00cbea63c server : host-memory prompt caching (#16391) 3 months ago
test_embedding.py d2fcd91cf9 server : disable context shift by default (#15416) 5 months ago
test_infill.py d2fcd91cf9 server : disable context shift by default (#15416) 5 months ago
test_lora.py d2fcd91cf9 server : disable context shift by default (#15416) 5 months ago
test_rerank.py 31d0ff1869 server / ranking : add sorting and management of top_n (#16403) 3 months ago
test_security.py d2fcd91cf9 server : disable context shift by default (#15416) 5 months ago
test_slot_save.py d2fcd91cf9 server : disable context shift by default (#15416) 5 months ago
test_speculative.py e81b8e4b7f llama: use FA + max. GPU layers by default (#15434) 4 months ago
test_template.py 3c3635d2f2 server : speed up tests (#15836) 4 months ago
test_tokenize.py d2fcd91cf9 server : disable context shift by default (#15416) 5 months ago
test_tool_call.py 3c3635d2f2 server : speed up tests (#15836) 4 months ago
test_vision_api.py 3c3635d2f2 server : speed up tests (#15836) 4 months ago