| .. |
|
test_basic.py
|
1d36b3670b
llama : move end-user examples to tools directory (#13249)
|
hai 8 meses |
|
test_chat_completion.py
|
ddef99522d
server : fix assistant prefilling when content is an array (#14360)
|
hai 6 meses |
|
test_completion.py
|
f13847cfb5
server: fix regression on streamed non-chat completion w/ stops (#13785)
|
hai 7 meses |
|
test_ctx_shift.py
|
6aa892ec2a
server : do not return error out of context (with ctx shift disabled) (#13577)
|
hai 8 meses |
|
test_embedding.py
|
1d36b3670b
llama : move end-user examples to tools directory (#13249)
|
hai 8 meses |
|
test_infill.py
|
1d36b3670b
llama : move end-user examples to tools directory (#13249)
|
hai 8 meses |
|
test_lora.py
|
1d36b3670b
llama : move end-user examples to tools directory (#13249)
|
hai 8 meses |
|
test_rerank.py
|
1d36b3670b
llama : move end-user examples to tools directory (#13249)
|
hai 8 meses |
|
test_security.py
|
1d36b3670b
llama : move end-user examples to tools directory (#13249)
|
hai 8 meses |
|
test_slot_save.py
|
1d36b3670b
llama : move end-user examples to tools directory (#13249)
|
hai 8 meses |
|
test_speculative.py
|
1d36b3670b
llama : move end-user examples to tools directory (#13249)
|
hai 8 meses |
|
test_template.py
|
e121edc432
`server`: add `--reasoning-budget 0` to disable thinking (incl. qwen3 w/ enable_thinking:false) (#13771)
|
hai 7 meses |
|
test_tokenize.py
|
1d36b3670b
llama : move end-user examples to tools directory (#13249)
|
hai 8 meses |
|
test_tool_call.py
|
c9bbc77931
`server`: update deepseek reasoning format (pass reasoning_content as diffs) (#13933)
|
hai 7 meses |
|
test_vision_api.py
|
9ecf3e66a3
server : support audio input (#13714)
|
hai 7 meses |