Olivier Chafik
|
1a24c4621f
`server`: fix deadly typo in response_format.json_schema.schema handling (#12168)
|
10 ヶ月 前 |
Olivier Chafik
|
63e489c025
tool-call: refactor common chat / tool-call api (+ tests / fixes) (#11900)
|
11 ヶ月 前 |
Olivier Chafik
|
cde3833239
`tool-call`: allow `--chat-template chatml` w/ `--jinja`, default to chatml upon parsing issue, avoid double bos (#11616)
|
11 ヶ月 前 |
Olivier Chafik
|
4a2b196d03
server : fix --jinja when there's no tools or schema (typo was forcing JSON) (#11531)
|
11 ヶ月 前 |
Olivier Chafik
|
8b576b6c55
Tool call support (generic + native for Llama, Functionary, Hermes, Mistral, Firefunction, DeepSeek) w/ lazy grammars (#9639)
|
11 ヶ月 前 |
Nigel Bosch
|
eb7cf15a80
server : add /apply-template endpoint for additional use cases of Minja functionality (#11489)
|
11 ヶ月 前 |
Olivier Chafik
|
6171c9d258
Add Jinja template support (#11016)
|
1 年間 前 |
Xuan Son Nguyen
|
45095a61bf
server : clean up built-in template detection (#11026)
|
1 年間 前 |
Xuan Son Nguyen
|
5896c65232
server : add OAI compat for /v1/completions (#10974)
|
1 年間 前 |
Xuan Son Nguyen
|
485dc01214
server : add system_fingerprint to chat/completion (#10917)
|
1 年間 前 |
Xuan Son Nguyen
|
57bb2c40cd
server : fix logprobs, make it OAI-compatible (#10783)
|
1 年間 前 |
Xuan Son Nguyen
|
3573fa8e7b
server : (refactor) no more json in server_task input (#10691)
|
1 年間 前 |
Xuan Son Nguyen
|
6c5bc0625f
server : (refactoring) do not rely on JSON internally (#10643)
|
1 年間 前 |
haopeng
|
64ed2091b2
server: Add "tokens per second" information in the backend (#10548)
|
1 年間 前 |
Xuan Son Nguyen
|
b782e5c7d4
server : add more test cases (#10569)
|
1 年間 前 |
Xuan Son Nguyen
|
45abe0f74e
server : replace behave with pytest (#10416)
|
1 年間 前 |