Yuri Khrustalev
|
c053e18a66
chat: Add LFM2 tool handling (#16763)
|
2 months ago |
Pascal
|
12bbc3fa50
refactor: centralize CoT parsing in backend for streaming mode (#16394)
|
3 months ago |
Pascal
|
128d522c04
chat : support Magistral thinking (#16413)
|
3 months ago |
Piotr Wilkin (ilintar)
|
34fcc5a4ac
model : Apertus model implementation (#15852)
|
3 months ago |
crat0z
|
bd0af02fc9
common : fix reasoning before forced tool call via tool_choice = required (#16264)
|
3 months ago |
shun095
|
f432d8d83e
chat: Fix streaming parser for granite models (#15682)
|
4 months ago |
Xuan-Son Nguyen
|
4b8560ab56
chat : fix build on arm64 (#16101)
|
4 months ago |
Jesse
|
88021565f0
chat : Deepseek V3.1 reasoning and tool calling support (OpenAI Style) (#15533)
|
4 months ago |
Gabe Goodhart
|
5fac79cbc7
Thinking model disabled assistant prefill (#15404)
|
4 months ago |
ExtReMLapin
|
4fd1242bef
chat : fixed crash when Hermes 2 <tool_call> had a newline before it (#15639)
|
4 months ago |
Piotr Wilkin (ilintar)
|
b2426e469e
chat : nemotron thinking & toolcalling support (#15676)
|
4 months ago |
Piotr Wilkin (ilintar)
|
60e5eee31f
chat : Seed OSS thinking + tool call support (#15552)
|
4 months ago |
Aldehir Rojas
|
32732f2459
model : gpt-oss add response_format support (#15494)
|
4 months ago |
Daniel Bevenius
|
657b8a77bd
chat: handle gpt-oss return/end token inconsistency (#15421)
|
5 months ago |
Xuan-Son Nguyen
|
e9288e8869
chat : clarify the meaning of reasoning_format (#15408)
|
5 months ago |
Daniel Bevenius
|
5e6229a840
common : fix double bos, use common_chat_templates for add_bos and add_eos (#15326)
|
5 months ago |
Diego Devesa
|
f75b830647
chat : include kwargs in template example (#15309)
|
5 months ago |
Aldehir Rojas
|
b204a5a234
gpt-oss: implement harmony parsing (#15181)
|
5 months ago |
Xuan-Son Nguyen
|
fba5c0d680
chat : hotfix gpt-oss jinja raising an exception (#15243)
|
5 months ago |
Xuan-Son Nguyen
|
53d0a12658
server : allow specifying reasoning_format in HTTP request (#15238)
|
5 months ago |
Sachin Desai
|
3db4da56a5
chat : support Granite model reasoning and tool call (#14864)
|
5 months ago |
Georgi Gerganov
|
fd1234cb46
llama : add gpt-oss (#15091)
|
5 months ago |
Sigbjørn Skjæret
|
f324a3b715
chat : only remove double bos/eos if added (#15086)
|
5 months ago |
Jhen-Jie Hong
|
f738989dcb
chat : fix multiple tool_calls on hermes-2-pro (#14962)
|
5 months ago |
kallewoof
|
1a67fcc306
common : avoid logging partial messages (which can contain broken UTF-8 sequences) (#14937)
|
5 months ago |
matteo
|
caf5681fcb
server : support jinja extra template kwargs (Qwen3 enable_thinking feature), from command line and from client (#13196)
|
6 months ago |
Sigbjørn Skjæret
|
e434e69183
common : suggest --jinja when autodetection fails (#14222)
|
7 months ago |
Piotr
|
3cb203c89f
llama-chat : Do not throw when tool parsing fails (#14012)
|
7 months ago |
Olivier Chafik
|
c9bbc77931
`server`: update deepseek reasoning format (pass reasoning_content as diffs) (#13933)
|
7 months ago |
Georgi Gerganov
|
53f925074d
sync : vendor (#13901)
|
7 months ago |