Aldehir Rojas
|
c05aa69f32
common : add nemotron 3 parsing (#18077)
|
1 month ago |
Aldehir Rojas
|
2fbe3b7bb7
common : add parser for ministral/mistral large 3/devstral 2 (#17713)
|
1 month ago |
Georgi Gerganov
|
190c4838bd
chat : reserve memory in compute_diffs and improve naming (#17729)
|
1 month ago |
Aldehir Rojas
|
0a8026e768
common : introduce composable PEG parser combinators for chat parsing (#17136)
|
1 month ago |
Chad Voegele
|
c4357dcc35
Server: Change Invalid Schema from Server Error (500) to User Error (400) (#17572)
|
1 month ago |
DAN™
|
03914c7ef8
common : move all common_chat_parse_* to chat-parser.cpp. (#17481)
|
1 month ago |
Xuan-Son Nguyen
|
10e9780154
chat: fix int overflow, prevent size calculation in float/double (#17357)
|
2 months ago |
hksdpc255
|
1920345c3b
common : Generalized XML-style tool-call parsing with streaming support (GLM 4.5/4.6 + MiniMax M2 + SeedOSS + Kimi-K2 + Qwen3-Coder + Apriel-1.5 + Xiaomi-MiMo) (#16932)
|
2 months ago |
Aldehir Rojas
|
87c9efc3b2
common : move gpt-oss reasoning processing to init params (#16937)
|
2 months ago |
Yuri Khrustalev
|
c053e18a66
chat: Add LFM2 tool handling (#16763)
|
2 months ago |
Pascal
|
12bbc3fa50
refactor: centralize CoT parsing in backend for streaming mode (#16394)
|
3 months ago |
Pascal
|
128d522c04
chat : support Magistral thinking (#16413)
|
3 months ago |
Piotr Wilkin (ilintar)
|
34fcc5a4ac
model : Apertus model implementation (#15852)
|
3 months ago |
crat0z
|
bd0af02fc9
common : fix reasoning before forced tool call via tool_choice = required (#16264)
|
3 months ago |
shun095
|
f432d8d83e
chat: Fix streaming parser for granite models (#15682)
|
4 months ago |
Xuan-Son Nguyen
|
4b8560ab56
chat : fix build on arm64 (#16101)
|
4 months ago |
Jesse
|
88021565f0
chat : Deepseek V3.1 reasoning and tool calling support (OpenAI Style) (#15533)
|
4 months ago |
Gabe Goodhart
|
5fac79cbc7
Thinking model disabled assistant prefill (#15404)
|
4 months ago |
ExtReMLapin
|
4fd1242bef
chat : fixed crash when Hermes 2 <tool_call> had a newline before it (#15639)
|
4 months ago |
Piotr Wilkin (ilintar)
|
b2426e469e
chat : nemotron thinking & toolcalling support (#15676)
|
4 months ago |
Piotr Wilkin (ilintar)
|
60e5eee31f
chat : Seed OSS thinking + tool call support (#15552)
|
4 months ago |
Aldehir Rojas
|
32732f2459
model : gpt-oss add response_format support (#15494)
|
4 months ago |
Daniel Bevenius
|
657b8a77bd
chat: handle gpt-oss return/end token inconsistency (#15421)
|
5 months ago |
Xuan-Son Nguyen
|
e9288e8869
chat : clarify the meaning of reasoning_format (#15408)
|
5 months ago |
Daniel Bevenius
|
5e6229a840
common : fix double bos, use common_chat_templates for add_bos and add_eos (#15326)
|
5 months ago |
Diego Devesa
|
f75b830647
chat : include kwargs in template example (#15309)
|
5 months ago |
Aldehir Rojas
|
b204a5a234
gpt-oss: implement harmony parsing (#15181)
|
5 months ago |
Xuan-Son Nguyen
|
fba5c0d680
chat : hotfix gpt-oss jinja raising an exception (#15243)
|
5 months ago |
Xuan-Son Nguyen
|
53d0a12658
server : allow specifying reasoning_format in HTTP request (#15238)
|
5 months ago |
Sachin Desai
|
3db4da56a5
chat : support Granite model reasoning and tool call (#14864)
|
5 months ago |