Commit History

Author SHA1 Message Date
  Pascal 12bbc3fa50 refactor: centralize CoT parsing in backend for streaming mode (#16394) 3 months ago
  Pascal 128d522c04 chat : support Magistral thinking (#16413) 3 months ago
  Piotr Wilkin (ilintar) 34fcc5a4ac model : Apertus model implementation (#15852) 3 months ago
  crat0z bd0af02fc9 common : fix reasoning before forced tool call via tool_choice = required (#16264) 3 months ago
  shun095 f432d8d83e chat: Fix streaming parser for granite models (#15682) 4 months ago
  Xuan-Son Nguyen 4b8560ab56 chat : fix build on arm64 (#16101) 4 months ago
  Jesse 88021565f0 chat : Deepseek V3.1 reasoning and tool calling support (OpenAI Style) (#15533) 4 months ago
  Gabe Goodhart 5fac79cbc7 Thinking model disabled assistant prefill (#15404) 4 months ago
  ExtReMLapin 4fd1242bef chat : fixed crash when Hermes 2 <tool_call> had a newline before it (#15639) 4 months ago
  Piotr Wilkin (ilintar) b2426e469e chat : nemotron thinking & toolcalling support (#15676) 4 months ago
  Piotr Wilkin (ilintar) 60e5eee31f chat : Seed OSS thinking + tool call support (#15552) 4 months ago
  Aldehir Rojas 32732f2459 model : gpt-oss add response_format support (#15494) 4 months ago
  Daniel Bevenius 657b8a77bd chat: handle gpt-oss return/end token inconsistency (#15421) 5 months ago
  Xuan-Son Nguyen e9288e8869 chat : clarify the meaning of reasoning_format (#15408) 5 months ago
  Daniel Bevenius 5e6229a840 common : fix double bos, use common_chat_templates for add_bos and add_eos (#15326) 5 months ago
  Diego Devesa f75b830647 chat : include kwargs in template example (#15309) 5 months ago
  Aldehir Rojas b204a5a234 gpt-oss: implement harmony parsing (#15181) 5 months ago
  Xuan-Son Nguyen fba5c0d680 chat : hotfix gpt-oss jinja raising an exception (#15243) 5 months ago
  Xuan-Son Nguyen 53d0a12658 server : allow specifying reasoning_format in HTTP request (#15238) 5 months ago
  Sachin Desai 3db4da56a5 chat : support Granite model reasoning and tool call (#14864) 5 months ago
  Georgi Gerganov fd1234cb46 llama : add gpt-oss (#15091) 5 months ago
  Sigbjørn Skjæret f324a3b715 chat : only remove double bos/eos if added (#15086) 5 months ago
  Jhen-Jie Hong f738989dcb chat : fix multiple tool_calls on hermes-2-pro (#14962) 5 months ago
  kallewoof 1a67fcc306 common : avoid logging partial messages (which can contain broken UTF-8 sequences) (#14937) 5 months ago
  matteo caf5681fcb server : support jinja extra template kwargs (Qwen3 enable_thinking feature), from command line and from client (#13196) 6 months ago
  Sigbjørn Skjæret e434e69183 common : suggest --jinja when autodetection fails (#14222) 7 months ago
  Piotr 3cb203c89f llama-chat : Do not throw when tool parsing fails (#14012) 7 months ago
  Olivier Chafik c9bbc77931 `server`: update deepseek reasoning format (pass reasoning_content as diffs) (#13933) 7 months ago
  Georgi Gerganov 53f925074d sync : vendor (#13901) 7 months ago
  Olivier Chafik 03f582ae8f server: fix streaming crashes (#13786) 7 months ago