Историја ревизија

Аутор SHA1 Порука Датум
  Daniel Bevenius 657b8a77bd chat: handle gpt-oss return/end token inconsistency (#15421) пре 5 месеци
  Xuan-Son Nguyen e9288e8869 chat : clarify the meaning of reasoning_format (#15408) пре 5 месеци
  Daniel Bevenius 5e6229a840 common : fix double bos, use common_chat_templates for add_bos and add_eos (#15326) пре 5 месеци
  Diego Devesa f75b830647 chat : include kwargs in template example (#15309) пре 5 месеци
  Aldehir Rojas b204a5a234 gpt-oss: implement harmony parsing (#15181) пре 5 месеци
  Xuan-Son Nguyen fba5c0d680 chat : hotfix gpt-oss jinja raising an exception (#15243) пре 5 месеци
  Xuan-Son Nguyen 53d0a12658 server : allow specifying reasoning_format in HTTP request (#15238) пре 5 месеци
  Sachin Desai 3db4da56a5 chat : support Granite model reasoning and tool call (#14864) пре 5 месеци
  Georgi Gerganov fd1234cb46 llama : add gpt-oss (#15091) пре 5 месеци
  Sigbjørn Skjæret f324a3b715 chat : only remove double bos/eos if added (#15086) пре 5 месеци
  Jhen-Jie Hong f738989dcb chat : fix multiple tool_calls on hermes-2-pro (#14962) пре 5 месеци
  kallewoof 1a67fcc306 common : avoid logging partial messages (which can contain broken UTF-8 sequences) (#14937) пре 5 месеци
  matteo caf5681fcb server : support jinja extra template kwargs (Qwen3 enable_thinking feature), from command line and from client (#13196) пре 6 месеци
  Sigbjørn Skjæret e434e69183 common : suggest --jinja when autodetection fails (#14222) пре 7 месеци
  Piotr 3cb203c89f llama-chat : Do not throw when tool parsing fails (#14012) пре 7 месеци
  Olivier Chafik c9bbc77931 `server`: update deepseek reasoning format (pass reasoning_content as diffs) (#13933) пре 7 месеци
  Georgi Gerganov 53f925074d sync : vendor (#13901) пре 7 месеци
  Olivier Chafik 03f582ae8f server: fix streaming crashes (#13786) пре 7 месеци
  Olivier Chafik d74e94c1b3 `server`: fix format of streamed tool call deltas (diff name, fix id location) (#13800) пре 7 месеци
  Olivier Chafik f13847cfb5 server: fix regression on streamed non-chat completion w/ stops (#13785) пре 7 месеци
  Olivier Chafik e121edc432 `server`: add `--reasoning-budget 0` to disable thinking (incl. qwen3 w/ enable_thinking:false) (#13771) пре 7 месеци
  Olivier Chafik f5cd27b71d `server`: streaming of tool calls and thoughts when `--jinja` is on (#12379) пре 7 месеци
  Olivier Chafik aa48e373f2 `server`: inject date_string in llama 3.x template + fix date for firefunction v2 (#12802) пре 8 месеци
  Xuan-Son Nguyen 8c83449cb7 server : (webui) revamp the input area, plus many small UI improvements (#13365) пре 8 месеци
  Olivier Chafik b6930ebc42 `tool-call`: fix non-tool-calling grammar crashes w/ Qwen / Hermes 2 templates (#12900) пре 9 месеци
  Olivier Chafik 4e39a3c332 `server`: extract <think> tags from qwq outputs (#12297) пре 10 месеци
  Olivier Chafik 87c2630546 allow missing content in message if tool_calls provided (#12293) пре 10 месеци
  Olivier Chafik 669912d9a5 `tool-call`: fix Qwen 2.5 Coder support, add micro benchmarks, support trigger patterns for lazy grammars (#12034) пре 10 месеци
  Olivier Chafik 63e489c025 tool-call: refactor common chat / tool-call api (+ tests / fixes) (#11900) пре 11 месеци
  Olivier Chafik f355229692 server: fix type promotion typo causing crashes w/ --jinja w/o tools (#11880) пре 11 месеци