Commit History

Автор SHA1 Съобщение Дата
  Gabe Goodhart fd621880f3 aLoRA Support (#15327) преди 4 месеца
  Gabe Goodhart 5fac79cbc7 Thinking model disabled assistant prefill (#15404) преди 4 месеца
  65a 4afb0a746f server : Support multimodal completion and embeddings prompts in JSON format (#15108) преди 4 месеца
  Johannes Gäßler 494c5899cb scripts: benchmark for HTTP server throughput (#14668) преди 6 месеца
  Sigbjørn Skjæret ddef99522d server : fix assistant prefilling when content is an array (#14360) преди 6 месеца
  matteo caf5681fcb server : support jinja extra template kwargs (Qwen3 enable_thinking feature), from command line and from client (#13196) преди 6 месеца
  Sigbjørn Skjæret 88fc854b4b llama : improve sep token handling (#14272) преди 7 месеца
  Georgi Gerganov 53f925074d sync : vendor (#13901) преди 7 месеца
  Xuan-Son Nguyen 10961339b2 mtmd : move helpers to dedicated library (⚠️ breaking change) (#13866) преди 7 месеца
  Đinh Trọng Huy e0e3aa231d llama : add support for BertForSequenceClassification reranker (#13858) преди 7 месеца
  Sky c962ae3382 server: fix remove 'image_url'/'input_audio' json-object effectlly for 'llama_params' in multimodal-model-mode (#13853) преди 7 месеца
  Olivier Chafik 03f582ae8f server: fix streaming crashes (#13786) преди 7 месеца
  Olivier Chafik e121edc432 `server`: add `--reasoning-budget 0` to disable thinking (incl. qwen3 w/ enable_thinking:false) (#13771) преди 7 месеца
  Olivier Chafik d785f9c1fd server: fix/test add_generation_prompt (#13770) преди 7 месеца
  Olivier Chafik f5cd27b71d `server`: streaming of tool calls and thoughts when `--jinja` is on (#12379) преди 7 месеца
  Xuan-Son Nguyen 9ecf3e66a3 server : support audio input (#13714) преди 7 месеца
  Xuan-Son Nguyen 797990c4bc mtmd : add ultravox audio input (#13623) преди 7 месеца
  Isaac McFadyen 6a2bc8bfb7 server : added --no-prefill-assistant flag (#13608) преди 8 месеца
  Piotr Wilkin (ilintar) c753d7bed0 server : proper error handling for missing elements in messages array (OpenAI compatible backend) (#13540) преди 8 месеца
  Xuan-Son Nguyen 360a9c98e1 server : fix cache_tokens bug with no cache_prompt (#13533) преди 8 месеца
  Anudit Nagar 91159ee9df server : allow content to be null in oaicompat_completion_params_parse (#13477) преди 8 месеца
  Xuan-Son Nguyen 33eff40240 server : vision support via libmtmd (#12898) преди 8 месеца
  Diego Devesa 1d36b3670b llama : move end-user examples to tools directory (#13249) преди 8 месеца