Olivier Chafik
|
e121edc432
`server`: add `--reasoning-budget 0` to disable thinking (incl. qwen3 w/ enable_thinking:false) (#13771)
|
8 months ago |
Isaac McFadyen
|
6a2bc8bfb7
server : added --no-prefill-assistant flag (#13608)
|
8 months ago |
Georgi Gerganov
|
053174436f
server : passthrough the /models endpoint during loading (#13535)
|
8 months ago |
Xuan-Son Nguyen
|
3b24d26c22
server : update docs (#13432)
|
8 months ago |
Xuan-Son Nguyen
|
33eff40240
server : vision support via libmtmd (#12898)
|
8 months ago |
Diego Devesa
|
1d36b3670b
llama : move end-user examples to tools directory (#13249)
|
8 months ago |