Commit History

Author SHA1 Message Date
  Oleksandr Kuvshynov 408616adbd server : [easy] fix per round speculative decode logging (#18211) 3 weeks ago
  Aman Gupta cc0a04343e server: friendlier error msg when ctx < input (#18174) 4 weeks ago
  Pascal 6ce3d85796 server: (webui) add --webui-config (#18028) 1 month ago
  Georgi Gerganov 254098a279 common : refactor common_sampler + grammar logic changes (#17937) 1 month ago
  Xuan-Son Nguyen 6c2131773c cli: new CLI experience (#17824) 1 month ago
  Xuan-Son Nguyen 951520ddb0 server: delegate result_state creation to server_task (#17835) 1 month ago
  Xuan-Son Nguyen f896d2c34f server: improve speed of speculative decoding (#17808) 1 month ago
  Georgi Gerganov 2bc96931d2 server : make cache_reuse configurable per request (#17858) 1 month ago
  Xuan-Son Nguyen c42712b056 server: support multiple generations from one prompt (OAI "n" option) (#17775) 1 month ago
  Xuan-Son Nguyen c4c10bfb86 server: move msg diffs tracking to HTTP thread (#17740) 1 month ago
  Xuan-Son Nguyen 13628d8bdb server: add --media-path for local media files (#17697) 1 month ago
  Xuan-Son Nguyen 5d6bd842ea server: remove default "gpt-3.5-turbo" model name (#17668) 1 month ago
  Xuan-Son Nguyen ecf74a8417 mtmd: add mtmd_context_params::warmup option (#17652) 1 month ago
  Xuan-Son Nguyen ab49f094d2 server: move server-context to its own cpp|h (#17595) 1 month ago