Commit History

Author SHA1 Message Date
  Oleksandr Kuvshynov 408616adbd server : [easy] fix per round speculative decode logging (#18211) 3 weeks ago
  Aman Gupta cc0a04343e server: friendlier error msg when ctx < input (#18174) 4 weeks ago
  Pascal 6ce3d85796 server: (webui) add --webui-config (#18028) 1 month ago
  Georgi Gerganov 254098a279 common : refactor common_sampler + grammar logic changes (#17937) 1 month ago
  Xuan-Son Nguyen 6c2131773c cli: new CLI experience (#17824) 1 month ago
  Xuan-Son Nguyen 951520ddb0 server: delegate result_state creation to server_task (#17835) 1 month ago
  Xuan-Son Nguyen f896d2c34f server: improve speed of speculative decoding (#17808) 1 month ago
  Georgi Gerganov 2bc96931d2 server : make cache_reuse configurable per request (#17858) 1 month ago
  Xuan-Son Nguyen c42712b056 server: support multiple generations from one prompt (OAI "n" option) (#17775) 1 month ago
  Xuan-Son Nguyen c4c10bfb86 server: move msg diffs tracking to HTTP thread (#17740) 1 month ago
  Xuan-Son Nguyen 13628d8bdb server: add --media-path for local media files (#17697) 1 month ago
  Xuan-Son Nguyen 5d6bd842ea server: remove default "gpt-3.5-turbo" model name (#17668) 1 month ago
  Xuan-Son Nguyen ecf74a8417 mtmd: add mtmd_context_params::warmup option (#17652) 1 month ago
  Xuan-Son Nguyen ab49f094d2 server: move server-context to its own cpp|h (#17595) 1 month ago