Histórico de Commits

Autor SHA1 Mensagem Data
  Georgi Gerganov f5f8812f7c server : use different seeds for child completions (#18700) há 3 semanas atrás
  Tarek Dakhran 73d284a250 model : add LFM2-ColBert-350M (#18607) há 3 semanas atrás
  Daniel Bevenius d3dce4e0a5 sampling : add support for backend sampling (#17004) há 4 semanas atrás
  Georgi Gerganov 2a85f720b8 server : handle closed connection for tasks (#18459) há 1 mês atrás
  o7si 4893cc07bb server : fix crash when seq_rm fails for hybrid/recurrent models (#18391) há 1 mês atrás
  Xuan-Son Nguyen 5ee4e43f26 server: return_progress to also report 0% processing state (#18305) há 1 mês atrás
  Xuan-Son Nguyen 849d021104 server: fix crash with model not having BOS/EOS (#18321) há 1 mês atrás
  Xuan-Son Nguyen 6ce863c803 server: prevent data race from HTTP threads (#18263) há 1 mês atrás
  Xuan-Son Nguyen ddcb75dd8a server: add auto-sleep after N seconds of idle (#18228) há 1 mês atrás
  Oleksandr Kuvshynov 408616adbd server : [easy] fix per round speculative decode logging (#18211) há 1 mês atrás
  Aman Gupta cc0a04343e server: friendlier error msg when ctx < input (#18174) há 1 mês atrás
  Pascal 6ce3d85796 server: (webui) add --webui-config (#18028) há 1 mês atrás
  Georgi Gerganov 254098a279 common : refactor common_sampler + grammar logic changes (#17937) há 1 mês atrás
  Xuan-Son Nguyen 6c2131773c cli: new CLI experience (#17824) há 1 mês atrás
  Xuan-Son Nguyen 951520ddb0 server: delegate result_state creation to server_task (#17835) há 1 mês atrás
  Xuan-Son Nguyen f896d2c34f server: improve speed of speculative decoding (#17808) há 1 mês atrás
  Georgi Gerganov 2bc96931d2 server : make cache_reuse configurable per request (#17858) há 1 mês atrás
  Xuan-Son Nguyen c42712b056 server: support multiple generations from one prompt (OAI "n" option) (#17775) há 1 mês atrás
  Xuan-Son Nguyen c4c10bfb86 server: move msg diffs tracking to HTTP thread (#17740) há 1 mês atrás
  Xuan-Son Nguyen 13628d8bdb server: add --media-path for local media files (#17697) há 2 meses atrás
  Xuan-Son Nguyen 5d6bd842ea server: remove default "gpt-3.5-turbo" model name (#17668) há 2 meses atrás
  Xuan-Son Nguyen ecf74a8417 mtmd: add mtmd_context_params::warmup option (#17652) há 2 meses atrás
  Xuan-Son Nguyen ab49f094d2 server: move server-context to its own cpp|h (#17595) há 2 meses atrás