Commit history

Author SHA1 Message Date
  LostRuins Concedo 6390a998bf tts : add guide tokens support (#11186) 1 year ago
  Radoslav Gerganov 667d72846c rpc : early register backend devices (#11262) 1 year ago
  Xuan Son Nguyen 84a44815f7 cli : auto activate conversation mode if chat template is available (#11214) 1 year ago
  Xuan Son Nguyen 00b4c3da62 common : support tag-based --hf-repo like on ollama (#11195) 1 year ago
  Georgi Gerganov a3c1232c3f arg : option to exclude arguments from specific examples (#11136) 1 year ago
  Georgi Gerganov f66f582927 llama : refactor `src/llama.cpp` (#10902) 1 year ago
  Molly Sophia 0a11f8b7b5 convert : fix RWKV v6 model conversion (#10913) 1 year ago
  Georgi Gerganov 36319dec5d tts : small QoL for easy model fetch (#10903) 1 year ago
  Georgi Gerganov 0bf2d10c55 tts : add OuteTTS support (#10784) 1 year ago
  Georgi Gerganov 644fd71b44 sampling : refactor + optimize penalties sampler (#10803) 1 year ago
  Xuan Son Nguyen adffa6ffd5 common : improve -ctv -ctk CLI arguments (#10806) 1 year ago
  Xuan Son Nguyen 9fdb124304 common : add missing env var for speculative (#10801) 1 year ago
  Bartowski ae4b922614 imatrix : Add imatrix to --no-context-shift (#10766) 1 year ago
  Yüg a86ad841f1 server : add flag to disable the web-ui (#10762) (#10751) 1 year ago
  Xuan Son Nguyen f162d45a21 common : bring back --no-warmup to server (#10686) 1 year ago
  Xuan Son Nguyen 642330ac7c llama : add enum for built-in chat templates (#10623) 1 year ago
  Johannes Gäßler 890719311b common: fix warning message when no GPU found (#10564) 1 year ago
  Xuan Son Nguyen 9f912511bc common : fix duplicated file name with hf_repo and hf_file (#10550) 1 year ago
  Diego Devesa 10bce0450f llama : accept a list of devices to use to offload a model (#10497) 1 year ago
  Georgi Gerganov d9d54e498d speculative : refactor and add a simpler example (#10362) 1 year ago
  Johannes Gäßler 4e54be0ec6 llama/ex: remove --logdir argument (#10339) 1 year ago
  Georgi Gerganov 8d8ff71536 llama : remove Tail-Free sampling (#10071) 1 year ago
  wwoodsTM ff252ea48e llama : add DRY sampler (#9702) 1 year ago
  Michael Podvitskiy d80fb71f8b llama: string_split fix (#10022) 1 year ago
  Daniel Bevenius 674804a996 arg : fix typo in embeddings argument help [no ci] (#9994) 1 year ago
  Daniel Bevenius 94008cc760 arg : fix attention non-causal arg value hint (#9985) 1 year ago
  MaggotHATE fbc98b748e sampling : add XTC sampler (#9742) 1 year ago
  Georgi Gerganov c7181bd294 server : reuse cached context chunks (#9866) 1 year ago
  Georgi Gerganov 1bde94dd02 server : remove self-extend features (#9860) 1 year ago
  Georgi Gerganov 95c76e8e92 server : remove legacy system_prompt feature (#9857) 1 year ago