Commit History

| Author | SHA1 | Message | Date |
|---|---|---|---|
| Daniel Bevenius | b7552cfcbc | common : add default embeddings presets (#11677) | 11 months ago |
| Radoslav Gerganov | 1bef571f6a | arg : list RPC devices first when using --list-devices (#11655) | 11 months ago |
| Daniel Bevenius | b636228c0a | embedding : enable --no-warmup option (#11475) | 11 months ago |
| Olivier Chafik | 6171c9d258 | Add Jinja template support (#11016) | 1 year ago |
| Georgi Gerganov | 80d0d6b4b7 | common : add -hfd option for the draft model (#11318) | 1 year ago |
| LostRuins Concedo | 6390a998bf | tts : add guide tokens support (#11186) | 1 year ago |
| Radoslav Gerganov | 667d72846c | rpc : early register backend devices (#11262) | 1 year ago |
| Xuan Son Nguyen | 84a44815f7 | cli : auto activate conversation mode if chat template is available (#11214) | 1 year ago |
| Xuan Son Nguyen | 00b4c3da62 | common : support tag-based --hf-repo like on ollama (#11195) | 1 year ago |
| Georgi Gerganov | a3c1232c3f | arg : option to exclude arguments from specific examples (#11136) | 1 year ago |
| Georgi Gerganov | f66f582927 | llama : refactor `src/llama.cpp` (#10902) | 1 year ago |
| Molly Sophia | 0a11f8b7b5 | convert : fix RWKV v6 model conversion (#10913) | 1 year ago |
| Georgi Gerganov | 36319dec5d | tts : small QoL for easy model fetch (#10903) | 1 year ago |
| Georgi Gerganov | 0bf2d10c55 | tts : add OuteTTS support (#10784) | 1 year ago |
| Georgi Gerganov | 644fd71b44 | sampling : refactor + optimize penalties sampler (#10803) | 1 year ago |
| Xuan Son Nguyen | adffa6ffd5 | common : improve -ctv -ctk CLI arguments (#10806) | 1 year ago |
| Xuan Son Nguyen | 9fdb124304 | common : add missing env var for speculative (#10801) | 1 year ago |
| Bartowski | ae4b922614 | imatrix : Add imatrix to --no-context-shift (#10766) | 1 year ago |
| Yüg | a86ad841f1 | server : add flag to disable the web-ui (#10762) (#10751) | 1 year ago |
| Xuan Son Nguyen | f162d45a21 | common : bring back --no-warmup to server (#10686) | 1 year ago |
| Xuan Son Nguyen | 642330ac7c | llama : add enum for built-in chat templates (#10623) | 1 year ago |
| Johannes Gäßler | 890719311b | common: fix warning message when no GPU found (#10564) | 1 year ago |
| Xuan Son Nguyen | 9f912511bc | common : fix duplicated file name with hf_repo and hf_file (#10550) | 1 year ago |
| Diego Devesa | 10bce0450f | llama : accept a list of devices to use to offload a model (#10497) | 1 year ago |
| Georgi Gerganov | d9d54e498d | speculative : refactor and add a simpler example (#10362) | 1 year ago |
| Johannes Gäßler | 4e54be0ec6 | llama/ex: remove --logdir argument (#10339) | 1 year ago |
| Georgi Gerganov | 8d8ff71536 | llama : remove Tail-Free sampling (#10071) | 1 year ago |
| wwoodsTM | ff252ea48e | llama : add DRY sampler (#9702) | 1 year ago |
| Michael Podvitskiy | d80fb71f8b | llama: string_split fix (#10022) | 1 year ago |
| Daniel Bevenius | 674804a996 | arg : fix typo in embeddings argument help [no ci] (#9994) | 1 year ago |