Commit History

Autor SHA1 Mensaxe Data
  Alexey Parfenov a803333a4e common : use enums for sampler types (#5418) hai 1 ano
  Georgi Gerganov 139b62a839 common : fix compile warning hai 1 ano
  Johannes Gäßler 26d4efd11e sampling: fix top_k <= 0 (#5388) hai 1 ano
  Michael Klimenko 35a2ee9143 Remove unused data and add fixes (#5154) %!s(int64=2) %!d(string=hai) anos
  l3utterfly 5eaf9964fc llama : dynamic temperature sampling (#4972) %!s(int64=2) %!d(string=hai) anos
  David Friehs 4483396751 llama : apply classifier-free guidance to logits directly (#4951) %!s(int64=2) %!d(string=hai) anos
  Alexey Parfenov 6123979952 server : allow to specify custom prompt for penalty calculation (#3727) %!s(int64=2) %!d(string=hai) anos
  kalomaze b9ec82d262 grammar : check the full vocab only if necessary (opt) (#4306) %!s(int64=2) %!d(string=hai) anos
  Georgi Gerganov caa9249217 common : fix compile warning %!s(int64=2) %!d(string=hai) anos
  MaggotHATE 52c8bc3cf3 sampling : custom samplers order (#4285) %!s(int64=2) %!d(string=hai) anos
  l3utterfly e75dfdd31b sampling : null grammar field after reset (#3885) %!s(int64=2) %!d(string=hai) anos
  kalomaze 238657db23 samplers : Min-P sampler implementation [alternative to Top P/Top K] (#3841) %!s(int64=2) %!d(string=hai) anos
  Georgi Gerganov ee1a0ec9cb llama : add option for greedy sampling with probs (#3813) %!s(int64=2) %!d(string=hai) anos
  Marcus Dunn 5be6c803fa llama : remove token functions with `context` args in favor of `model` (#3720) %!s(int64=2) %!d(string=hai) anos
  Georgi Gerganov d1031cf49c sampling : refactor init to use llama_sampling_params (#3696) %!s(int64=2) %!d(string=hai) anos
  Georgi Gerganov 0e89203b51 speculative : add tree-based sampling example (#3624) %!s(int64=2) %!d(string=hai) anos
  Kerfuffle 70c29da118 common : fix mirostat state when using multiple sequences (#3543) %!s(int64=2) %!d(string=hai) anos