Commit History

Autor SHA1 Mensaxe Data
  Georgi Gerganov 196f5083ef common : more accurate sampling timing (#17382) hai 1 mes
  Marek Hradil jr. 6cd0cf72ce fix : Dangling pointer for non-empty trigger words in lazy grammar construction (#17048) hai 2 meses
  Georgi Gerganov 81086cd6a3 vocab : mark EOT token for Granite models (#16499) hai 3 meses
  Georgi Gerganov cdedb70a99 sampling : optimize dist sampler (#15704) hai 4 meses
  Georgi Gerganov e92d53b29e sampling : optimize samplers by reusing bucket sort (#15665) hai 4 meses
  Georgi Gerganov f9cd68398b sampling : make sure samplers return at least 1 token (#13822) hai 7 meses
  DocShotgun ffc727203a sampling : make top_n_sigma no-op at <=0 or a single candidate (#13345) hai 8 meses
  oobabooga 91a86a6f35 sampling : don't consider -infinity values in top_n_sigma (#13344) hai 8 meses
  oobabooga 233461f812 sampling : Integrate Top-nσ into main sampling chain (and add it to the server) (#13264) hai 8 meses
  Georgi Gerganov d9d398f84f sampling : when top-k <= 0 -> noop (#13173) hai 8 meses
  Johannes Gäßler dd373dd3bf llama: fix error on bad grammar (#12628) hai 9 meses
  Olivier Chafik 669912d9a5 `tool-call`: fix Qwen 2.5 Coder support, add micro benchmarks, support trigger patterns for lazy grammars (#12034) hai 10 meses
  Vinesh Janarthanan 27e8a23300 sampling: add Top-nσ sampler (#11223) hai 11 meses
  Christian Fillion 7ee953a64a llama : add llama_sampler_init for safe usage of llama_sampler_free (#11727) hai 11 meses
  Olivier Chafik 8b576b6c55 Tool call support (generic + native for Llama, Functionary, Hermes, Mistral, Firefunction, DeepSeek) w/ lazy grammars (#9639) hai 11 meses
  Georgi Gerganov afa8a9ec9b llama : add `llama_vocab`, functions -> methods, naming (#11110) hai 1 ano
  Georgi Gerganov 727368c60f llama : use LLAMA_TOKEN_NULL (#11062) hai 1 ano
  Georgi Gerganov f66f582927 llama : refactor `src/llama.cpp` (#10902) hai 1 ano
  Georgi Gerganov 644fd71b44 sampling : refactor + optimize penalties sampler (#10803) hai 1 ano
  wwoodsTM 5107e8cea3 DRY: Fixes clone functionality (#10192) hai 1 ano
  Georgi Gerganov 8d8ff71536 llama : remove Tail-Free sampling (#10071) hai 1 ano
  wwoodsTM ff252ea48e llama : add DRY sampler (#9702) hai 1 ano
  Georgi Gerganov 55e47786e3 llama : default sampling changes + greedy update (#9897) hai 1 ano
  Georgi Gerganov 99bd4ac28c llama : infill sampling handle very long tokens (#9924) hai 1 ano
  Georgi Gerganov 755a9b2bf0 llama : add infill sampler (#9896) hai 1 ano
  MaggotHATE fbc98b748e sampling : add XTC sampler (#9742) hai 1 ano
  Georgi Gerganov b0f27361f3 sampling : avoid expensive softmax during greedy sampling (#9605) hai 1 ano
  Daniel Bevenius 6443ddd985 llama : use reserve/emplace_back in sampler_sample (#9534) hai 1 ano
  Georgi Gerganov 0abc6a2c25 llama : llama_perf + option to disable timings during decode (#9355) hai 1 ano
  Gilad S. bd35cb0ae3 feat: remove a sampler from a chain (#9445) hai 1 ano