Commit History

Author SHA1 Message Date
  Xuan Son Nguyen 642330ac7c llama : add enum for built-in chat templates (#10623) 1 year ago
  Georgi Gerganov 8648c52101 make : deprecate (#10514) 1 year ago
  haopeng 64ed2091b2 server: Add "tokens per second" information in the backend (#10548) 1 year ago
  Georgi Gerganov 47f931c8f9 server : enable cache_prompt by default (#10501) 1 year ago
  Johannes Gäßler 4e54be0ec6 llama/ex: remove --logdir argument (#10339) 1 year ago
  Alexey Parfenov ff7fb670d0 server : add missing docs (#10269) 1 year ago
  Georgi Gerganov b141e5f6ef server : enable KV cache defrag by default (#10233) 1 year ago
  Xuan Son Nguyen a71d81cf8c server : revamp chat UI with vuejs and daisyui (#10175) 1 year ago
  Xuan Son Nguyen 9e0ecfb697 server : clarify /slots endpoint, add is_processing (#10162) 1 year ago
  Georgi Gerganov 8d8ff71536 llama : remove Tail-Free sampling (#10071) 1 year ago
  wwoodsTM ff252ea48e llama : add DRY sampler (#9702) 1 year ago
  Xuan Son Nguyen 958367bf53 server : refactor slot input data, move tokenizer to HTTP thread (#10023) 1 year ago
  Georgi Gerganov 8901755ba3 server : add n_indent parameter for line indentation requirement (#9929) 1 year ago
  Georgi Gerganov 223c25a72f server : improve infill context reuse (#9894) 1 year ago
  Georgi Gerganov d4c19c0f5c server : accept extra_context for the infill endpoint (#9874) 1 year ago
  Georgi Gerganov c7181bd294 server : reuse cached context chunks (#9866) 1 year ago
  Georgi Gerganov edc265661c server : add option to time limit the generation phase (#9865) 1 year ago
  Georgi Gerganov 1bde94dd02 server : remove self-extend features (#9860) 1 year ago
  Georgi Gerganov 95c76e8e92 server : remove legacy system_prompt feature (#9857) 1 year ago
  Georgi Gerganov 11ac9800af llama : improve infill support and special token detection (#9798) 1 year ago
  Xuan Son Nguyen 458367a906 server : better security control for public deployments (#9776) 1 year ago
  Daniel Kleine 133c7b46b3 Fixed RNG seed docs (#9723) 1 year ago
  Georgi Gerganov f4d2b8846a llama : add reranking support (#9510) 1 year ago
  Xuan Son Nguyen afbbfaa537 server : add more env vars, improve gen-docs (#9635) 1 year ago
  Xuan Son Nguyen 0b3bf966f4 server : add --no-context-shift option (#9607) 1 year ago
  Vinesh Janarthanan 8a308354f6 server : match OAI structured output response (#9527) 1 year ago
  Bert Wagner 8b836ae731 arg : add env variable for parallel (#9513) 1 year ago
  Georgi Gerganov 6262d13e0b common : reimplement logging (#9418) 1 year ago
  Mathijs Henquet 78203641fe server : Add option to return token pieces in /tokenize endpoint (#9108) 1 year ago
  Xuan Son Nguyen bfe76d4a17 common : move arg parser code to `arg.cpp` (#9388) 1 year ago