Historique des commits

Auteur SHA1 Message Date
  sasha0552 42cadc74bd server : fix slot selection by lru (#10126) il y a 1 an
  Georgi Gerganov 45950415ed server : fix endpoint checks (#10135) il y a 1 an
  sasha0552 d865d1478c server : fix smart selection of available slot (#10120) il y a 1 an
  Kevin Gibbons 0a683e8088 server : include scheme when printing URL (#10106) il y a 1 an
  Georgi Gerganov 8d8ff71536 llama : remove Tail-Free sampling (#10071) il y a 1 an
  Georgi Gerganov 8125e6cbfc server : don't overfill the batch during infill (#10018) il y a 1 an
  wwoodsTM ff252ea48e llama : add DRY sampler (#9702) il y a 1 an
  Michael Podvitskiy d80fb71f8b llama: string_split fix (#10022) il y a 1 an
  Georgi Gerganov bc5ba007b2 server : check that the prompt fits in the slot's context (#10030) il y a 1 an
  Xuan Son Nguyen 958367bf53 server : refactor slot input data, move tokenizer to HTTP thread (#10023) il y a 1 an
  wwoodsTM 0a1c750c80 server : samplers accept the prompt correctly (#10019) il y a 1 an
  Xuan Son Nguyen cda0e4b648 llama : remove all_pos_0, all_pos_1, all_seq_id from llama_batch (#9745) il y a 1 an
  Georgi Gerganov 8901755ba3 server : add n_indent parameter for line indentation requirement (#9929) il y a 1 an
  Alexey Parfenov 1f66b699c4 server : fix the disappearance of the end of the text (#9867) il y a 1 an
  Georgi Gerganov 223c25a72f server : improve infill context reuse (#9894) il y a 1 an
  MaggotHATE fbc98b748e sampling : add XTC sampler (#9742) il y a 1 an
  Georgi Gerganov d4c19c0f5c server : accept extra_context for the infill endpoint (#9874) il y a 1 an
  Georgi Gerganov c7181bd294 server : reuse cached context chunks (#9866) il y a 1 an
  Georgi Gerganov edc265661c server : add option to time limit the generation phase (#9865) il y a 1 an
  Georgi Gerganov 1bde94dd02 server : remove self-extend features (#9860) il y a 1 an
  Georgi Gerganov 95c76e8e92 server : remove legacy system_prompt feature (#9857) il y a 1 an
  Georgi Gerganov 11ac9800af llama : improve infill support and special token detection (#9798) il y a 1 an
  Diego Devesa 7eee341bee common : use common_ prefix for common library functions (#9805) il y a 1 an
  Xuan Son Nguyen 458367a906 server : better security control for public deployments (#9776) il y a 1 an
  Georgi Gerganov 8c475b97b8 rerank : use [SEP] token instead of [BOS] (#9737) il y a 1 an
  Georgi Gerganov f4d2b8846a llama : add reranking support (#9510) il y a 1 an
  Xuan Son Nguyen afbbfaa537 server : add more env vars, improve gen-docs (#9635) il y a 1 an
  StrangeBytesDev 0aa15011e3 server : add newline after chat example (#9616) il y a 1 an
  Xuan Son Nguyen 0b3bf966f4 server : add --no-context-shift option (#9607) il y a 1 an
  Georgi Gerganov 6026da52d6 server : clean-up completed tasks from waiting list (#9531) il y a 1 an