Historique des commits

Auteur SHA1 Message Date
  Xuan Son Nguyen afbbfaa537 server : add more env vars, improve gen-docs (#9635) il y a 1 an
  StrangeBytesDev 0aa15011e3 server : add newline after chat example (#9616) il y a 1 an
  Xuan Son Nguyen 0b3bf966f4 server : add --no-context-shift option (#9607) il y a 1 an
  Georgi Gerganov 6026da52d6 server : clean-up completed tasks from waiting list (#9531) il y a 1 an
  Eric Zhang f799155ab8 server : fix OpenSSL build (remove obsolete `LOG_INFO`) (#9529) il y a 1 an
  Georgi Gerganov 6262d13e0b common : reimplement logging (#9418) il y a 1 an
  VoidIsVoid dcdcee3a74 server: add data: [DONE] to /chat/completions stream response (#9459) il y a 1 an
  Xuan Son Nguyen feff4aa846 server : add loading html page while model is loading (#9468) il y a 1 an
  Mathijs Henquet 78203641fe server : Add option to return token pieces in /tokenize endpoint (#9108) il y a 1 an
  slaren 49006c67b4 llama : move random seed generation to the samplers (#9398) il y a 1 an
  Xuan Son Nguyen bfe76d4a17 common : move arg parser code to `arg.cpp` (#9388) il y a 1 an
  slaren 5fb5e24811 llama : minor sampling refactor (2) (#9386) il y a 1 an
  Xuan Son Nguyen 1b9ae5189c common : refactor arg parser (#9308) il y a 1 an
  Georgi Gerganov df270ef745 llama : refactor sampling v2 (#9294) il y a 1 an
  Xuan Son Nguyen 9b2c24c099 server : simplify state machine for slot (#9283) il y a 1 an
  Xuan Son Nguyen 4a1411b4f1 server : fix missing lock (#9334) il y a 1 an
  Xuan Son Nguyen 6e7d133a5f server : refactor multitask handling (#9274) il y a 1 an
  Faisal Zaghloul 42c76d1358 Threadpool: take 2 (#8672) il y a 1 an
  Jan Boon 9f7d4bcf5c server : fix crash when error handler dumps invalid utf-8 json (#9195) il y a 1 an
  Xuan Son Nguyen fc54ef0d1c server : support reading arguments from environment variables (#9105) il y a 1 an
  Xuan Son Nguyen 8b3befc0e2 server : refactor middleware and /health endpoint (#9056) il y a 1 an
  Riceball LEE 37501d9c79 server : fix duplicated n_predict key in the generation_settings (#8994) il y a 1 an
  Zhenwei Jin 4af8420afb common : remove duplicate function llama_should_add_bos_token (#8778) il y a 1 an
  Jiří Podivín 234b30676a server : init stop and error fields of the result struct (#9026) il y a 1 an
  compilade 98a532d474 server : fix segfault on long system prompt (#8987) il y a 1 an
  Georgi Gerganov 5ef07e25ac server : handle models with missing EOS token (#8997) il y a 1 an
  Mathieu Geli daef3ab233 server : add one level list nesting for embeddings (#8936) il y a 1 an
  Xuan Son Nguyen 1e6f6554aa server : add lora hotswap endpoint (WIP) (#8857) il y a 1 an
  Liu Jia 0a4ce78681 common : Changed tuple to struct (TODO fix) (#8823) il y a 1 an
  ardfork 978ba3d83d Server: Don't ignore llama.cpp params (#8754) il y a 1 an