Historique des commits

Auteur SHA1 Message Date
  Georgi Gerganov d00cbea63c server : host-memory prompt caching (#16391) il y a 3 mois
  65a 4afb0a746f server : Support multimodal completion and embeddings prompts in JSON format (#15108) il y a 4 mois
  Georgi Gerganov d2fcd91cf9 server : disable context shift by default (#15416) il y a 5 mois
  Lukas Straub a9f77a8be3 server : add openai-style logit_bias support (#14946) il y a 5 mois
  Olivier Chafik f13847cfb5 server: fix regression on streamed non-chat completion w/ stops (#13785) il y a 7 mois
  Xuan-Son Nguyen 360a9c98e1 server : fix cache_tokens bug with no cache_prompt (#13533) il y a 8 mois
  Diego Devesa 1d36b3670b llama : move end-user examples to tools directory (#13249) il y a 8 mois