Commit History

Author SHA1 Message Date
  Xuan Son Nguyen caa106d4e0 Server: format error to json (#5961) 1 year ago
  Minsoo Cheong 332bdfd798 server : maintain chat completion id for streaming responses (#5988) 1 year ago
  Georgi Gerganov 2002bc96bf server : refactor (#5882) 1 year ago
  Pierrick Hymbert 9731134296 server: tests: passkey challenge / self-extend with context shift demo (#5832) 1 year ago
  Xuan Son Nguyen 052051d8ae Server: normalize naming (#5779) 1 year ago
  Pierrick Hymbert 930b178026 server: logs - unified format and --log-format option (#5700) 1 year ago
  Pierrick Hymbert d52d7819b8 server: concurrency fix + monitoring - add /metrics prometheus compatible endpoint (#5708) 1 year ago
  Pierrick Hymbert 1ecea255eb server: health: fix race condition on slots data using tasks queue (#5634) 1 year ago
  Xuan Son Nguyen 9c405c9f9a Server: use llama_chat_apply_template (#5593) 1 year ago
  Daniel Hiltgen 66c1968f7a server : graceful server shutdown (#5244) 1 year ago
  Xuan Son Nguyen 907e08c110 server : add llama2 chat template (#5425) 1 year ago
  Georgi Gerganov 753eafed0e sync : ggml 2 years ago
  Xuan Son Nguyen 48c857aa10 server : refactored the task processing logic (#5065) 2 years ago