1
0

Коммит түүх

Эзэн SHA1 Мессеж Огноо
  Georgi Gerganov 4399f13fb9 server : remove obsolete --memory-f32 option 1 жил өмнө
  Fattire 5fb1574c81 A few small fixes to server's README docs (#6428) 1 жил өмнө
  slaren 280345968d cuda : rename build flag to LLAMA_CUDA (#6299) 1 жил өмнө
  Xuan Son Nguyen ad3a0505e3 Server: clean up OAI params parsing function (#6284) 1 жил өмнө
  Pierrick Hymbert f482bb2e49 common: llama_load_model_from_url split support (#6192) 1 жил өмнө
  Pierrick Hymbert 1997577d5e server: docs: `--threads` and `--threads`, `--ubatch-size`, `--log-disable` (#6254) 1 жил өмнө
  Jan Boon be07a03217 server : update readme doc from `slot_id` to `id_slot` (#6213) 1 жил өмнө
  Pierrick Hymbert d01b3c4c32 common: llama_load_model_from_url using --model-url (#6098) 1 жил өмнө
  Jakub N 828defefb6 Update server docker image URLs (#5997) 1 жил өмнө
  Xuan Son Nguyen caa106d4e0 Server: format error to json (#5961) 1 жил өмнө
  Georgi Gerganov 97c09585d6 server : clarify some items in the readme (#5957) 1 жил өмнө
  Xuan Son Nguyen 950ba1ab84 Server: reorganize some http logic (#5939) 1 жил өмнө
  Gabe Goodhart e1fa9569ba server : add SSL support (#5926) 1 жил өмнө
  Georgi Gerganov 2002bc96bf server : refactor (#5882) 1 жил өмнө
  Pierrick Hymbert 8ef969afce server : init http requests thread pool with --parallel if set (#5836) 1 жил өмнө
  Georgi Gerganov 38d16b1426 server : remove api_like_OAI.py proxy script (#5808) 1 жил өмнө
  Pierrick Hymbert 5cb02b4a01 server: allow to override threads server pool with --threads-http (#5794) 1 жил өмнө
  Pierrick Hymbert 8b350356b2 server: docs - refresh and tease a little bit more the http server (#5718) 1 жил өмнө
  Pierrick Hymbert 930b178026 server: logs - unified format and --log-format option (#5700) 1 жил өмнө
  Pierrick Hymbert d52d7819b8 server: concurrency fix + monitoring - add /metrics prometheus compatible endpoint (#5708) 1 жил өмнө
  Pierrick Hymbert 525213d2f5 server: init functional tests (#5566) 1 жил өмнө
  Alexey Parfenov c5688c6250 server : clarify some params in the docs (#5640) 1 жил өмнө
  Xuan Son Nguyen 7c8bcc11dc Add docs for llama_chat_apply_template (#5645) 1 жил өмнө
  Pierrick Hymbert 1ecea255eb server: health: fix race condition on slots data using tasks queue (#5634) 1 жил өмнө
  Pierrick Hymbert c0a8c6db37 server : health endpoint configurable failure on no slot (#5594) 1 жил өмнө
  Robey Holderith 5ee99c32f5 common, server : surface min_keep as its own parameter (#5567) 1 жил өмнө
  Pierrick Hymbert c145f8a132 server : slots monitoring endpoint (#5550) 1 жил өмнө
  Pierrick Hymbert e75c6279d1 server : enhanced health endpoint (#5548) 1 жил өмнө
  Pierrick Hymbert 36376abe05 server : --n-predict option document and cap to max value (#5549) 1 жил өмнө
  Alexey Parfenov 6dcc02d244 server : add "samplers" param to control the samplers order (#5494) 1 жил өмнө