Georgi Gerganov
|
4399f13fb9
server : remove obsolete --memory-f32 option
|
1 жил өмнө |
Fattire
|
5fb1574c81
A few small fixes to server's README docs (#6428)
|
1 жил өмнө |
slaren
|
280345968d
cuda : rename build flag to LLAMA_CUDA (#6299)
|
1 жил өмнө |
Xuan Son Nguyen
|
ad3a0505e3
Server: clean up OAI params parsing function (#6284)
|
1 жил өмнө |
Pierrick Hymbert
|
f482bb2e49
common: llama_load_model_from_url split support (#6192)
|
1 жил өмнө |
Pierrick Hymbert
|
1997577d5e
server: docs: `--threads` and `--threads`, `--ubatch-size`, `--log-disable` (#6254)
|
1 жил өмнө |
Jan Boon
|
be07a03217
server : update readme doc from `slot_id` to `id_slot` (#6213)
|
1 жил өмнө |
Pierrick Hymbert
|
d01b3c4c32
common: llama_load_model_from_url using --model-url (#6098)
|
1 жил өмнө |
Jakub N
|
828defefb6
Update server docker image URLs (#5997)
|
1 жил өмнө |
Xuan Son Nguyen
|
caa106d4e0
Server: format error to json (#5961)
|
1 жил өмнө |
Georgi Gerganov
|
97c09585d6
server : clarify some items in the readme (#5957)
|
1 жил өмнө |
Xuan Son Nguyen
|
950ba1ab84
Server: reorganize some http logic (#5939)
|
1 жил өмнө |
Gabe Goodhart
|
e1fa9569ba
server : add SSL support (#5926)
|
1 жил өмнө |
Georgi Gerganov
|
2002bc96bf
server : refactor (#5882)
|
1 жил өмнө |
Pierrick Hymbert
|
8ef969afce
server : init http requests thread pool with --parallel if set (#5836)
|
1 жил өмнө |
Georgi Gerganov
|
38d16b1426
server : remove api_like_OAI.py proxy script (#5808)
|
1 жил өмнө |
Pierrick Hymbert
|
5cb02b4a01
server: allow to override threads server pool with --threads-http (#5794)
|
1 жил өмнө |
Pierrick Hymbert
|
8b350356b2
server: docs - refresh and tease a little bit more the http server (#5718)
|
1 жил өмнө |
Pierrick Hymbert
|
930b178026
server: logs - unified format and --log-format option (#5700)
|
1 жил өмнө |
Pierrick Hymbert
|
d52d7819b8
server: concurrency fix + monitoring - add /metrics prometheus compatible endpoint (#5708)
|
1 жил өмнө |
Pierrick Hymbert
|
525213d2f5
server: init functional tests (#5566)
|
1 жил өмнө |
Alexey Parfenov
|
c5688c6250
server : clarify some params in the docs (#5640)
|
1 жил өмнө |
Xuan Son Nguyen
|
7c8bcc11dc
Add docs for llama_chat_apply_template (#5645)
|
1 жил өмнө |
Pierrick Hymbert
|
1ecea255eb
server: health: fix race condition on slots data using tasks queue (#5634)
|
1 жил өмнө |
Pierrick Hymbert
|
c0a8c6db37
server : health endpoint configurable failure on no slot (#5594)
|
1 жил өмнө |
Robey Holderith
|
5ee99c32f5
common, server : surface min_keep as its own parameter (#5567)
|
1 жил өмнө |
Pierrick Hymbert
|
c145f8a132
server : slots monitoring endpoint (#5550)
|
1 жил өмнө |
Pierrick Hymbert
|
e75c6279d1
server : enhanced health endpoint (#5548)
|
1 жил өмнө |
Pierrick Hymbert
|
36376abe05
server : --n-predict option document and cap to max value (#5549)
|
1 жил өмнө |
Alexey Parfenov
|
6dcc02d244
server : add "samplers" param to control the samplers order (#5494)
|
1 жил өмнө |