Commit History

Author SHA1 Message Date
  Georgi Gerganov 152610eda9 server : output embeddings for all tokens when pooling = none (#10861) 1 year ago
  Georgi Gerganov 0e70ba686e server : add "tokens" output (#10853) 1 year ago
  Xuan Son Nguyen 46828872c3 server : (embeddings) using same format for "input" and "content" (#10872) 1 year ago
  krystiancha 05c3a444b8 server : fill usage info in embeddings and rerank responses (#10852) 1 year ago
  Georgi Gerganov 644fd71b44 sampling : refactor + optimize penalties sampler (#10803) 1 year ago
  Vinesh Janarthanan 5478bbcd17 server: (UI) add syntax highlighting and latex math rendering (#10808) 1 year ago
  Michelle Tan 89d604f2c8 server: Fix `has_next_line` in JSON response (#10818) 1 year ago
  cduk 56eea0781c Removes spurious \r in output that causes logging in journalctl to treat lines as binary and therefore hidden by default (#10771) 1 year ago
  Yüg a86ad841f1 server : add flag to disable the web-ui (#10762) (#10751) 1 year ago
  Xuan Son Nguyen ce8784bdb1 server : fix format_infill (#10724) 1 year ago
  Xuan Son Nguyen e52522b869 server : bring back info of final chunk in stream mode (#10722) 1 year ago
  Xuan Son Nguyen 3573fa8e7b server : (refactor) no more json in server_task input (#10691) 1 year ago
  Georgi Gerganov ce4a7b8493 server : various fixes (#10704) 1 year ago
  Georgi Gerganov c2a16c0bdb server : fix free of spec context and batch (#10651) 1 year ago
  Xuan Son Nguyen 6c5bc0625f server : (refactoring) do not rely on JSON internally (#10643) 1 year ago
  Georgi Gerganov 1da7b76569 server : fix speculative decoding with context shift (#10641) 1 year ago
  Xuan Son Nguyen 91c36c269b server : (web ui) Various improvements, now use vite as bundler (#10599) 1 year ago
  Georgi Gerganov 70b98fadbc server : fix default draft model parameters (#10586) 1 year ago
  haopeng 64ed2091b2 server: Add "tokens per second" information in the backend (#10548) 1 year ago
  alek3y 86dc11c5bc server : bind to any port when specified (#10590) 1 year ago
  Georgi Gerganov 84e1c33cde server : fix parallel speculative decoding (#10513) 1 year ago
  Georgi Gerganov 47f931c8f9 server : enable cache_prompt by default (#10501) 1 year ago
  Diego Devesa 10bce0450f llama : accept a list of devices to use to offload a model (#10497) 1 year ago
  Georgi Gerganov 9ca2e67762 server : add speculative decoding support (#10455) 1 year ago
  Georgi Gerganov d9d54e498d speculative : refactor and add a simpler example (#10362) 1 year ago
  MaggotHATE bcdb7a2386 server: (web UI) Add samplers sequence customization (#10255) 1 year ago
  Xuan Son Nguyen 9901068ac7 server : (web UI) add copy button for code block, fix api key (#10242) 1 year ago
  Jhen-Jie Hong 0e712a5acb server : fix incorrect res in validate_model_chat_template (#10272) 1 year ago
  Xuan Son Nguyen a71d81cf8c server : revamp chat UI with vuejs and daisyui (#10175) 1 year ago
  Georgi Gerganov b11f9ba9b8 server : remove hack for extra parallel slot (#10187) 1 year ago