Commit history

Author SHA1 Message Date
  Reza Kakhki 9ba399dfa7 server : add support for "encoding_format": "base64" to the */embeddings endpoints (#10967) 1 year ago
  Djip007 2cd43f4900 ggml : more perfo with llamafile tinyblas on x86_64 (#10714) 1 year ago
  NeverLucky 09fe2e7613 server: allow filtering llama server response fields (#10940) 1 year ago
  Georgi Gerganov 30caac3a68 llama : the WPM vocabs use the CLS token as BOS (#10930) 1 year ago
  Diego Devesa 60cfa728e2 ggml : use wstring for backend search paths (#10960) 1 year ago
  Diego Devesa 3327bb0f8d ggml : fix arm enabled features check (#10961) 1 year ago
  Diego Devesa 32d6ee6385 ggml : fix const usage in SSE path (#10962) 1 year ago
  Xuan Son Nguyen 14b699ecde server : fix missing model id in /model endpoint (#10957) 1 year ago
  Xuan Son Nguyen 485dc01214 server : add system_fingerprint to chat/completion (#10917) 1 year ago
  Radoslav Gerganov 86bf31cfe6 rpc-server : add support for the SYCL backend (#10934) 1 year ago
  Yun Dou b92a14a841 llama : support InfiniAI Megrez 3b (#10893) 1 year ago
  ymcki 6f0c9e034b llama : support for Llama-3_1-Nemotron-51B (#10669) 1 year ago
  Eric Curtin dab76c92cc llama-run : include temperature option (#10899) 1 year ago
  yuri@FreeBSD 7024d59e6a ggml : fix run-time on FreeBSD in get_executable_path() (#10948) 1 year ago
  Rudi Servo 7c0e285858 devops : add docker-multi-stage builds (#10832) 1 year ago
  Billel Mokeddem 7ae33a616f llama : add Falcon3 support (#10883) 1 year ago
  Jeff Bolz ebdee9478c vulkan: build fixes for 32b (#10927) 1 year ago
  Georgi Gerganov 5cd85b5e00 convert : add BertForMaskedLM (#10919) 1 year ago
  Jeff Bolz a91a41364b vulkan: optimize coopmat2 dequant functions (#10855) 1 year ago
  Adrien Gallouët e34c5af43f ggml-cpu: replace NEON asm with intrinsics in ggml_gemv_q4_0_4x8_q8_0() (#10874) 1 year ago
  Akarshan Biswas eb5c3dc64b SYCL: Migrate away from deprecated ggml_tensor->backend (#10840) 1 year ago
  Xuan Son Nguyen 0ca416c91a server : (UI) fix copy to clipboard function (#10916) 1 year ago
  Diego Devesa 21ae3b9be8 ggml : add test for SVE and disable when it fails (#10906) 1 year ago
  Molly Sophia 0a11f8b7b5 convert : fix RWKV v6 model conversion (#10913) 1 year ago
  Georgi Gerganov d408bb9268 clip : disable GPU support (#10896) 1 year ago
  Georgi Gerganov 5cab3e4aaa llama : minor grammar refactor (#10897) 1 year ago
  Georgi Gerganov 36319dec5d tts : small QoL for easy model fetch (#10903) 1 year ago
  Xuan Son Nguyen 57bb2c40cd server : fix logprobs, make it OAI-compatible (#10783) 1 year ago
  Adrien Gallouët a3c33b1dce ggml: fix arm build with gcc (#10895) 1 year ago
  Sukriti Sharma 2fffc52b50 llama : fix Roberta embeddings (#10856) 1 year ago