Commit History

Autor SHA1 Mensaxe Data
  Georgi Gerganov 36319dec5d tts : small QoL for easy model fetch (#10903) hai 1 ano
  Xuan Son Nguyen 57bb2c40cd server : fix logprobs, make it OAI-compatible (#10783) hai 1 ano
  Adrien Gallouët a3c33b1dce ggml: fix arm build with gcc (#10895) hai 1 ano
  Sukriti Sharma 2fffc52b50 llama : fix Roberta embeddings (#10856) hai 1 ano
  fairydreaming 7585edbdeb convert : Add support for Microsoft Phi-4 model (#10817) hai 1 ano
  Johannes Gäßler cd920d0ac3 tests: disable GGUF test for bad value size (#10886) hai 1 ano
  Eric Curtin 7909e8588d llama-run : improve progress bar (#10821) hai 1 ano
  Diego Devesa 9177484f58 ggml : fix arm build (#10890) hai 1 ano
  Georgi Gerganov 0bf2d10c55 tts : add OuteTTS support (#10784) hai 1 ano
  Gaetan Bisson 7bbb5acf12 server: avoid overwriting Authorization header (#10878) hai 1 ano
  Georgi Gerganov 152610eda9 server : output embeddings for all tokens when pooling = none (#10861) hai 1 ano
  Georgi Gerganov 0e70ba686e server : add "tokens" output (#10853) hai 1 ano
  Xuan Son Nguyen 46828872c3 server : (embeddings) using same format for "input" and "content" (#10872) hai 1 ano
  redbeard 6b064c92b4 docs: Fix HIP (née hipBLAS) in README (#10880) hai 1 ano
  Diego Devesa 4da69d1abd Revert "llama : add Falcon3 support (#10864)" (#10876) hai 1 ano
  DAN™ d62b532c52 Use model->gguf_kv for loading the template instead of using the C API. (#10868) hai 1 ano
  Johannes Gäßler 081b29bd2a tests: add tests for GGUF (#10830) hai 1 ano
  Georgi Gerganov 5437d4aaf5 sync : ggml hai 1 ano
  Georgi Gerganov 78f766768d cmake : fix "amd64" processor string (whisper/2638) hai 1 ano
  gn64 8dd19a4812 vulkan : fix soft_max.comp division by zero (whisper/2633) hai 1 ano
  Daniel Bevenius 130d0c90bd ggml : remove return from ggml_gallocr_allocate_node (ggml/1048) hai 1 ano
  Daniel Bevenius 3919da8e33 ggml : add check for grad_accs (ggml/1046) hai 1 ano
  Georgi Gerganov 0006f5a74a ggml : update ggml_backend_cpu_device_supports_op (#10867) hai 1 ano
  krystiancha 05c3a444b8 server : fill usage info in embeddings and rerank responses (#10852) hai 1 ano
  Billel Mokeddem 382bc7f2e8 llama : add Falcon3 support (#10864) hai 1 ano
  Ruan 4f51968aca readme : update typos (#10863) hai 1 ano
  Xuan Son Nguyen 227d7c5a7f server : (UI) fix missing async generator on safari (#10857) hai 1 ano
  Eve 7b1ec53f56 vulkan: bugfixes for small subgroup size systems + llvmpipe test (#10809) hai 1 ano
  Zhiyuan Li 160bc039c8 rwkv6: add wkv6 support for Vulkan backend (#10829) hai 1 ano
  Georgi Gerganov 08ea539df2 unicode : improve naming style (#10838) hai 1 ano