Commit History

Author SHA1 Message Date
  Xuan Son Nguyen 227d7c5a7f server : (UI) fix missing async generator on safari (#10857) 1 year ago
  Eve 7b1ec53f56 vulkan: bugfixes for small subgroup size systems + llvmpipe test (#10809) 1 year ago
  Zhiyuan Li 160bc039c8 rwkv6: add wkv6 support for Vulkan backend (#10829) 1 year ago
  Georgi Gerganov 08ea539df2 unicode : improve naming style (#10838) 1 year ago
  Georgi Gerganov 644fd71b44 sampling : refactor + optimize penalties sampler (#10803) 1 year ago
  Bartowski 4ddd199f6f llava : Allow locally downloaded models for QwenVL (#10833) 1 year ago
  Valentin Mamedov a0974156f3 llama : add Deepseek MoE v1 & GigaChat models (#10827) 1 year ago
  Georgi Gerganov 87cf323cef scripts : change build path to "build-bench" for compare-commits.sh (#10836) 1 year ago
  Vinesh Janarthanan 5478bbcd17 server: (UI) add syntax highlighting and latex math rendering (#10808) 1 year ago
  Georgi Gerganov b5ae1ddff9 gguf-py : bump to v0.13.0 1 year ago
  Michelle Tan 89d604f2c8 server: Fix `has_next_line` in JSON response (#10818) 1 year ago
  Evgeny Kurnevsky e52aba537a nix: allow to override rocm gpu targets (#10794) 1 year ago
  HimariO ba1cb19cdd llama : add Qwen2VL support + multimodal RoPE (#10361) 1 year ago
  cduk 56eea0781c Removes spurious \r in output that causes logging in journalctl to treat lines as binary and therefore hidden by default (#10771) 1 year ago
  lhez a76c56fa1a Introducing experimental OpenCL backend with support for Qualcomm Adreno GPUs (#10693) 1 year ago
  Eric Curtin c27ac678dd Opt class for positional argument handling (#10508) 1 year ago
  Corentin REGAL 11e07fd63b fix: graceful shutdown for Docker images (#10815) 1 year ago
  Jett Janiak 4601a8bb67 gguf-py : numpy 2 newbyteorder fix (#9772) 1 year ago
  谢乃闻 9f35e44592 Fix crash caused by ggml_backend_load_all when launching on Android Activity (#10812) 1 year ago
  Eve 64ae065511 vulkan: small mul_mat_vec optimizations (#10665) 1 year ago
  Akarshan Biswas 83ed24a97b SYCL: Reduce most of the compiler warnings (#10748) 1 year ago
  Karol Kontny d583cd03f6 ggml : Fix compilation issues on ARM platform when building without fp16 (#10811) 1 year ago
  Xuan Son Nguyen adffa6ffd5 common : improve -ctv -ctk CLI arguments (#10806) 1 year ago
  Xuan Son Nguyen 274ec65af6 contrib : add ngxson as codeowner (#10804) 1 year ago
  a3sh 8faa1d4dd4 CUDA: faster non-contiguous concat (#10760) 1 year ago
  Diego Devesa cb13ef85a4 remove CMAKE_WINDOWS_EXPORT_ALL_SYMBOLS (#10797) 1 year ago
  0cc4m 4064c0e3b6 Vulkan: Use improved q4_k and q5_k dequant code in dequant shaders (#10798) 1 year ago
  0cc4m dc5301d565 Vulkan: Add VK_EXT_subgroup_size_control support to ensure full subgroups for coopmats (#10721) 1 year ago
  Xuan Son Nguyen 9fdb124304 common : add missing env var for speculative (#10801) 1 year ago
  CentricStorm 5555c0c1f6 docs: update server streaming mode documentation (#9519) 1 year ago