Commit History

| Author | SHA1 | Message | Date |
| --- | --- | --- | --- |
| Pascal | b1846f1c8e | webui: add rehype plugin to restore HTML in Markdown table cells (#17477) | 1 month ago |
| Jeff Bolz | d414db02d3 | vulkan: Use fewer rows for scalar FA when HS is not a multiple of 16 (#17455) | 1 month ago |
| Aaron Teo | 877566d512 | llama: introduce support for model-embedded sampling parameters (#17120) | 1 month ago |
| Jeff Bolz | 3d07caa99b | vulkan: more FA details in vk_perf_logger (#17443) | 1 month ago |
| Daniel Bevenius | 134e6940ca | llama : skip output reordering for single token batches (#17466) | 1 month ago |
| Jiacheng (Jason) Chen | 0543f928a3 | HIP: WMMA-MMQ kernels for RDNA 4 (#17156) | 1 month ago |
| Sigbjørn Skjæret | b61de2b2df | convert : allow quantizing lora again (#17453) | 1 month ago |
| Xuan-Son Nguyen | b8372eecd9 | server: split server.cpp code into server/common/task/queue (#17362) | 1 month ago |
| Daniel Bevenius | 6ab8eacddf | examples : add -kvu to batched usage example [no ci] (#17469) | 1 month ago |
| Georgi Gerganov | 2d50b9d8cb | sync : ggml | 1 month ago |
| Daniel Bevenius | 697edfeead | ggml : remove dirty flag from version string (ggml/1391) | 1 month ago |
| Alberto Cabrera Pérez | dbb852b549 | ggml-cpu: arm64: q4_K repack gemm and gemv implementations (i8mm) (#16739) | 1 month ago |
| ixgbe | 5f55c385cb | ggml: add RISC-V cpu-feats (#17461) | 1 month ago |
| william pan | 4902eebe33 | models : Added support for RND1 Diffusion Language Model (#17433) | 2 months ago |
| Max Krasnyansky | 923ae3c619 | hexagon: add support for ROPE_NEOX (#17458) | 2 months ago |
| Raul Torres | 01ad35e6d6 | CANN: Define `cann_graph_update_required` before macro (#17434) | 2 months ago |
| M. Mediouni | fcb013847c | ggml-hexagon: Initial Hexagon v68/v69 support (#17394) | 2 months ago |
| nullname | d5bc1ad110 | ggml-hexagon: add `hex_supported_buffer` for better buffer supported check (#17212) | 2 months ago |
| Pascal | 0c7220db56 | webui: minor settings reorganization and add disable autoscroll option (#17452) | 2 months ago |
| Sigbjørn Skjæret | 96ac5a2329 | cuda : support non-contiguous i32 to i32 copy (#17326) | 2 months ago |
| Eric Curtin | bc809e9c53 | vulkan: Update docker image to Ubuntu 26.04 to enable glslc features (#17439) | 2 months ago |
| Jeff Bolz | 54d83bbe85 | vulkan: remove a couple unnecessary switches (#17419) | 2 months ago |
| Adrien Gallouët | 4949ac0f18 | ci : switch to BoringSSL on Server workflow (#17441) | 2 months ago |
| Masato Nakasaka | 3f3a4fb9c3 | Revive MUL_MAT_ID to perf testing (#17397) | 2 months ago |
| yulo | 028f93ef98 | HIP: RDNA4 tensor core support for MMF (#17077) | 2 months ago |
| lhez | 8e9ddba610 | opencl: refine condition for kqv mm (#17392) | 2 months ago |
| ubergarm | 23bc779a6e | model : detect GigaChat3-10-A1.8B as deepseek lite (#17420) | 2 months ago |
| Adrien Gallouët | 28175f857d | cmake : add option to build and link BoringSSL (#17205) | 2 months ago |
| Adrien Gallouët | 9cc4080441 | ci : start using OpenSSL (#17235) | 2 months ago |
| Jeff Bolz | f1ffbba68e | vulkan: disable async for older Intel devices (#17369) | 2 months ago |