Commit History

Author SHA1 Message Commit Date
  shaofeiqi 4db5641210 opencl: add kernel to handle mat mul in attention to improve encoding speed (#17181) 2 months ago
  shani-f 72bd7321a7 sycl : unify unary kernels with a generic implementation and enable wide operator support (#17213) 2 months ago
  Aleksander Grygier 22e1ce2f81 webui: Fix clickability around chat processing statistics UI (#17278) 2 months ago
  Pascal 1411d9275a webui: add OAI-Compat Harmony tool-call streaming visualization and persistence in chat UI (#16618) 2 months ago
  Sigbjørn Skjæret 662192e1dc convert : remove unnecessary chat template patching (#17289) 2 months ago
  Jeff Bolz 24dc769f1b vulkan: Fuse mul_mat_id+add_id+mul and mul_mat+add+add. (#17287) 2 months ago
  Ruben Ortlam 4dca015b7e vulkan: Replace 16-bit unpack8 calls to work around legacy Windows AMD driver bug (#17285) 2 months ago
  Sigbjørn Skjæret 9a8860cf5d convert : use all parts in safetensors index (#17286) 2 months ago
  Sigbjørn Skjæret 9d3ef4809f convert : set expert gating func in base class (#17279) 2 months ago
  Ankur Verma c7b7db0445 mtmd-cli: Avoid logging to stdout for model loading messages in mtmd-cli (#17277) 2 months ago
  Giuseppe Scrivano 1568d13c2c vulkan: implement ABS and NEG (#17245) 2 months ago
  Jeff Bolz 439342ea0b vulkan: Use ggml_vk_tensor_subbuffer in mul_mat_vec(id) paths (#17244) 2 months ago
  Jeff Bolz 234ae7d7bd vulkan: skip all-negative-inf blocks in FA (#17186) 2 months ago
  Jeff Bolz 38eaf32af1 vulkan: change graph_compute to be async and enable get_tensor_async (#17158) 2 months ago
  Xuan-Son Nguyen 9b17d74ab7 mtmd: add mtmd_log_set (#17268) 2 months ago
  Bartowski e1fcf8b09b model : add AfmoeForCausalLM support (#16477) 2 months ago
  Marek Hradil jr. 6cd0cf72ce fix : Dangling pointer for non-empty trigger words in lazy grammar construction (#17048) 2 months ago
  Georgi Gerganov d396b43748 server : fix "can batch with" bug (#17263) 2 months ago
  Georgi Gerganov 45c6ef7307 metal : support argsort for ne00 > 1024 (#17247) 2 months ago
  Georgi Gerganov 2606b0adab metal : make the FA extra sizes consistent (#17143) 2 months ago
  ixgbe 307772fcda readme : add RVV,ZVFH,ZFH,ZICBOP support for RISC-V (#17259) 2 months ago
  Aleksander Grygier f1bad23f88 Better UX for handling multiple attachments in WebUI (#17246) 2 months ago
  Alberto Cabrera Pérez becc4816dd ggml-cpu: handle 3d tensors in repack mat_mul (#17241) 2 months ago
  Xuan-Son Nguyen c4abcb2457 server: fixing naming conflict res_error (#17243) 2 months ago
  Piotr Wilkin (ilintar) 389ac78b26 ggml : add ops SOFTPLUS, EXPM1, TRI, SOLVE_TRI, CUMSUM (#17063) 2 months ago
  Ruben Ortlam a19bd6f7ce vulkan: remove shell call from vulkan-shaders-gen tool, revert file check (#17219) 2 months ago
  Diego Devesa dd091e52f8 sched : fix reserve ignoring user tensor assignments (#17232) 2 months ago
  ixgbe 1215dde7b0 ggml-cpu : add RISC-V vector intrinsic support for silu and cvar operations (#17227) 2 months ago
  bagheera 0cfb19166b metal: accelerated conv2d (#17175) 2 months ago
  Georgi Gerganov 2776db6c81 Revert "ggml-cpu: handle 3d tensors in repack mat_mul (#17030)" (#17233) 2 months ago