Commit History

Author SHA1 Message Date
  Ruben Ortlam 4dca015b7e vulkan: Replace 16-bit unpack8 calls to work around legacy Windows AMD driver bug (#17285) 2 months ago
  Sigbjørn Skjæret 9a8860cf5d convert : use all parts in safetensors index (#17286) 2 months ago
  Sigbjørn Skjæret 9d3ef4809f convert : set expert gating func in base class (#17279) 2 months ago
  Ankur Verma c7b7db0445 mtmd-cli: Avoid logging to stdout for model loading messages in mtmd-cli (#17277) 2 months ago
  Giuseppe Scrivano 1568d13c2c vulkan: implement ABS and NEG (#17245) 2 months ago
  Jeff Bolz 439342ea0b vulkan: Use ggml_vk_tensor_subbuffer in mul_mat_vec(id) paths (#17244) 2 months ago
  Jeff Bolz 234ae7d7bd vulkan: skip all-negative-inf blocks in FA (#17186) 2 months ago
  Jeff Bolz 38eaf32af1 vulkan: change graph_compute to be async and enable get_tensor_async (#17158) 2 months ago
  Xuan-Son Nguyen 9b17d74ab7 mtmd: add mtmd_log_set (#17268) 2 months ago
  Bartowski e1fcf8b09b model : add AfmoeForCausalLM support (#16477) 2 months ago
  Marek Hradil jr. 6cd0cf72ce fix : Dangling pointer for non-empty trigger words in lazy grammar construction (#17048) 2 months ago
  Georgi Gerganov d396b43748 server : fix "can batch with" bug (#17263) 2 months ago
  Georgi Gerganov 45c6ef7307 metal : support argsort for ne00 > 1024 (#17247) 2 months ago
  Georgi Gerganov 2606b0adab metal : make the FA extra sizes consistent (#17143) 2 months ago
  ixgbe 307772fcda readme : add RVV,ZVFH,ZFH,ZICBOP support for RISC-V (#17259) 2 months ago
  Aleksander Grygier f1bad23f88 Better UX for handling multiple attachments in WebUI (#17246) 2 months ago
  Alberto Cabrera Pérez becc4816dd ggml-cpu: handle 3d tensors in repack mat_mul (#17241) 2 months ago
  Xuan-Son Nguyen c4abcb2457 server: fixing naming conflict res_error (#17243) 2 months ago
  Piotr Wilkin (ilintar) 389ac78b26 ggml : add ops SOFTPLUS, EXPM1, TRI, SOLVE_TRI, CUMSUM (#17063) 2 months ago
  Ruben Ortlam a19bd6f7ce vulkan: remove shell call from vulkan-shaders-gen tool, revert file check (#17219) 2 months ago
  Diego Devesa dd091e52f8 sched : fix reserve ignoring user tensor assignments (#17232) 2 months ago
  ixgbe 1215dde7b0 ggml-cpu : add RISC-V vector intrinsic support for silu and cvar operations (#17227) 2 months ago
  bagheera 0cfb19166b metal: accelerated conv2d (#17175) 2 months ago
  Georgi Gerganov 2776db6c81 Revert "ggml-cpu: handle 3d tensors in repack mat_mul (#17030)" (#17233) 2 months ago
  Diego Devesa 879dec341a ggml-cpu : use template for argsort (#17222) 2 months ago
  TecJesh 97d5117217 CANN: Add cross_entropy_loss op support (#16886) 2 months ago
  Aman Gupta a90eb94ca9 CUDA: fuse rope + set_rows (#16884) 2 months ago
  Neo Zhang Jianyu 07751f8d44 update SYCL support OPs (#17208) 2 months ago
  o7si ffb6f3d921 vocab : correct bounds check for UGM XCDA array access (#17215) 2 months ago
  Johannes Gäßler 5d6838b74f CUDA: static assert to prevent misuse of memcpy_1 (#17198) 2 months ago