Commit History

Author SHA1 Message Date
  Sigbjørn Skjæret 3ecb2f671a ggml : implement set_rows with i32 index (#16159) 4 months ago
  Shin-myoung-serp 96fdca043b Vulkan: add conv_transpose_2d operation (#16022) 4 months ago
  Jeff Bolz a20d810d79 vulkan: add RTE variants of exp shader (#16165) 4 months ago
  Giuseppe Scrivano 1eeb523c3e vulkan: optimize UMA buffer operations and fix driver hangs (#16059) 4 months ago
  Jeff Bolz 5bb4a3edec vulkan: fix validation error about VK_PIPELINE_CREATE_CAPTURE_STATISTICS_BIT_KHR (#16086) 4 months ago
  Jeff Bolz c0b45097c3 rename optimize_graph to graph_optimize (#16082) 4 months ago
  Eve cb5bb6cc05 vulkan: automatically remove unsupported devices (#15976) 4 months ago
  Ruben Ortlam 261e6a20ff Vulkan: Clean up mul_mm shader (#15987) 4 months ago
  Jeff Bolz b9c9c9f789 vulkan: initialize vulkan-hpp to allow using extension function pointers (#15705) 4 months ago
  Ruben Ortlam 304ac5693d Vulkan iGPU device selection overhaul and PCI ID API support (#15947) 4 months ago
  Mathieu Baudier 6c88ad8fa7 vulkan: Make device memory check more portable (#15939) 4 months ago
  Diego Devesa 360d6533db ggml-backend : add GGML_BACKEND_DEVICE_TYPE_IGPU device type (#15797) 4 months ago
  Ruben Ortlam ae355f6f71 vulkan: throw the oom error instead of no memory type found (#15905) 4 months ago
  Jeff Bolz 4f63cd705c vulkan: Fix OOB accesses in soft_max_back (#15861) 4 months ago
  lksj92hs ed54e32558 Workaround for subgroup arithmetic failing on MoltenVK with AMD GPUs (issue 15846) (#15886) 4 months ago
  Jeff Bolz e68aa10d8f vulkan: sort graph to allow more parallel execution (#15850) 4 months ago
  Xuan-Son Nguyen 9fcb29f22f ggml: allow casting between f32 and i32 (#15783) 4 months ago
  Jeff Bolz 3976dfbe00 vulkan: support im2col_3d (#15795) 4 months ago
  Jeff Bolz c97b5e5854 vulkan: Support pad_ext (#15794) 4 months ago
  leejet 0a1b3982cd ggml: add ops for WAN video model (cuda && cpu) (#15669) 4 months ago
  Ruben Ortlam dff7551bfd vulkan: fix mmv subgroup16 selection (#15775) 4 months ago
  Jeff Bolz 0fce7a1248 vulkan: don't use std::string in load_shaders, to improve compile time (#15724) 4 months ago
  Daniel Bevenius 8227695d7a vulkan : update ggml_vk_instance_validation_ext_available (#15666) 4 months ago
  Shin-myoung-serp 0014fb4add ggml vulkan: add hardsigmoid and hardswish operations (#15762) 4 months ago
  Jeff Bolz 25f1045f07 vulkan: Fix macro parameter order for f32 matmul shaders (#15716) 4 months ago
  Gilad S. d4d8dbe383 vulkan: use memory budget extension to read memory usage (#15545) 4 months ago
  Ruben Ortlam fec7911f8f vulkan: disable large mmv subgroups on older Nvidia GPUs (#15717) 4 months ago
  Ruben Ortlam 02c1813517 Vulkan: Add Integer Dot Product mul_mat_vec shader for legacy quants (#14903) 5 months ago
  Jeff Bolz bbbf5ecccb vulkan: handle large sizes for get_rows (#15686) 5 months ago
  Jeff Bolz c37052ab4d vulkan: mul_mat_id coopmat2 optimizations (#15546) 5 months ago