Commit History

Author SHA1 Message Date
  lhez c5023daf60 opencl: support imrope (#16914) 3 months ago
  Acly 10640e31aa ggml : fix interpolate with align-corners and ne=1 (#16700) 3 months ago
  lhez 6ea37f5739 opencl: fix warnings and clean up profiling (#16688) 3 months ago
  Shawn Gu 81387858f1 opencl: transposed gemm/gemv moe kernel with mxfp4,f32 (#16602) 3 months ago
  lhez 0cb7a0683b opencl: add q8_0 mm support (#16469) 3 months ago
  Aman Gupta 120bf7046d CUDA + openCL: fix bug in accessing rms_norm->src while doing fusion (#16577) 3 months ago
  lhez 5016b72862 opencl: fix build targeting CL 2 (#16554) 4 months ago
  lhez 7c156df414 opencl: support pad_ext (#15888) 4 months ago
  lhez d1c84a662d opencl: support ne3 in get_rows (#15866) 4 months ago
  Sigbjørn Skjæret 3ecb2f671a ggml : implement set_rows with i32 index (#16159) 4 months ago
  lhez 51f5a45fbe opencl: fix concat crash on win arm64 with Adreno (#15944) 4 months ago
  lhez c4510dc937 opencl: initial `q8_0` mv support (#15732) 4 months ago
  Shawn Gu 3edd87cd05 opencl: optimize mxfp4 kernels (#16037) 4 months ago
  Jeff Bolz c0b45097c3 rename optimize_graph to graph_optimize (#16082) 4 months ago
  Jeff Bolz e68aa10d8f vulkan: sort graph to allow more parallel execution (#15850) 5 months ago
  leejet 0a1b3982cd ggml: add ops for WAN video model (cuda && cpu) (#15669) 5 months ago
  rmatif 820bc98531 opencl: add hs=40 to FA (#15758) 5 months ago
  rmatif 97669e4073 opencl: add attn sinks support for FA kernels (#15706) 5 months ago
  rmatif 86076f92de OpenCL: add fused group_norm/norm, mul, add (#15314) 5 months ago
  lhez f7207b0415 opencl: fix support ops condition for `rms_norm` (#15560) 5 months ago
  lhez fb22dd07a6 opencl: mark `argsort` unsupported if cols exceed workgroup limit (#15375) 5 months ago
  rmatif 912ff8c119 OpenCL: add initial FA support (#14987) 5 months ago
  lhez e2c1bfff53 opencl: add initial mxfp4 support via mv (#15270) 5 months ago
  rmatif 60a7658810 opencl: allow mixed f16/f32 `add` (#15140) 6 months ago
  AN Long cd6983d56d ggml : fix field name when new ggml_backend (#14944) 6 months ago
  lhez aaa3d07ae7 opencl: support sink in `soft_max` (attn sinks) (#15152) 6 months ago
  rmatif 756cfea826 fix profiling crash (#15072) 6 months ago
  lhez e725a1a982 opencl: add `swiglu_oai` and `add_id` (#15121) 6 months ago
  Georgi Gerganov fd1234cb46 llama : add gpt-oss (#15091) 6 months ago
  lhez 5c0eb5ef54 opencl: fix adreno compiler detection logic (#15029) 6 months ago