Commit History

Автор SHA1 Съобщение Дата
  Jeff Bolz c6c5e85979 vulkan: support solve_tri with larger N/K values (#17781) преди 1 месец
  Jeff Bolz a0f3897d53 vulkan: fix top_k bug when there are ties in the input (#17659) преди 1 месец
  Acly e15cd06a94 vulkan : support conv-2d with large output size (#17685) преди 1 месец
  Piotr Wilkin (ilintar) 96fe9badfc Add support for CUMSUM and TRI for CUDA. (#17584) преди 1 месец
  Reese Levine 7ca5991d2b ggml webgpu: add support for emscripten builds (#17184) преди 1 месец
  Tarek Dakhran 2ba719519d model: LFM2-VL fixes (#17577) преди 1 месец
  Jeff Bolz 59d8d4e963 vulkan: improve topk perf for large k, fix overflow in unit tests (#17582) преди 1 месец
  Piotr Wilkin (ilintar) cd0e3a7a3b SOLVE_TRI CUDA kernel for small matrices (#17457) преди 1 месец
  Jeff Bolz 879d673759 vulkan: Implement top-k (#17418) преди 1 месец
  Georgi Gerganov 583cb83416 ggml : add ggml_top_k (#17365) преди 1 месец
  Jeff Bolz d414db02d3 vulkan: Use fewer rows for scalar FA when HS is not a multiple of 16 (#17455) преди 1 месец
  Sigbjørn Skjæret 96ac5a2329 cuda : support non-contiguous i32 to i32 copy (#17326) преди 2 месеца
  Masato Nakasaka 3f3a4fb9c3 Revive MUL_MAT_ID to perf testing (#17397) преди 2 месеца
  Giuseppe Scrivano 7d77f07325 vulkan: implement ADD1, ARANGE, FILL, SOFTPLUS, STEP, ROUND, CEIL, FLOOR, TRUNC (#17319) преди 2 месеца
  Jeff Bolz 1fa4551af0 vulkan: support larger argsort (#17313) преди 2 месеца
  Piotr Wilkin (ilintar) 6fd4f95367 Fix too relaxed check on CUDA "fast copy" (can_be_transposed) condition (#17332) преди 2 месеца
  Georgi Gerganov 1a139644a8 metal : add cumsum (#17305) преди 2 месеца
  Jeff Bolz 24dc769f1b vulkan: Fuse mul_mat_id+add_id+mul and mul_mat+add+add. (#17287) преди 2 месеца
  Georgi Gerganov 45c6ef7307 metal : support argsort for ne00 > 1024 (#17247) преди 2 месеца
  Piotr Wilkin (ilintar) 389ac78b26 ggml : add ops SOFTPLUS, EXPM1, TRI, SOLVE_TRI, CUMSUM (#17063) преди 2 месеца
  Diego Devesa 879dec341a ggml-cpu : use template for argsort (#17222) преди 2 месеца
  duduta 73460f6278 ggml-cpu: templateify ggml_compute_forward_rope_f32 and _f16 (#16805) преди 2 месеца
  Acly 1032256ec9 cuda/vulkan : bicubic interpolation (#17022) преди 2 месеца
  Ruben Ortlam 8a3519b708 vulkan: fix mmq out of bounds reads (#17108) преди 2 месеца
  Jeff Bolz 80a6cf6347 vulkan: fuse mul_mat_id + mul (#17095) преди 2 месеца
  Aman Gupta 64fe17fbb8 Revert "CUDA: add expert reduce kernel (#16857)" (#17100) преди 2 месеца
  Aman Gupta c1b187688d CUDA: skip fusion for repeating adds in bias (#17080) преди 2 месеца
  Jeff Bolz b4e335d8dc vulkan: fuse rms_norm + mul + rope (+ view + set_rows) (#16977) преди 2 месеца
  bssrdf 299f5d782c CUDA: properly handle nb00=nb02 case for cpy (#17081) преди 2 месеца
  Johannes Gäßler aa374175c3 CUDA: fix crash on uneven context without FA (#16988) преди 2 месеца