Commit History

Autor SHA1 Mensaxe Data
  PAB a8cbab201d ggml: add `GGML_SET` Metal kernel + i32 CPU kernel (ggml/1037) hai 1 ano
  PAB c2082d93a8 ggml : add `GGML_PAD_REFLECT_1D` operation (ggml/1034) hai 1 ano
  Jeff Bolz 2759916d86 vulkan: Implement "fast divide" (mul+shift) for unary ops like copy (#10642) hai 1 ano
  PAB efb6ae9630 feat: add `GGML_UNARY_OP_ARGMAX` Metal kernel (ggml/1019) hai 1 ano
  Georgi Gerganov 0115df2f65 metal : small-batch mat-mul kernels (#10581) hai 1 ano
  Georgi Gerganov f0678c5ff4 ggml : fix I8MM Q4_1 scaling factor conversion (#10562) hai 1 ano
  Jeff Bolz 904109ed0d vulkan: fix group_norm (#10496) hai 1 ano
  Diego Devesa 5931c1f233 ggml : add support for dynamic loading of backends (#10469) hai 1 ano
  Diego Devesa a5e47592b6 cuda : optimize argmax (#10441) hai 1 ano
  Johannes Gäßler 02e4eaf22f ggml-opt: fix data corruption (ggml/1022) hai 1 ano
  Jeff Bolz b3e585988f vulkan: Optimize soft_max (#10301) hai 1 ano
  Johannes Gäßler 8a43e940ab ggml: new optimization interface (ggml/988) hai 1 ano
  Jeff Bolz 80dd7ff22f vulkan: Optimize contiguous copies (#10254) hai 1 ano
  Georgi Gerganov 841f27abdb metal : optimize FA kernels (#10171) hai 1 ano
  Zhiyuan Li 3bcd40b3c5 Optimize RWKV6 Operator Naming and Implement Multi-core CPU/ SYCL Acceleration (#10133) hai 1 ano
  Georgi Gerganov 5c333e0140 metal : add BF16 support (#8439) hai 1 ano
  Diego Devesa 9f40989351 ggml : move CPU backend to a separate file (#10144) hai 1 ano
  Johannes Gäßler c39665f589 CUDA: fix MMQ for non-contiguous src0, add tests (#10021) hai 1 ano
  Johannes Gäßler 80273a306d CUDA: fix 1D im2col, add tests (ggml/993) hai 1 ano
  Jun Hee Yoo 4c9388fb96 metal : add POOL2D and fix IM2COL (#9943) hai 1 ano
  Diego Devesa dca1d4b58a ggml : fix BLAS with unsupported types (#9775) hai 1 ano
  Diego Devesa 6374743747 ggml : add backend registry / device interfaces to BLAS backend (#9752) hai 1 ano
  Johannes Gäßler fabdc3bda3 ggml/ex: calculate accuracy in graph, adapt MNIST (ggml/980) hai 1 ano
  Diego Devesa c83ad6d01e ggml-backend : add device and backend reg interfaces (#9707) hai 1 ano
  Johannes Gäßler e98c1c188e test: fix OPT_STEP_ADAMW for test-backend-ops (ggml/974) hai 1 ano
  Johannes Gäßler 7254cdf7e8 ggml: fix gradient allocation logic (ggml/966) hai 1 ano
  slaren 1b2f992cd2 test-backend-ops : use flops for some performance tests (#9657) hai 1 ano
  Johannes Gäßler a5b57b08ce CUDA: enable Gemma FA for HIP/Pascal (#9581) hai 1 ano
  Molly Sophia 2a63caaa69 RWKV v6: RWKV_WKV op CUDA implementation (#9454) hai 1 ano
  Johannes Gäßler 424c5d00a9 ggml/examples: add backend support for numerical optimization (ggml/949) hai 1 ano