Commit History

Author SHA1 Message Date
  Piotr Wilkin (ilintar) 389ac78b26 ggml : add ops SOFTPLUS, EXPM1, TRI, SOLVE_TRI, CUMSUM (#17063) 2 months ago
  Jeff Bolz 10fcc41290 vulkan: Update topk_moe fusion to handle gpt's late softmax (#16656) 3 months ago
  Aman Gupta 4926419c4d ggml: add ggml_can_fuse_subgraph (#16662) 3 months ago
  Jeff Bolz e56abd2098 vulkan: Implement topk_moe fused shader, ported from CUDA (#16641) 3 months ago
  Piotr Wilkin (ilintar) 34fcc5a4ac model : Apertus model implementation (#15852) 4 months ago
  Acly f2a789e334 ggml : split graph allocations according to backend max buffer size (#15815) 4 months ago
  Georgi Gerganov 4f324a556c ggml : extend ggml_can_fuse to work with non-sequential nodes (#16123) 4 months ago
  Georgi Gerganov a71ae3ba7a ggml : add ggml_op_is_empty (#16122) 4 months ago
  Georgi Gerganov fd1234cb46 llama : add gpt-oss (#15091) 6 months ago
  Georgi Gerganov bf9087f59a metal : fuse add, mul + add tests (#14596) 6 months ago
  Jeff Bolz bd9c981d72 vulkan: Add fusion support for RMS_NORM+MUL (#14366) 7 months ago
  Aaron Teo 60ef23d6c1 ggml-cpu: enable IBM NNPA Vector Intrinsics (#14317) 7 months ago
  Daniel Tang fedf034a98 ggml : Print backtrace on uncaught C++ exceptions (ggml/1232) 8 months ago
  xctan 05f6ac6283 ggml : riscv: add xtheadvector support (#13720) 8 months ago
  cmdr2 cb79c2e7fa ggml: don't include arm_neon.h when using CUDA 12 with ARM Neon (ggml/1187) 10 months ago
  Diego Devesa 459895c326 ggml : add more generic custom op, remove deprecated custom ops (ggml/1183) 10 months ago
  Piotr Kubaj 2391506ace ggml-impl.h: fix build on POWER9 (#12855) 10 months ago
  Georgi Gerganov ff067dbcb9 ggml : simplify Arm fp16 CPU logic (ggml/1177) 10 months ago
  xctan 24feaec057 ggml : riscv: add 128-bit RVV support (#12530) 10 months ago
  Bodhi 0b3863ff95 MUSA: support ARM64 and enable dp4a .etc (#11843) 11 months ago
  Johannes Gäßler 53ff6b9b9f GGUF: C++ refactor, backend support, misc fixes (#11030) 1 year ago
  Johannes Gäßler 081b29bd2a tests: add tests for GGUF (#10830) 1 year ago
  Diego Devesa cb13ef85a4 remove CMAKE_WINDOWS_EXPORT_ALL_SYMBOLS (#10797) 1 year ago
  Frankie Robertson cd2f37b304 Avoid using __fp16 on ARM with old nvcc (#10616) 1 year ago
  Diego Devesa 7cc2d2c889 ggml : move AMX to the CPU backend (#10570) 1 year ago
  Frankie Robertson 9150f8fef9 Do not include arm_neon.h when compiling CUDA code (ggml/1028) 1 year ago
  Johannes Gäßler 02e4eaf22f ggml-opt: fix data corruption (ggml/1022) 1 year ago
  Johannes Gäßler 8a43e940ab ggml: new optimization interface (ggml/988) 1 year ago
  Diego Devesa ae8de6d50a ggml : build backends as libraries (#10256) 1 year ago
  Diego Devesa 9f40989351 ggml : move CPU backend to a separate file (#10144) 1 year ago