Commit History

Author SHA1 Message Date
  Alfred ce734a8a2f ggml-hexagon: Implement true Q8_0 quantization on Hexagon NPU for more accurate mixed-precision matmul operations (#17977) 4 weeks ago
  Congcong Cai 615655aafe cmake : set `CMAKE_RUNTIME_OUTPUT_DIRECTORY` for non standalone build (ggml/1394) 1 month ago
  ixgbe 79d61896d3 ggml-cpu: add ggml_thread_cpu_relax with Zihintpause support (#17784) 1 month ago
  Vishal Singh 017761daf5 ggml-zendnn : add ZenDNN backend for AMD CPUs (#17690) 1 month ago
  Adrien Gallouët ef75a89fdb build : move _WIN32_WINNT definition to headers (#17736) 1 month ago
  Reese Levine 7ca5991d2b ggml webgpu: add support for emscripten builds (#17184) 1 month ago
  xiaobing318 e251e5ebbe cmake : add utf8 compilation options for msvc (#17682) 1 month ago
  Diego Devesa e072b2052e ggml : add GGML_SCHED_NO_REALLOC option to disable reallocations in ggml_backend_sched (#17276) 1 month ago
  Daniel Bevenius 697edfeead ggml : remove dirty flag from version string (ggml/1391) 1 month ago
  Aleksei Nikiforov 08416ebe7f ggml: disable vxe for cross-compilation by default (#16966) 2 months ago
  Max Krasnyansky 63d2fc46e1 Add experimental ggml-hexagon backend for the Hexagon NPU (#16547) 2 months ago
  Reese Levine 74b8fc17f9 ggml webgpu: profiling, CI updates, reworking of command submission (#16452) 3 months ago
  uvos e95fec640f HIP: Disable ROCWMMA fattn on CDNA when compiled against ROCWMMA 2.0.0 (#16221) 3 months ago
  Georgi Gerganov 075c01567b ggml : bump version to 0.9.4 (ggml/1363) 3 months ago
  Daniel Bevenius c9b1c06467 ggml : remove -dev suffix from release version (ggml/1355) 3 months ago
  Daniel Bevenius b6ae75afb4 ggml : bump version to 0.9.3 (ggml/1353) 3 months ago
  Georgi Gerganov b6dff20e2f ggml : prepare for development of 0.9.2-dev 3 months ago
  Georgi Gerganov 2db78c75e4 ggml : bump version to 0.9.1 3 months ago
  Adrien Gallouët b995a10760 common : use cpp-httplib as a cURL alternative for downloads (#16185) 3 months ago
  Daniel Bevenius 405921dcef ggml : introduce semantic versioning (ggml/1336) 4 months ago
  Georgi Gerganov 0320ac5264 metal : refactor + optimize v2 (#15995) 4 months ago
  Aaron Teo 186415d595 ggml-cpu: drop support for nnpa intrinsics (#15821) 4 months ago
  xctan 05c0380f2a ggml-cpu : optimize RVV kernels (#15720) 4 months ago
  Charles Xu 4d74393bcc ggml: update kleidiai to v1.13.0 (#15663) 4 months ago
  Johannes Gäßler 7a6e91ad26 CUDA: replace GGML_CUDA_F16 with CUDA arch checks (#15433) 4 months ago
  Aaron Teo ff27f80a74 ggml: initial IBM zDNN backend (#14975) 5 months ago
  uvos 7ad67ba9fe HIP: add cmake option to enable compiler output of kernel resource usage metrics (#15103) 5 months ago
  Christian Kastner 41613437ff cmake: Add GGML_BACKEND_DIR option (#15074) 5 months ago
  uvos b77d11179d HIP: add GGML_HIP_MMQ_MFMA option to allow disableing the MFMA path. (#14930) 5 months ago
  Aaron Teo c7f3169cd5 ggml-cpu : disable GGML_NNPA by default due to instability (#14880) 5 months ago