Diego Devesa 360d6533db ggml-backend : add GGML_BACKEND_DEVICE_TYPE_IGPU device type (#15797) 4 months ago
ggml-blas e68aa10d8f vulkan: sort graph to allow more parallel execution (#15850) 4 months ago
ggml-cann c0389dba43 CANN: Disable acl_graph for prefill stage (#15933) 4 months ago
ggml-cpu 24a6734daf ggml-cpu : add check for ARM MATMUL_INT8/i8mm support (#15922) 4 months ago
ggml-cuda 360d6533db ggml-backend : add GGML_BACKEND_DEVICE_TYPE_IGPU device type (#15797) 4 months ago
ggml-hip 29c8fbe4e0 HIP: bump requirement to rocm 6.1 (#15296) 5 months ago
ggml-metal 0f0a3c2851 metal : make the backend async (#15906) 4 months ago
ggml-musa 7a6e91ad26 CUDA: replace GGML_CUDA_F16 with CUDA arch checks (#15433) 5 months ago
ggml-opencl e68aa10d8f vulkan: sort graph to allow more parallel execution (#15850) 4 months ago
ggml-rpc e68aa10d8f vulkan: sort graph to allow more parallel execution (#15850) 4 months ago
ggml-sycl e68aa10d8f vulkan: sort graph to allow more parallel execution (#15850) 4 months ago
ggml-vulkan 360d6533db ggml-backend : add GGML_BACKEND_DEVICE_TYPE_IGPU device type (#15797) 4 months ago
ggml-webgpu e68aa10d8f vulkan: sort graph to allow more parallel execution (#15850) 4 months ago
ggml-zdnn e68aa10d8f vulkan: sort graph to allow more parallel execution (#15850) 4 months ago
CMakeLists.txt ff27f80a74 ggml: initial IBM zDNN backend (#14975) 5 months ago
ggml-alloc.c fd1234cb46 llama : add gpt-oss (#15091) 5 months ago
ggml-backend-impl.h 360d6533db ggml-backend : add GGML_BACKEND_DEVICE_TYPE_IGPU device type (#15797) 4 months ago
ggml-backend-reg.cpp 360d6533db ggml-backend : add GGML_BACKEND_DEVICE_TYPE_IGPU device type (#15797) 4 months ago
ggml-backend.cpp e68aa10d8f vulkan: sort graph to allow more parallel execution (#15850) 4 months ago
ggml-common.h fd1234cb46 llama : add gpt-oss (#15091) 5 months ago
ggml-impl.h fd1234cb46 llama : add gpt-oss (#15091) 5 months ago
ggml-opt.cpp 5cdb27e091 finetune: SGD optimizer, more CLI args (#13873) 5 months ago
ggml-quants.c f44f793172 ggml-quants : fix make_qp_quants NANs and IQ1 assertion errors (#15379) 5 months ago
ggml-quants.h fd1234cb46 llama : add gpt-oss (#15091) 5 months ago
ggml-threading.cpp ae8de6d50a ggml : build backends as libraries (#10256) 1 year ago
ggml-threading.h cb13ef85a4 remove CMAKE_WINDOWS_EXPORT_ALL_SYMBOLS (#10797) 1 year ago
ggml.c b0d52998b9 cuda : fix supports_op condition for get_rows when number of blocks is too large (#15868) 4 months ago
ggml.cpp fedf034a98 ggml : Print backtrace on uncaught C++ exceptions (ggml/1232) 7 months ago
gguf.cpp a81283820a gguf: gguf_writer refactor (#15691) 4 months ago