Charles Xu c4df49a42d kleidiai: generalize compute_forward_kv_cache to compute_forward_fp16 (#15817) 4 months ago
ggml-blas cd6983d56d ggml : fix field name when new ggml_backend (#14944) 5 months ago
ggml-cann c1c354e44c CANN: Refactor ND to NZ workspace to be per-device (#15763) 4 months ago
ggml-cpu c4df49a42d kleidiai: generalize compute_forward_kv_cache to compute_forward_fp16 (#15817) 4 months ago
ggml-cuda 5143fa895e CUDA: fastdiv, launch bounds for mmvq + q8_1 quant (#15802) 4 months ago
ggml-hip 29c8fbe4e0 HIP: bump requirement to rocm 6.1 (#15296) 5 months ago
ggml-metal 856ed0947f metal : Add template specialization for mul_mm_id w/ ne20 == 10 (#15799) 4 months ago
ggml-musa 7a6e91ad26 CUDA: replace GGML_CUDA_F16 with CUDA arch checks (#15433) 5 months ago
ggml-opencl 0a1b3982cd ggml: add ops for WAN video model (cuda && cpu) (#15669) 4 months ago
ggml-rpc e71d48e326 ggml-rpc: chunk send()/recv() to avoid EINVAL for very large tensors over RPC (macOS & others) (#15188) 5 months ago
ggml-sycl 0a1b3982cd ggml: add ops for WAN video model (cuda && cpu) (#15669) 4 months ago
ggml-vulkan 0a1b3982cd ggml: add ops for WAN video model (cuda && cpu) (#15669) 4 months ago
ggml-webgpu 77dee9de97 ggml : WebGPU add TRANSPOSE and RESHAPE to supported ops (#15695) 4 months ago
ggml-zdnn ff27f80a74 ggml: initial IBM zDNN backend (#14975) 5 months ago
CMakeLists.txt ff27f80a74 ggml: initial IBM zDNN backend (#14975) 5 months ago
ggml-alloc.c fd1234cb46 llama : add gpt-oss (#15091) 5 months ago
ggml-backend-impl.h 70680c48e5 ggml : upgrade init_tensor API to return a ggml_status (#11854) 10 months ago
ggml-backend-reg.cpp ff27f80a74 ggml: initial IBM zDNN backend (#14975) 5 months ago
ggml-backend.cpp 5d804a4938 ggml-backend: raise GGML_MAX_SPLIT_INPUTS (#15722) 4 months ago
ggml-common.h fd1234cb46 llama : add gpt-oss (#15091) 5 months ago
ggml-impl.h fd1234cb46 llama : add gpt-oss (#15091) 5 months ago
ggml-opt.cpp 5cdb27e091 finetune: SGD optimizer, more CLI args (#13873) 5 months ago
ggml-quants.c f44f793172 ggml-quants : fix make_qp_quants NANs and IQ1 assertion errors (#15379) 5 months ago
ggml-quants.h fd1234cb46 llama : add gpt-oss (#15091) 5 months ago
ggml-threading.cpp ae8de6d50a ggml : build backends as libraries (#10256) 1 year ago
ggml-threading.h cb13ef85a4 remove CMAKE_WINDOWS_EXPORT_ALL_SYMBOLS (#10797) 1 year ago
ggml.c 0a1b3982cd ggml: add ops for WAN video model (cuda && cpu) (#15669) 4 months ago
ggml.cpp fedf034a98 ggml : Print backtrace on uncaught C++ exceptions (ggml/1232) 7 months ago
gguf.cpp a81283820a gguf: gguf_writer refactor (#15691) 4 months ago