Georgi Gerganov 649495c9d9 metal : add FA head size 48 (#17619) 1 month ago
..
ggml-blas 4d3d455d3c sync : whisper.cpp (ggml/1359) 3 months ago
ggml-cann 2ba719519d model: LFM2-VL fixes (#17577) 1 month ago
ggml-cpu 2ba719519d model: LFM2-VL fixes (#17577) 1 month ago
ggml-cuda 6eea666912 llama-graph: avoid expand_forward for fusion (#17633) 1 month ago
ggml-hexagon 923ae3c619 hexagon: add support for ROPE_NEOX (#17458) 2 months ago
ggml-hip 80d28f104c HIP: fix AMDGPU_TARGETS, update documentation (#16803) 2 months ago
ggml-metal 649495c9d9 metal : add FA head size 48 (#17619) 1 month ago
ggml-musa 11f0af5504 CUDA: faster tile FA, add oob checks, more HSs (#16492) 3 months ago
ggml-opencl 2ba719519d model: LFM2-VL fixes (#17577) 1 month ago
ggml-rpc 15d2b46b4d rpc : cache and reuse compute graphs (#15405) 1 month ago
ggml-sycl 2ba719519d model: LFM2-VL fixes (#17577) 1 month ago
ggml-vulkan 2ba719519d model: LFM2-VL fixes (#17577) 1 month ago
ggml-webgpu 647b960bd8 ggml webgpu: faster matrix multiplication/matrix-vector multiplication (#17031) 2 months ago
ggml-zdnn 264f1b5187 zdnn: refactor codebase + add docs (#16178) 4 months ago
CMakeLists.txt fa0465954f ggml: fix: macOS build with `-DGGML_BACKEND_DL=ON` (#17581) 1 month ago
ggml-alloc.c e072b2052e ggml : add GGML_SCHED_NO_REALLOC option to disable reallocations in ggml_backend_sched (#17276) 1 month ago
ggml-backend-impl.h 898acba681 rpc : add support for multiple devices (#16276) 3 months ago
ggml-backend-reg.cpp 63d2fc46e1 Add experimental ggml-hexagon backend for the Hexagon NPU (#16547) 3 months ago
ggml-backend.cpp 90c72a614a ggml : extend the GGML_SCHED_NO_REALLOC debug logic of the scheduler (#17617) 1 month ago
ggml-common.h fd1234cb46 llama : add gpt-oss (#15091) 5 months ago
ggml-impl.h 389ac78b26 ggml : add ops SOFTPLUS, EXPM1, TRI, SOLVE_TRI, CUMSUM (#17063) 2 months ago
ggml-opt.cpp 5cdb27e091 finetune: SGD optimizer, more CLI args (#13873) 5 months ago
ggml-quants.c f6b4af3d04 ggml : fix uninitialized is_on_grid in quantize_row_iq3_xxs_impl (#15928) 4 months ago
ggml-quants.h fd1234cb46 llama : add gpt-oss (#15091) 5 months ago
ggml-threading.cpp ae8de6d50a ggml : build backends as libraries (#10256) 1 year ago
ggml-threading.h cb13ef85a4 remove CMAKE_WINDOWS_EXPORT_ALL_SYMBOLS (#10797) 1 year ago
ggml.c 2ba719519d model: LFM2-VL fixes (#17577) 1 month ago
ggml.cpp fedf034a98 ggml : Print backtrace on uncaught C++ exceptions (ggml/1232) 7 months ago
gguf.cpp a81283820a gguf: gguf_writer refactor (#15691) 4 months ago