Georgi Gerganov fa882fd2b1 metal : avoid using Metal's gpuAddress property (#16576) 3 months ago
ggml-blas 4d3d455d3c sync : whisper.cpp (ggml/1359) 3 months ago
ggml-cann 56fc38b965 CANN: fix CPU memory leak in CANN backend (#16549) 3 months ago
ggml-cpu 01d2bdc2bc ggml : fix build broken with -march=armv9-a on MacOS (#16520) 3 months ago
ggml-cuda 120bf7046d CUDA + openCL: fix bug in accessing rms_norm->src while doing fusion (#16577) 3 months ago
ggml-hip 11f0af5504 CUDA: faster tile FA, add oob checks, more HSs (#16492) 3 months ago
ggml-metal fa882fd2b1 metal : avoid using Metal's gpuAddress property (#16576) 3 months ago
ggml-musa 11f0af5504 CUDA: faster tile FA, add oob checks, more HSs (#16492) 3 months ago
ggml-opencl 120bf7046d CUDA + openCL: fix bug in accessing rms_norm->src while doing fusion (#16577) 3 months ago
ggml-rpc f39283960b rpc : check src buffer when copying tensor (#16421) 3 months ago
ggml-sycl c7be9febcb [SYCL] fix UT fault cases: count-equal, argsort, pad OPs (#16521) 3 months ago
ggml-vulkan ffa059034c vulkan: Add ACC_TYPE_VEC2 implementation (#16203) 3 months ago
ggml-webgpu 74b8fc17f9 ggml webgpu: profiling, CI updates, reworking of command submission (#16452) 3 months ago
ggml-zdnn 264f1b5187 zdnn: refactor codebase + add docs (#16178) 4 months ago
CMakeLists.txt 6d69ab3f26 cmake : Dont define XOPENSOURCE on AIX (#16481) 3 months ago
ggml-alloc.c 638d330246 ggml : fix graph reallocation with multiple chunks (#16396) 3 months ago
ggml-backend-impl.h 898acba681 rpc : add support for multiple devices (#16276) 3 months ago
ggml-backend-reg.cpp 02463ab27b ggml-backend : add root cause in error message if loading backend library fails (#16172) 3 months ago
ggml-backend.cpp e789095502 llama: print memory breakdown on exit (#15860) 4 months ago
ggml-common.h fd1234cb46 llama : add gpt-oss (#15091) 5 months ago
ggml-impl.h 34fcc5a4ac model : Apertus model implementation (#15852) 3 months ago
ggml-opt.cpp 5cdb27e091 finetune: SGD optimizer, more CLI args (#13873) 5 months ago
ggml-quants.c f6b4af3d04 ggml : fix uninitialized is_on_grid in quantize_row_iq3_xxs_impl (#15928) 4 months ago
ggml-quants.h fd1234cb46 llama : add gpt-oss (#15091) 5 months ago
ggml-threading.cpp ae8de6d50a ggml : build backends as libraries (#10256) 1 year ago
ggml-threading.h cb13ef85a4 remove CMAKE_WINDOWS_EXPORT_ALL_SYMBOLS (#10797) 1 year ago
ggml.c ef07a40906 ggml webgpu: add support for soft_max, optimize rms_norm (#16357) 3 months ago
ggml.cpp fedf034a98 ggml : Print backtrace on uncaught C++ exceptions (ggml/1232) 7 months ago
gguf.cpp a81283820a gguf: gguf_writer refactor (#15691) 4 months ago