Prajwal B Mehendarkar 6d69ab3f26 cmake : Dont define XOPENSOURCE on AIX (#16481) 3 months ago
..
ggml-blas 4d3d455d3c sync : whisper.cpp (ggml/1359) 3 months ago
ggml-cann aa4711d369 CANN: Improve ACL graph matching (#16166) 3 months ago
ggml-cpu 1deee0f8d4 cpu : optimize the ggml NORM operation (#15953) 3 months ago
ggml-cuda 9d0882840e Disable CUDA host buffers on integrated GPUs (#16308) 3 months ago
ggml-hip e95fec640f HIP: Disable ROCWMMA fattn on CDNA when compiled against ROCWMMA 2.0.0 (#16221) 3 months ago
ggml-metal b2c08c9ec4 metal : mark FA blocks (#16372) 3 months ago
ggml-musa 91a2a56556 musa: update compile flags (#16265) 3 months ago
ggml-opencl 7c156df414 opencl: support pad_ext (#15888) 3 months ago
ggml-rpc f39283960b rpc : check src buffer when copying tensor (#16421) 3 months ago
ggml-sycl b260213755 [SYCL] refactor soft_max, add soft_max_back (#16472) 3 months ago
ggml-vulkan 86df2c9ae4 vulkan: use a more appropriate amount of threads when generating shaders (#16418) 3 months ago
ggml-webgpu 74b8fc17f9 ggml webgpu: profiling, CI updates, reworking of command submission (#16452) 3 months ago
ggml-zdnn 264f1b5187 zdnn: refactor codebase + add docs (#16178) 3 months ago
CMakeLists.txt 6d69ab3f26 cmake : Dont define XOPENSOURCE on AIX (#16481) 3 months ago
ggml-alloc.c 638d330246 ggml : fix graph reallocation with multiple chunks (#16396) 3 months ago
ggml-backend-impl.h 898acba681 rpc : add support for multiple devices (#16276) 3 months ago
ggml-backend-reg.cpp 02463ab27b ggml-backend : add root cause in error message if loading backend library fails (#16172) 3 months ago
ggml-backend.cpp e789095502 llama: print memory breakdown on exit (#15860) 3 months ago
ggml-common.h fd1234cb46 llama : add gpt-oss (#15091) 5 months ago
ggml-impl.h 34fcc5a4ac model : Apertus model implementation (#15852) 3 months ago
ggml-opt.cpp 5cdb27e091 finetune: SGD optimizer, more CLI args (#13873) 5 months ago
ggml-quants.c f6b4af3d04 ggml : fix uninitialized is_on_grid in quantize_row_iq3_xxs_impl (#15928) 3 months ago
ggml-quants.h fd1234cb46 llama : add gpt-oss (#15091) 5 months ago
ggml-threading.cpp ae8de6d50a ggml : build backends as libraries (#10256) 1 year ago
ggml-threading.h cb13ef85a4 remove CMAKE_WINDOWS_EXPORT_ALL_SYMBOLS (#10797) 1 year ago
ggml.c ef07a40906 ggml webgpu: add support for soft_max, optimize rms_norm (#16357) 3 months ago
ggml.cpp fedf034a98 ggml : Print backtrace on uncaught C++ exceptions (ggml/1232) 7 months ago
gguf.cpp a81283820a gguf: gguf_writer refactor (#15691) 4 months ago