Georgi Gerganov a19b5cef16 llama : fix FA when KV cache is not used (i.e. embeddings) (#12825) 9 months ago
ggml-blas 5931c1f233 ggml : add support for dynamic loading of backends (#10469) 1 year ago
ggml-cann 52b3d71f12 CANN: fix typo in ggml-cann (#12733) 9 months ago
ggml-cpu a19b5cef16 llama : fix FA when KV cache is not used (i.e. embeddings) (#12825) 9 months ago
ggml-cuda 1a1ab7e7a4 cuda : fix HIP and MUSA BF16 (#0) 9 months ago
ggml-hip becade5de7 HIP: implement FlashAttention via rocWMMA for CDNA and RDNA3+ (#12032) 10 months ago
ggml-kompute ba1cb19cdd llama : add Qwen2VL support + multimodal RoPE (#10361) 1 year ago
ggml-metal a19b5cef16 llama : fix FA when KV cache is not used (i.e. embeddings) (#12825) 9 months ago
ggml-musa b1b132efcb cuda : enable CUDA Graph on CUDA Toolkit < 12.x (#12394) 10 months ago
ggml-opencl 82974011f3 opencl: better identify Adreno GPU (#12760) 9 months ago
ggml-rpc ab6ab8f809 rpc : send hash when tensor data is above some fixed threshold (#12496) 9 months ago
ggml-sycl 656babd6c2 Revert "sycl:remove redundant memcopy in function ggml_backend_sycl_buffer_set_tensor" (#12812) 9 months ago
ggml-vulkan 0c74b04376 vulkan: fix NaN issue in flash attention shader (#12776) 9 months ago
CMakeLists.txt a69f846351 cmake : fix ccache conflict (#12522) 9 months ago
ggml-alloc.c 70680c48e5 ggml : upgrade init_tensor API to return a ggml_status (#11854) 10 months ago
ggml-backend-impl.h 70680c48e5 ggml : upgrade init_tensor API to return a ggml_status (#11854) 10 months ago
ggml-backend-reg.cpp ba7654380a ggml-backend : fix backend search path (#12330) 10 months ago
ggml-backend.cpp 5bbe6a9fe9 ggml : portability fixes for VS 2017 (#12150) 10 months ago
ggml-common.h 492d7f1ff7 musa: fix all warnings, re-enable `-DLLAMA_FATAL_WARNINGS=ON` in ci and update doc (#12611) 9 months ago
ggml-impl.h ff067dbcb9 ggml : simplify Arm fp16 CPU logic (ggml/1177) 9 months ago
ggml-opt.cpp 02e4eaf22f ggml-opt: fix data corruption (ggml/1022) 1 year ago
ggml-quants.c 5bbe6a9fe9 ggml : portability fixes for VS 2017 (#12150) 10 months ago
ggml-quants.h ae8de6d50a ggml : build backends as libraries (#10256) 1 year ago
ggml-threading.cpp ae8de6d50a ggml : build backends as libraries (#10256) 1 year ago
ggml-threading.h cb13ef85a4 remove CMAKE_WINDOWS_EXPORT_ALL_SYMBOLS (#10797) 1 year ago
ggml.c e0e912f49b llama : add option to override model tensor buffers (#11397) 9 months ago
gguf.cpp a6f32f0b34 Fix clang warning in gguf_check_reserved_keys (#12686) 9 months ago