Johannes Gäßler a15ef8f8a0 CUDA: fix partial offloading for ne0 % 256 != 0 (#8572) 1 year ago
..
ggml-cann 1bdd8ae19f [CANN] Add Ascend NPU backend (#6035) 1 year ago
ggml-cuda b078c619aa cuda : suppress 'noreturn' warn in no_device_code (#8414) 1 year ago
ggml-sycl 16bdfa42ac [SYCL] add concat through dim 1/2 (#8483) 1 year ago
kompute @ 4565194ed7 f3f65429c4 llama : reorganize source code + improve CMake (#8006) 1 year ago
kompute-shaders f3f65429c4 llama : reorganize source code + improve CMake (#8006) 1 year ago
llamafile 6b2a849d1f ggml : move sgemm sources to llamafile subfolder (#8394) 1 year ago
vulkan-shaders bda62d7999 Vulkan MMQ Fix (#8479) 1 year ago
CMakeLists.txt 1bdd8ae19f [CANN] Add Ascend NPU backend (#6035) 1 year ago
ggml-aarch64.c 8fac431b06 ggml : suppress unknown pragma 'GCC' on windows (#8460) 1 year ago
ggml-aarch64.h 370b1f7e7a ggml : minor naming changes (#8433) 1 year ago
ggml-alloc.c a15ef8f8a0 CUDA: fix partial offloading for ne0 % 256 != 0 (#8572) 1 year ago
ggml-backend-impl.h f3f65429c4 llama : reorganize source code + improve CMake (#8006) 1 year ago
ggml-backend.c a15ef8f8a0 CUDA: fix partial offloading for ne0 % 256 != 0 (#8572) 1 year ago
ggml-blas.cpp 368645698a ggml : add NVPL BLAS support (#8329) (#8425) 1 year ago
ggml-cann.cpp 1bdd8ae19f [CANN] Add Ascend NPU backend (#6035) 1 year ago
ggml-common.h 0f1a39f343 ggml : add AArch64 optimized GEMV and GEMM Q4 kernels (#5780) 1 year ago
ggml-cuda.cu a15ef8f8a0 CUDA: fix partial offloading for ne0 % 256 != 0 (#8572) 1 year ago
ggml-impl.h 0f1a39f343 ggml : add AArch64 optimized GEMV and GEMM Q4 kernels (#5780) 1 year ago
ggml-kompute.cpp f3f65429c4 llama : reorganize source code + improve CMake (#8006) 1 year ago
ggml-metal.m c917b67f06 metal : template-ify some of the kernels (#8447) 1 year ago
ggml-metal.metal c917b67f06 metal : template-ify some of the kernels (#8447) 1 year ago
ggml-quants.c 370b1f7e7a ggml : minor naming changes (#8433) 1 year ago
ggml-quants.h 370b1f7e7a ggml : minor naming changes (#8433) 1 year ago
ggml-rpc.cpp f3f65429c4 llama : reorganize source code + improve CMake (#8006) 1 year ago
ggml-sycl.cpp 16bdfa42ac [SYCL] add concat through dim 1/2 (#8483) 1 year ago
ggml-vulkan.cpp bda62d7999 Vulkan MMQ Fix (#8479) 1 year ago
ggml.c 1bdd8ae19f [CANN] Add Ascend NPU backend (#6035) 1 year ago