Johannes Gäßler 12a81af45f CUDA: broadcasting for FlashAttention mask (#14500) 6 months ago
..
cmake 3555b3004b ggml-cpu : rework weak alias on apple targets (#14146) 7 months ago
include ec68e84c32 ggml : support bcast ggml_soft_max_ext, ggml_flash_attn_ext (#14435) 6 months ago
src 12a81af45f CUDA: broadcasting for FlashAttention mask (#14500) 6 months ago
.gitignore 17eb6aa8a9 vulkan : cmake integration (#8119) 1 year ago
CMakeLists.txt 60ef23d6c1 ggml-cpu: enable IBM NNPA Vector Intrinsics (#14317) 7 months ago