cturan/llama.cpp @ ea23c15990ea7b09cf501ef103020f05fc480511

mirror of https://github.com/cturan/llama.cpp

Max Krasnyansky 95ea9e0861 Hexagon add support for f16/f32 flash attention, scale, set-rows and improve f16/32 matmul (#18611)		3 weeks ago
..
adb	95ea9e0861 Hexagon add support for f16/f32 flash attention, scale, set-rows and improve f16/32 matmul (#18611)	3 weeks ago
qdc	63d2fc46e1 Add experimental ggml-hexagon backend for the Hexagon NPU (#16547)	3 months ago