This website works better with JavaScript
Home
Explore
Help
Sign In
cturan
/
llama.cpp
mirror of
https://github.com/cturan/llama.cpp
Watch
1
Star
0
Fork
0
Files
Issues
0
Wiki
Tree:
ea23c15990
Branches
Tags
k2v2
master
minimax
qwen3_next
qwen3_next_optimized
toolinjection
test
b6814
llama.cpp
/
scripts
/
snapdragon
Max Krasnyansky
95ea9e0861
Hexagon add support for f16/f32 flash attention, scale, set-rows and improve f16/32 matmul (
#18611
)
3 weeks ago
..
adb
95ea9e0861
Hexagon add support for f16/f32 flash attention, scale, set-rows and improve f16/32 matmul (
#18611
)
3 weeks ago
qdc
63d2fc46e1
Add experimental ggml-hexagon backend for the Hexagon NPU (
#16547
)
3 months ago