cturan/llama.cpp (mirror of https://github.com/cturan/llama.cpp)
Tree: f0678c5ff4
Branches: k2v2, master, minimax, qwen3_next, qwen3_next_optimized, toolinjection, test
Tags: b6814
Commit History
Author       | SHA1       | Message                                                                 | Date
Shupei Fan   | c202cef168 | ggml-cpu: support IQ4_NL_4_4 by runtime repack (#10541)                 | 1 year ago
Diego Devesa | 5931c1f233 | ggml : add support for dynamic loading of backends (#10469)             | 1 year ago
Charles Xu   | 1607a5e5b0 | backend cpu: add online flow for aarch64 Q4_0 GEMV/GEMM kernels (#9921) | 1 year ago
Diego Devesa | ae8de6d50a | ggml : build backends as libraries (#10256)                             | 1 year ago
Diego Devesa | 9f40989351 | ggml : move CPU backend to a separate file (#10144)                     | 1 year ago
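The commit "ggml : add support for dynamic loading of backends (#10469)" lets ggml discover backends built as separate shared libraries at run time instead of linking them statically. The sketch below is illustrative only and is not taken from this repository; it assumes the enumeration functions declared in upstream ggml-backend.h (ggml_backend_load_all, ggml_backend_dev_count, ggml_backend_dev_get, ggml_backend_dev_name, ggml_backend_dev_description), which should be checked against the headers in this tree.

```c
// Minimal sketch: load dynamically built ggml backends and list the devices
// they expose. Assumes the upstream ggml-backend.h API; verify the exact
// declarations in this repository before relying on them.
#include <stdio.h>
#include "ggml-backend.h"

int main(void) {
    // Load every backend shared library found in the default search paths
    // (e.g. CPU, CUDA, Metal builds shipped next to the executable).
    ggml_backend_load_all();

    // Enumerate the devices registered by the loaded backends.
    for (size_t i = 0; i < ggml_backend_dev_count(); ++i) {
        ggml_backend_dev_t dev = ggml_backend_dev_get(i);
        printf("device %zu: %s (%s)\n", i,
               ggml_backend_dev_name(dev),
               ggml_backend_dev_description(dev));
    }
    return 0;
}
```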