This website works better with JavaScript
Home
Explore
Help
Sign In
cturan
/
llama.cpp
mirror of
https://github.com/cturan/llama.cpp
Watch
1
Star
0
Fork
0
Files
Issues
0
Wiki
Tree:
e4386f417f
Branches
Tags
k2v2
master
minimax
qwen3_next
qwen3_next_optimized
toolinjection
test
b6814
Commit History
Find
Author
SHA1
Message
Date
Howard Su
58970a4c39
Leverage mmap for offloading tensors to GPU (
#1597
)
2 years ago
Robert Sung-wook Shin
98ed165574
OpenCL: Add release memory (
#1741
)
2 years ago
0cc4m
dcb2ed4826
OpenCL: Fix duplication of layers in VRAM and RAM, add GPU mul kernel (
#1653
)
2 years ago
0cc4m
2e6cd4b025
OpenCL Token Generation Acceleration (
#1459
)
2 years ago
0cc4m
7296c961d9
ggml : add CLBlast support (
#1164
)
2 years ago