cturan/llama.cpp @ 808aba39161e5d7ca2ff24110b5aa14d2e536988

mirror de https://github.com/cturan/llama.cpp

Johannes Gäßler 808aba3916 CUDA: optimize and refactor MMQ (#8416)		1 ano atrás
..
cmake	f3f65429c4 llama : reorganize source code + improve CMake (#8006)	1 ano atrás
include	0f1a39f343 ggml : add AArch64 optimized GEMV and GEMM Q4 kernels (#5780)	1 ano atrás
src	808aba3916 CUDA: optimize and refactor MMQ (#8416)	1 ano atrás
CMakeLists.txt	6b2a849d1f ggml : move sgemm sources to llamafile subfolder (#8394)	1 ano atrás
ggml_vk_generate_shaders.py	3fd62a6b1c py : type-check all Python scripts with Pyright (#8341)	1 ano atrás