Johannes Gäßler 808aba3916 CUDA: optimize and refactor MMQ (#8416) 1 year ago
..
cmake f3f65429c4 llama : reorganize source code + improve CMake (#8006) 1 year ago
include 0f1a39f343 ggml : add AArch64 optimized GEMV and GEMM Q4 kernels (#5780) 1 year ago
src 808aba3916 CUDA: optimize and refactor MMQ (#8416) 1 year ago
CMakeLists.txt 6b2a849d1f ggml : move sgemm sources to llamafile subfolder (#8394) 1 year ago
ggml_vk_generate_shaders.py 3fd62a6b1c py : type-check all Python scripts with Pyright (#8341) 1 year ago