Alfred ce734a8a2f ggml-hexagon: Implement true Q8_0 quantization on Hexagon NPU for more accurate mixed-precision matmul operations (#17977) 1 month ago
cmake 9a96389544 ggml: Skip backend library linking code when GGML_BACKEND_DL=ON (#15094) 5 months ago
include b1f3a6e5db llama: automatically set parameters not set by the user in such a way that maximizes GPU utilization (#16653) 1 month ago
src ce734a8a2f ggml-hexagon: Implement true Q8_0 quantization on Hexagon NPU for more accurate mixed-precision matmul operations (#17977) 1 month ago
.gitignore 17eb6aa8a9 vulkan : cmake integration (#8119) 1 year ago
CMakeLists.txt ce734a8a2f ggml-hexagon: Implement true Q8_0 quantization on Hexagon NPU for more accurate mixed-precision matmul operations (#17977) 1 month ago