cturan/llama.cpp @ 408616adbdae2494b8bf23e048ef059fb681a474

Alfred ce734a8a2f ggml-hexagon: Implement true Q8_0 quantization on Hexagon NPU for more accurate mixed-precision matmul operations (#17977)		1 month ago
..
cmake	9a96389544 ggml: Skip backend library linking code when GGML_BACKEND_DL=ON (#15094)	5 months ago
include	b1f3a6e5db llama: automatically set parameters not set by the user in such a way that maximizes GPU utilization (#16653)	1 month ago
src	ce734a8a2f ggml-hexagon: Implement true Q8_0 quantization on Hexagon NPU for more accurate mixed-precision matmul operations (#17977)	1 month ago
.gitignore	17eb6aa8a9 vulkan : cmake integration (#8119)	1 year ago
CMakeLists.txt	ce734a8a2f ggml-hexagon: Implement true Q8_0 quantization on Hexagon NPU for more accurate mixed-precision matmul operations (#17977)	1 month ago