cturan/llama.cpp

espejo de https://github.com/cturan/llama.cpp

Autor	SHA1 Mensaje	Fecha
Alfred	ce734a8a2f ggml-hexagon: Implement true Q8_0 quantization on Hexagon NPU for more accurate mixed-precision matmul operations (#17977)	hace 4 semanas
Max Krasnyansky	63d2fc46e1 Add experimental ggml-hexagon backend for the Hexagon NPU (#16547)	hace 2 meses