cturan/llama.cpp

mirror de https://github.com/cturan/llama.cpp

Autor	SHA1 Mensagem	Data
compilade	e54d41befc gguf-py : add Numpy MXFP4 de/quantization support (#15111)	há 6 meses atrás
Georgi Gerganov	0bf2d10c55 tts : add OuteTTS support (#10784)	há 1 ano atrás
compilade	9bc6db28d0 ggml-quants : ternary packing for TriLMs and BitNet b1.58 (#8151)	há 1 ano atrás
compilade	4134999e01 gguf-py : Numpy dequantization for most types (#8939)	há 1 ano atrás