cturan/llama.cpp

miroir de https://github.com/cturan/llama.cpp

Auteur	SHA1 Message	Date
compilade	e54d41befc gguf-py : add Numpy MXFP4 de/quantization support (#15111)	il y a 6 mois
Georgi Gerganov	0bf2d10c55 tts : add OuteTTS support (#10784)	il y a 1 an
compilade	9bc6db28d0 ggml-quants : ternary packing for TriLMs and BitNet b1.58 (#8151)	il y a 1 an
compilade	4134999e01 gguf-py : Numpy dequantization for most types (#8939)	il y a 1 an