Douglas Hanley 339bd0268c model : support Qwen3-Embedding (#15023) 6 months ago
scripts c81f4192f9 gguf-py : dump bpw per layer and model in markdown mode (#14703) 6 months ago
__init__.py 672a6f1018 convert-*.py: GGUF Naming Convention Refactor and Metadata Override Refactor (#7499) 1 year ago
constants.py 0f5ccd6fd1 model : add hunyuan dense (#14878) 6 months ago
gguf.py 34b0a08207 gguf-py: Refactor and allow reading/modifying existing GGUF files (#3981) 2 years ago
gguf_reader.py eb0f5c28d3 gguf-py : display the invalid gguf type (#13687) 8 months ago
gguf_writer.py 8a4a856277 Add LLaDA 8b Diffusion model (#14771) 6 months ago
lazy.py a226bc7a9a gguf-py : support lazy tensor splitting (#12809) 9 months ago
metadata.py acd6cb1c41 ggml : model card yaml tab->2xspace (#14819) 6 months ago
py.typed dc07dc492e convert : various script cleanups/fixes + merges and special token handling (#2842) 2 years ago
quants.py 9bc6db28d0 ggml-quants : ternary packing for TriLMs and BitNet b1.58 (#8151) 1 year ago
tensor_mapping.py 339bd0268c model : support Qwen3-Embedding (#15023) 6 months ago
utility.py 53ae30640e gguf-py : fix SafetensorRemote return on undefined size (< 0) (#13841) 8 months ago
vocab.py 00fa15fedc mtmd : add support for Voxtral (#14862) 6 months ago