sharpHL f2e69d28c0 llama : add support for Orion-14B (#5118) 2 年之前
..
examples 34b0a08207 gguf-py: Refactor and allow reading/modifying existing GGUF files (#3981) 2 年之前
gguf f2e69d28c0 llama : add support for Orion-14B (#5118) 2 年之前
scripts 91f6499393 Respect tokenizer.ggml.add_bos_token value when tokenizing (#4040) 2 年之前
tests 34b0a08207 gguf-py: Refactor and allow reading/modifying existing GGUF files (#3981) 2 年之前
LICENSE 87e3733f24 gguf : make gguf pip-installable 2 年之前
README.md 769a7bc85e gguf-py : fix broken link 2 年之前
pyproject.toml 799a1cb13b llama : add Mixtral support (#4406) 2 年之前

README.md

gguf

This is a Python package for writing binary files in the GGUF (GGML Universal File) format.

See convert-llama-hf-to-gguf.py as an example for its usage.

Installation

pip install gguf

API Examples/Simple Tools

examples/writer.py — Generates example.gguf in the current directory to demonstrate generating a GGUF file. Note that this file cannot be used as a model.

scripts/gguf-dump.py — Dumps a GGUF file's metadata to the console.

scripts/gguf-set-metadata.py — Allows changing simple metadata values in a GGUF file by key.

scripts/gguf-convert-endian.py — Allows converting the endianness of GGUF files.

Development

Maintainers who participate in development of this package are advised to install it in editable mode:

cd /path/to/llama.cpp/gguf-py

pip install --editable .

Note: This may require to upgrade your Pip installation, with a message saying that editable installation currently requires setup.py. In this case, upgrade Pip to the latest:

pip install --upgrade pip

Automatic publishing with CI

There's a GitHub workflow to make a release automatically upon creation of tags in a specified format.

  1. Bump the version in pyproject.toml.
  2. Create a tag named gguf-vx.x.x where x.x.x is the semantic version number.

    git tag -a gguf-v1.0.0 -m "Version 1.0 release"
    
  3. Push the tags.

    git push origin --tags
    

Manual publishing

If you want to publish the package manually for any reason, you need to have twine and build installed:

pip install build twine

Then, follow these steps to release a new version:

  1. Bump the version in pyproject.toml.
  2. Build the package:

    python -m build
    
  3. Upload the generated distribution archives:

    python -m twine upload dist/*
    

TODO

  • Add tests
  • Include conversion scripts as command line entry points in this package.