| File | Commit | Message | Age |
|---|---|---|---|
| .. | | | |
| __init__.py | 672a6f1018 | convert-*.py: GGUF Naming Convention Refactor and Metadata Override Refactor (#7499) | 1 year ago |
| constants.py | 2a24c8caa6 | Add Nemotron/Minitron GGUF Conversion & Inference Support (#8922) | 1 year ago |
| gguf.py | 34b0a08207 | gguf-py: Refactor and allow reading/modifying existing GGUF files (#3981) | 2 years ago |
| gguf_reader.py | 3fd62a6b1c | py : type-check all Python scripts with Pyright (#8341) | 1 year ago |
| gguf_writer.py | d3f0c7166a | Stop the generation when <\|eom_id\|> token is encountered - needed for Llama 3.1 tool call support (#8858) | 1 year ago |
| lazy.py | 3a14e00366 | gguf-py : simplify support for quant types (#8838) | 1 year ago |
| metadata.py | 1e6f6554aa | server : add lora hotswap endpoint (WIP) (#8857) | 1 year ago |
| py.typed | dc07dc492e | convert : various script cleanups/fixes + merges and special token handling (#2842) | 2 years ago |
| quants.py | 4134999e01 | gguf-py : Numpy dequantization for most types (#8939) | 1 year ago |
| tensor_mapping.py | 2a24c8caa6 | Add Nemotron/Minitron GGUF Conversion & Inference Support (#8922) | 1 year ago |
| utility.py | 328884f421 | gguf-py : fix some metadata name extraction edge cases (#8591) | 1 year ago |
| vocab.py | 9c4c9cc83f | Move convert.py to examples/convert-legacy-llama.py (#7430) | 1 year ago |