__init__.py
|
34b0a08207
gguf-py: Refactor and allow reading/modifying existing GGUF files (#3981)
|
2 years ago |
|
constants.py
|
c1386c936e
gguf-py : add IQ1_M to GGML_QUANT_SIZES (#6761)
|
1 year ago |
|
gguf.py
|
34b0a08207
gguf-py: Refactor and allow reading/modifying existing GGUF files (#3981)
|
2 years ago |
|
gguf_reader.py
|
7ce2c77f88
gguf : add support for I64 and F64 arrays (#6062)
|
1 year ago |
|
gguf_writer.py
|
03c0946d73
convert : support models with multiple chat templates (#6588)
|
1 year ago |
|
py.typed
|
dc07dc492e
convert : various script cleanups/fixes + merges and special token handling (#2842)
|
2 years ago |
|
tensor_mapping.py
|
f4dea7da18
llama : add qwen2moe (#6074)
|
1 year ago |
|
vocab.py
|
03c0946d73
convert : support models with multiple chat templates (#6588)
|
1 year ago |