cturan/llama.cpp @ 6a2f0b3474d479bda4ac2ee7cfd5dcdcf0be1f79

compilade ed9f252118 gguf-py : decouple adding metadata from writing in GGUFWriter (#7827)		hace 1 año
..
__init__.py	ee52225067 convert-hf : support direct Q8_0 conversion (#7234)	hace 1 año
constants.py	f5d7b268ec llama : add jina v2 base code (#7596)	hace 1 año
gguf.py	34b0a08207 gguf-py: Refactor and allow reading/modifying existing GGUF files (#3981)	hace 2 años
gguf_reader.py	b83bab15a5 gguf-py : fix and simplify quantized shape round-trip (#7483)	hace 1 año
gguf_writer.py	ed9f252118 gguf-py : decouple adding metadata from writing in GGUFWriter (#7827)	hace 1 año
lazy.py	ee52225067 convert-hf : support direct Q8_0 conversion (#7234)	hace 1 año
py.typed	dc07dc492e convert : various script cleanups/fixes + merges and special token handling (#2842)	hace 2 años
quants.py	b83bab15a5 gguf-py : fix and simplify quantized shape round-trip (#7483)	hace 1 año
tensor_mapping.py	f5d7b268ec llama : add jina v2 base code (#7596)	hace 1 año
vocab.py	9c4c9cc83f Move convert.py to examples/convert-legacy-llama.py (#7430)	hace 1 año