cturan/llama.cpp @ 6bf9b66fa3f263ca2175dcb5f6d0a658581e1dfb

compilade ee52225067 convert-hf : support direct Q8_0 conversion (#7234)		před 1 rokem
..
__init__.py	ee52225067 convert-hf : support direct Q8_0 conversion (#7234)	před 1 rokem
constants.py	5a419926b0 convert-hf : support bfloat16 conversion (#7158)	před 1 rokem
gguf.py	34b0a08207 gguf-py: Refactor and allow reading/modifying existing GGUF files (#3981)	před 2 roky
gguf_reader.py	f98eb31c51 convert-hf : save memory with lazy evaluation (#7075)	před 1 rokem
gguf_writer.py	ee52225067 convert-hf : support direct Q8_0 conversion (#7234)	před 1 rokem
lazy.py	ee52225067 convert-hf : support direct Q8_0 conversion (#7234)	před 1 rokem
py.typed	dc07dc492e convert : various script cleanups/fixes + merges and special token handling (#2842)	před 2 roky
quants.py	ee52225067 convert-hf : support direct Q8_0 conversion (#7234)	před 1 rokem
tensor_mapping.py	b83cc3f5b3 llama : add Jina Embeddings architecture (#6826)	před 1 rokem
vocab.py	f98eb31c51 convert-hf : save memory with lazy evaluation (#7075)	před 1 rokem