cturan/llama.cpp @ fcf6538ba6702c55eaec70da9a75c81d04900a72

Georgi Gerganov fabf30b4c4 llama : remove Persimmon (#7408)		1 yıl önce
..
__init__.py	ee52225067 convert-hf : support direct Q8_0 conversion (#7234)	1 yıl önce
constants.py	fabf30b4c4 llama : remove Persimmon (#7408)	1 yıl önce
gguf.py	34b0a08207 gguf-py: Refactor and allow reading/modifying existing GGUF files (#3981)	2 yıl önce
gguf_reader.py	f98eb31c51 convert-hf : save memory with lazy evaluation (#7075)	1 yıl önce
gguf_writer.py	ee52225067 convert-hf : support direct Q8_0 conversion (#7234)	1 yıl önce
lazy.py	ee52225067 convert-hf : support direct Q8_0 conversion (#7234)	1 yıl önce
py.typed	dc07dc492e convert : various script cleanups/fixes + merges and special token handling (#2842)	2 yıl önce
quants.py	ee52225067 convert-hf : support direct Q8_0 conversion (#7234)	1 yıl önce
tensor_mapping.py	b83cc3f5b3 llama : add Jina Embeddings architecture (#6826)	1 yıl önce
vocab.py	f98eb31c51 convert-hf : save memory with lazy evaluation (#7075)	1 yıl önce