slaren
|
f4d973cecb
convert.py : fix llama/llama2 conversion due to vocab_size=-1 (#4258)
|
2 gadi atpakaļ |
crasm
|
3014b5415d
Update docs for yarn_ext_factor <0.0 as unspecified instead of NaN (#4189)
|
2 gadi atpakaļ |
Galunid
|
f23c0359a3
ci : add flake8 to github actions (python linting) (#4129)
|
2 gadi atpakaļ |
Don Mahurin
|
2ab0707acb
convert : use 'model' value if it exists. This allows karpathy/tinyllamas to load (#4089)
|
2 gadi atpakaļ |
afrideva
|
b46d12f86d
convert.py: also look for plain model.safetensors (#4043)
|
2 gadi atpakaļ |
Kerfuffle
|
34b0a08207
gguf-py: Refactor and allow reading/modifying existing GGUF files (#3981)
|
2 gadi atpakaļ |
Galunid
|
a75fa576ab
scripts: Generalize convert scripts (#3838)
|
2 gadi atpakaļ |
cebtenzzre
|
898aeca90a
llama : implement YaRN RoPE scaling (#2268)
|
2 gadi atpakaļ |
Georgi Gerganov
|
8a2f2fea29
convert : ignore tokens if their IDs are within [0, vocab_size) (#3831)
|
2 gadi atpakaļ |
Kerfuffle
|
a5e7dbd614
llama : validate special token ids are in range when loading GGUF model (#3635)
|
2 gadi atpakaļ |
Qin Yue Chen
|
8cf19d60dc
gguf : support big endian platform (#3552)
|
2 gadi atpakaļ |
goerch
|
ff5a3f0c09
Work on the BPE tokenizer (#3252)
|
2 gadi atpakaļ |
cebtenzzre
|
0fe321031a
gguf : general usability improvements (#3409)
|
2 gadi atpakaļ |
Zhang Peiyuan
|
e519621010
convert : remove bug in convert.py permute function (#3364)
|
2 gadi atpakaļ |
Erik Scholz
|
6eeb4d9083
convert: remove most of the n_mult usage in convert.py (#3098)
|
2 gadi atpakaļ |
Cebtenzzre
|
6336d834ec
convert : fix F32 ftype not being saved (#3048)
|
2 gadi atpakaļ |
Erik Scholz
|
c9c3220c48
convert: fix convert.py not working with int filename_stem (#3028)
|
2 gadi atpakaļ |
Kerfuffle
|
cff7b0bf07
convert.py : BPE fixes (#2938)
|
2 gadi atpakaļ |
Cebtenzzre
|
bce1fef328
convert : fix another python 3.8 issue (#2949)
|
2 gadi atpakaļ |
Kerfuffle
|
aeefac4ff7
scripts: Use local gguf package when running from repo (#2927)
|
2 gadi atpakaļ |
Cebtenzzre
|
92d0b751a7
convert : fix python 3.8 support, modernize type annotations (#2916)
|
2 gadi atpakaļ |
Georgi Gerganov
|
b532a69b2f
convert.py : use dir name to name the llama
|
2 gadi atpakaļ |
Kerfuffle
|
dc07dc492e
convert : various script cleanups/fixes + merges and special token handling (#2842)
|
2 gadi atpakaļ |
jameswu2014
|
bcce96ba4d
convert.py : fix baichuan7B support (#2870)
|
2 gadi atpakaļ |
Kerfuffle
|
730d9c681e
convert.py : advanced option (#2753)
|
2 gadi atpakaļ |
Nigel Bosch
|
a2ca4e9de9
Handle null rope scaling value (#2793)
|
2 gadi atpakaļ |
Nigel Bosch
|
28b2c996ca
convert.py : Get rope scale from HuggingFace models (#2772)
|
2 gadi atpakaļ |
slaren
|
12e2e33a97
convert.py : export rope freq_base when converting CodeLlama from an HF model (#2773)
|
2 gadi atpakaļ |
slaren
|
d0f77b1353
convert.py : try to determine n_ctx automatically for CodeLlama (#2770)
|
2 gadi atpakaļ |
slaren
|
0d3094f0c7
gguf : add rope_freq_base parameter for CodeLlama (#2769)
|
2 gadi atpakaļ |