Brian
|
a2ac89d6ef
convert.py : add python logging instead of print() (#6511)
|
1 year ago |
Georgi Gerganov
|
f4ab2a4147
llama : fix BPE pre-tokenization (#6920)
|
1 year ago |
Xuan Son Nguyen
|
7bb36ccf91
gguf : enforce that tensor names are unique (#6905)
|
1 year ago |
Sigbjørn Skjæret
|
03c0946d73
convert : support models with multiple chat templates (#6588)
|
1 year ago |
Daniel Bevenius
|
4fbd8098e6
gguf : add special tokens metadata for FIM/Infill (#6689)
|
1 year ago |
Brian
|
a8bd14d557
gguf.py : add licence and version to gguf writer (#6504)
|
1 year ago |
Andrew Canis
|
12247f4c69
llama : add Command-R support (#6033)
|
1 year ago |
Ondřej Čertík
|
7ce2c77f88
gguf : add support for I64 and F64 arrays (#6062)
|
1 year ago |
Michael Podvitskiy
|
69ff61397d
llama : support models without vocabulary (#5798)
|
1 year ago |
Georgi Gerganov
|
77178eedc8
gguf-py : fix dtype check (#6045)
|
1 year ago |
Ondřej Čertík
|
3ca23481dd
gguf-py : add support for I8, I16 and I32 (#6045)
|
1 year ago |
compilade
|
c2101a2e90
llama : support Mamba Selective State Space Models (#5328)
|
1 year ago |
Jared Van Bortel
|
c7a0ad8ec9
convert-hf : make model class definitions self-contained (#5825)
|
1 year ago |
Douglas Hanley
|
4524290e87
Use correct type of pooling for embedding models (#5500)
|
1 year ago |
Michaël de Vries
|
73122473ff
fix(gguf-py): special tokens are no longer skipped when add_<token>_token is set to false (#5487)
|
1 year ago |
Douglas Hanley
|
03bf161eb6
llama : support batched embeddings (#5466)
|
1 year ago |
Douglas Hanley
|
2891c8aa9a
Add support for BERT embedding models (#5423)
|
1 year ago |
Guoteng
|
ce32060198
llama : support InternLM2 (#5184)
|
1 year ago |
postmasters
|
83e633c27e
llama : differentiate the KV dims in the attention (#4657)
|
2 years ago |
slaren
|
799a1cb13b
llama : add Mixtral support (#4406)
|
2 years ago |
Galunid
|
f23c0359a3
ci : add flake8 to github actions (python linting) (#4129)
|
2 years ago |
slaren
|
e937066420
gguf-py : export chat templates (#4125)
|
2 years ago |
Kerfuffle
|
21fd874c8d
gguf-py: gguf_writer: Use bytearray to build metadata (#4051)
|
2 years ago |
Kerfuffle
|
34b0a08207
gguf-py: Refactor and allow reading/modifying existing GGUF files (#3981)
|
2 years ago |