Nindaleth
|
87c2e8b279
gguf-dump : support i-quants (#5841)
|
1 year ago |
Sourab Mangrulkar
|
c29af7e225
llama : add StarCoder2 support (#5795)
|
1 year ago |
postmasters
|
580111d42b
llama : add `gemma` model (#5631)
|
1 year ago |
Douglas Hanley
|
4524290e87
Use correct type of pooling for embedding models (#5500)
|
2 years ago |
Michaël de Vries
|
73122473ff
fix(gguf-py): special tokens are no longer skipped when add_<token>_token is set to false (#5487)
|
2 years ago |
Jared Van Bortel
|
ea9c8e1143
llama : add support for Nomic Embed (#5468)
|
2 years ago |
Douglas Hanley
|
03bf161eb6
llama : support batched embeddings (#5466)
|
2 years ago |
Douglas Hanley
|
2891c8aa9a
Add support for BERT embedding models (#5423)
|
2 years ago |
runfuture
|
316c7faf77
llama : add MiniCPM support (#5346)
|
2 years ago |
Guoteng
|
ce32060198
llama : support InternLM2 (#5184)
|
2 years ago |
sharpHL
|
f2e69d28c0
llama : add support for Orion-14B (#5118)
|
2 years ago |
Shijie
|
9b75cb2b3c
llama : support upcoming Qwen2 (#5037)
|
2 years ago |
chiranko
|
2b3b999cac
llama : add CodeShell support (#5016)
|
2 years ago |
Georgi Gerganov
|
15ebe59210
convert : update phi-2 to latest HF repo (#4903)
|
2 years ago |
postmasters
|
83e633c27e
llama : differentiate the KV dims in the attention (#4657)
|
2 years ago |
manikbhandari
|
ea5497df5d
gpt2 : Add gpt2 architecture integration (#4555)
|
2 years ago |
Nam D. Tran
|
f6793491b5
llama : add AWQ for llama, llama2, mpt, and mistral models (#4593)
|
2 years ago |
Shintarou Okada
|
753be377b6
llama : add PLaMo model (#3557)
|
2 years ago |
Ebey Abraham
|
b9e74f9bca
llama : add phi-2 + fix NeoX rope + ggml_mul_mat_set_prec (#4490)
|
2 years ago |
slaren
|
799a1cb13b
llama : add Mixtral support (#4406)
|
2 years ago |
Shijie
|
37c746d687
llama : add Qwen support (#4281)
|
2 years ago |
slaren
|
e937066420
gguf-py : export chat templates (#4125)
|
2 years ago |
Galunid
|
36eed0c42c
stablelm : StableLM support (#3586)
|
2 years ago |
Kerfuffle
|
34b0a08207
gguf-py: Refactor and allow reading/modifying existing GGUF files (#3981)
|
2 years ago |