Xuan-Son Nguyen
|
c531edfa34
convert : fix conversion for llama 4 (#13567)
|
8 months ago |
Gabe Goodhart
|
d590cd4c24
model : Granite MoE shared (#13269)
|
8 months ago |
City
|
3eac209319
mtmd : support InternVL 3 38B and 78B mmproj (#13443)
|
8 months ago |
Xuan-Son Nguyen
|
053367d149
mtmd : support InternVL 2.5 and 3 (#13422)
|
8 months ago |
Xuan-Son Nguyen
|
5215b91e93
clip : fix confused naming ffn_up and ffn_down (#13290)
|
9 months ago |
Xuan-Son Nguyen
|
074e42ab31
convert : converting mmproj for Qwen2/2.5VL from convert_hf_to_gguf (#13209)
|
9 months ago |
Xuan-Son Nguyen
|
8936784f7a
mtmd : add **vision** support for Mistral Small 3.1 (#13231)
|
9 months ago |
AT
|
5f5e39e1ba
model : Nomic Embed Text V2 with Mixture-of-Experts (MoE) architecture (#12466)
|
9 months ago |
Xuan-Son Nguyen
|
ecda2ec4b3
mtmd : Support Pixtral 12B (#13065)
|
9 months ago |
Xuan-Son Nguyen
|
dc39a5e7a8
mtmd : support SmolVLM (version 1 and 2) (#13050)
|
9 months ago |
Xuan-Son Nguyen
|
2016f07bd1
convert : experimental support for `--mmproj` flag (#13023)
|
9 months ago |
Juk Armstrong
|
daa422881a
llama : DeepSeek V2/V3 MLA implementation (#12801)
|
9 months ago |
Yuxuan Zhang
|
06bb53ad9b
llama-model : add Glm4Model implementation for GLM-4-0414 (#12867)
|
9 months ago |
Xuan-Son Nguyen
|
5b1f13cb64
convert : proper tensor name mapping for llama4 (#12870)
|
9 months ago |
Sigbjørn Skjæret
|
2c3f8b850a
llama : support BailingMoE (Ling) (#12634)
|
10 months ago |
Molly Sophia
|
7dfad387e3
llama: Add support for RWKV v7 architecture (#12412)
|
10 months ago |
Molly Sophia
|
ee7136c6d1
llama: add support for QRWKV6 model architecture (#11001)
|
1 year ago |
Pierrick Hymbert
|
f8feb4b01a
model: Add support for PhiMoE arch (#11003)
|
1 year ago |
fairydreaming
|
9394bbd484
llama : Add support for DeepSeek V3 (#11049)
|
1 year ago |
ymcki
|
6f0c9e034b
llama : support for Llama-3_1-Nemotron-51B (#10669)
|
1 year ago |
Georgi Gerganov
|
0bf2d10c55
tts : add OuteTTS support (#10784)
|
1 year ago |
Valentin Mamedov
|
a0974156f3
llama : add Deepseek MoE v1 & GigaChat models (#10827)
|
1 year ago |
Georgi Gerganov
|
c5ede3849f
convert : add custom attention mapping
|
1 year ago |
Shane A
|
80acb7b430
Rename Olmo1124 to Olmo2 (#10500)
|
1 year ago |
Shane A
|
a88ad007de
llama : add OLMo November 2024 support (#10394)
|
1 year ago |
compilade
|
1927378bcc
convert : refactor rope_freqs generation (#9396)
|
1 year ago |
Georgi Gerganov
|
f4d2b8846a
llama : add reranking support (#9510)
|
1 year ago |
nopperl
|
9a913110cf
llama : add support for Chameleon (#8543)
|
1 year ago |
Gabe Goodhart
|
3d6bf6919f
llama : add IBM Granite MoE architecture (#9438)
|
1 year ago |
Shane A
|
0aadac10c7
llama : support OLMoE (#9462)
|
1 year ago |