Xuan-Son Nguyen
|
c531edfa34
convert : fix conversion for llama 4 (#13567)
|
8 months ago |
Gabe Goodhart
|
d590cd4c24
model : Granite MoE shared (#13269)
|
8 months ago |
Sigbjørn Skjæret
|
d2a4ef05c6
vocab : add ByteDance-Seed/Seed-Coder (#13423)
|
8 months ago |
Xuan-Son Nguyen
|
053367d149
mtmd : support InternVL 2.5 and 3 (#13422)
|
8 months ago |
Sigbjørn Skjæret
|
1a844be132
convert : support rope_scaling type and rope_type (#13349)
|
8 months ago |
Xuan-Son Nguyen
|
32916a4907
clip : refactor graph builder (#13321)
|
8 months ago |
Sigbjørn Skjæret
|
764b85627b
convert : qwen2/3moe : set yarn metadata if present (#13331)
|
8 months ago |
Xuan-Son Nguyen
|
5215b91e93
clip : fix confused naming ffn_up and ffn_down (#13290)
|
8 months ago |
Sigbjørn Skjæret
|
ae803bfc3d
convert : bailingmoe : set yarn metadata if present (#13312)
|
8 months ago |
ymcki
|
3bf785f3ef
llama : Llama-3_1-Nemotron-Ultra-253B-v1 support (#12843)
|
9 months ago |
Jared Van Bortel
|
2f567611c0
llama-model : support Qwen2 embedding models and pooling_mode_lasttoken (#13245)
|
9 months ago |
Jared Van Bortel
|
7d2123484e
convert : use correct context length for nomic-embed-text-v2 (#13216)
|
9 months ago |
Xuan-Son Nguyen
|
074e42ab31
convert : converting mmproj for Qwen2/2.5VL from convert_hf_to_gguf (#13209)
|
9 months ago |
Xuan-Son Nguyen
|
dcf886007d
convert : explicitly disable trust_remote_code for AutoConfig (#13246)
|
9 months ago |
Xuan-Son Nguyen
|
8936784f7a
mtmd : add **vision** support for Mistral Small 3.1 (#13231)
|
9 months ago |
Xuan-Son Nguyen
|
3e168bede4
convert : improve model arch handling (#13122)
|
9 months ago |
Xuan-Son Nguyen
|
07c2e2f76c
convert : correct typo image_mean --> image_std (#13208)
|
9 months ago |
AT
|
5f5e39e1ba
model : Nomic Embed Text V2 with Mixture-of-Experts (MoE) architecture (#12466)
|
9 months ago |
matteo
|
ced44be342
llama-chat : fix wrong template in GLM4-0414 (#13140)
|
9 months ago |
HimariO
|
ca2bb89eac
clip : Add Qwen2.5VL support (#12402)
|
9 months ago |
Xuan-Son Nguyen
|
ecda2ec4b3
mtmd : Support Pixtral 12B (#13065)
|
9 months ago |
piDack
|
eb1776b15a
convert : Append mult-eos,half-rope,bos to GLM4-0414 and Z (#13021)
|
9 months ago |
Xuan-Son Nguyen
|
dc39a5e7a8
mtmd : support SmolVLM (version 1 and 2) (#13050)
|
9 months ago |
Xuan-Son Nguyen
|
2016f07bd1
convert : experimental support for `--mmproj` flag (#13023)
|
9 months ago |
Juk Armstrong
|
daa422881a
llama : DeepSeek V2/V3 MLA implementation (#12801)
|
9 months ago |
Yuxuan Zhang
|
06bb53ad9b
llama-model : add Glm4Model implementation for GLM-4-0414 (#12867)
|
9 months ago |
Daniel Han
|
ec6c09d0fa
convert : Llama4 RoPE fix (#12889)
|
9 months ago |
Xuan-Son Nguyen
|
5b1f13cb64
convert : proper tensor name mapping for llama4 (#12870)
|
9 months ago |
Xuan-Son Nguyen
|
64eda5deb9
convert : ability to lazy-load safetensors remotely without downloading to disk (#12820)
|
9 months ago |
Bo Zheng
|
d3bd7193ba
llama : Support Qwen3 and Qwen3MoE (#12828)
|
9 months ago |