Xuan-Son Nguyen
|
8936784f7a
mtmd : add **vision** support for Mistral Small 3.1 (#13231)
|
9 months ago |
Xuan-Son Nguyen
|
3e168bede4
convert : improve model arch handling (#13122)
|
9 months ago |
Xuan-Son Nguyen
|
07c2e2f76c
convert : correct typo image_mean --> image_std (#13208)
|
9 months ago |
AT
|
5f5e39e1ba
model : Nomic Embed Text V2 with Mixture-of-Experts (MoE) architecture (#12466)
|
9 months ago |
matteo
|
ced44be342
llama-chat : fix wrong template in GLM4-0414 (#13140)
|
9 months ago |
HimariO
|
ca2bb89eac
clip : Add Qwen2.5VL support (#12402)
|
9 months ago |
Xuan-Son Nguyen
|
ecda2ec4b3
mtmd : Support Pixtral 12B (#13065)
|
9 months ago |
piDack
|
eb1776b15a
convert : Append mult-eos,half-rope,bos to GLM4-0414 and Z (#13021)
|
9 months ago |
Xuan-Son Nguyen
|
dc39a5e7a8
mtmd : support SmolVLM (version 1 and 2) (#13050)
|
9 months ago |
Xuan-Son Nguyen
|
2016f07bd1
convert : experimental support for `--mmproj` flag (#13023)
|
9 months ago |
Juk Armstrong
|
daa422881a
llama : DeepSeek V2/V3 MLA implementation (#12801)
|
9 months ago |
Yuxuan Zhang
|
06bb53ad9b
llama-model : add Glm4Model implementation for GLM-4-0414 (#12867)
|
10 months ago |
Daniel Han
|
ec6c09d0fa
convert : Llama4 RoPE fix (#12889)
|
10 months ago |
Xuan-Son Nguyen
|
5b1f13cb64
convert : proper tensor name mapping for llama4 (#12870)
|
10 months ago |
Xuan-Son Nguyen
|
64eda5deb9
convert : ability to lazy-load safetensors remotely without downloading to disk (#12820)
|
10 months ago |
Bo Zheng
|
d3bd7193ba
llama : Support Qwen3 and Qwen3MoE (#12828)
|
10 months ago |
Xuan-Son Nguyen
|
1466621e73
llama : Support llama 4 text-only (#12791)
|
10 months ago |
Sigbjørn Skjæret
|
5936a616e4
convert : BailingMoE : fix qkv split when head_dim is 0 (#12687)
|
10 months ago |
Sigbjørn Skjæret
|
35782aeedb
convert : BailingMoE : avoid setting rope_dim to 0 (#12678)
|
10 months ago |
Sigbjørn Skjæret
|
403fbacbbc
convert : Qwerky : use lora_rank_tokenshift and lora_rank_decay if present (#12667)
|
10 months ago |
Sigbjørn Skjæret
|
2c3f8b850a
llama : support BailingMoE (Ling) (#12634)
|
10 months ago |
Juyoung Suk
|
b3de7cac73
llama : add Trillion 7B model support (#12556)
|
10 months ago |
Si1w
|
f125b8dccf
llama : add PLM GGUF Conversion & Inference Support (#12457)
|
10 months ago |
Csaba Kecskemeti
|
d5c6309d91
convert : Support Qwen2_5_VLForConditionalGeneration (#12595)
|
10 months ago |
Georgi Gerganov
|
df4d20cd53
convert : fix squeeze for ssm_conv tensors (#12573)
|
10 months ago |
Sigbjørn Skjæret
|
53af4dba42
convert: fix Mistral3/Gemma3 model hparams init (#12571)
|
10 months ago |
compilade
|
00d53800e0
llama-vocab : add SuperBPE pre-tokenizer (#12532)
|
10 months ago |
Bartowski
|
732b5fbf5e
convert : avoid calls to tokenizer.added_tokens_decoder (#12473)
|
10 months ago |
Sigbjørn Skjæret
|
108e53c2f1
llama : add support for GPT2, Bloom and CodeShell tied word embeddings (#12456)
|
10 months ago |
Xuan-Son Nguyen
|
29fff308c7
llama : support converting Mistral Small text-only (#12450)
|
10 months ago |