Jie Fu (傅杰)
|
4f658855fa
llama : support T5 models with unequal number of encoder-decoder layers (#15909)
|
4 meses atrás |
Daniel Bevenius
|
233d773d02
convert : force setting sliding_window from original config (#15867)
|
5 meses atrás |
Daniel Bevenius
|
fb15d649ed
llama : add support for EmbeddingGemma 300m (#15798)
|
5 meses atrás |
Jie Fu (傅杰)
|
4b20d8b7e3
convert : remove redundant code (#15708)
|
5 meses atrás |
Gabe Goodhart
|
e8d99dd0b6
nvidia nemotron nano v2 (nemotronh) (#15507)
|
5 meses atrás |
Sigbjørn Skjæret
|
84ab83cc0b
model : jina-embeddings-v3 support (#13693)
|
5 meses atrás |
Xuan-Son Nguyen
|
79a546220c
mtmd : support Kimi VL model (#15458)
|
5 meses atrás |
Weizhao Ouyang
|
0d5a470223
convert : update Ernie 4.5 dense architecture name (#15555)
|
5 meses atrás |
RunningLeon
|
7da9fed0d6
convert : support interns1-mini (#15412)
|
5 meses atrás |
Piotr Wilkin (ilintar)
|
b1afcab804
model : add support for Seed-OSS (#15490)
|
5 meses atrás |
Julien Denize
|
b2caf67db1
convert : make Mistral community chat templates optional via parameter (#15420)
|
5 meses atrás |
Sigbjørn Skjæret
|
4d196981d4
convert : force patch_embd weights to F16 or F32 to avoid broken GGUFs (#15367)
|
5 meses atrás |
Tarek Dakhran
|
65349f26f2
model : support vision LiquidAI LFM2-VL family (#15347)
|
5 meses atrás |
Sigbjørn Skjæret
|
50e81bdf5d
convert : fix merge conflicts (#15229)
|
5 meses atrás |
Julien Denize
|
a3a7874272
convert : improve Mistral models integration (#14737)
|
5 meses atrás |
Xuan-Son Nguyen
|
50aa938901
convert : support non-mxfp4 HF model (#15153)
|
6 meses atrás |
RunningLeon
|
99acbc9921
llama : Support intern-s1 (#14875)
|
6 meses atrás |
Georgi Gerganov
|
fd1234cb46
llama : add gpt-oss (#15091)
|
6 meses atrás |
Sam
|
ef0144c087
model: support GLM 4.5 family of models (#14939)
|
6 meses atrás |
Csaba Kecskemeti
|
97366dc6ab
vocab : JetBrains Mellum pre-tokenizer (#15045)
|
6 meses atrás |
Gabriel Larson
|
83bc2f288c
model : add text-only support for Kimi-VL (and find special tokens in text_config) (#15051)
|
6 meses atrás |
Douglas Hanley
|
711d5e6fe6
convert : fix Qwen3-Embedding pre-tokenizer hash (#15030)
|
6 meses atrás |
Douglas Hanley
|
339bd0268c
model : support Qwen3-Embedding (#15023)
|
6 meses atrás |
stevenkuang
|
0f5ccd6fd1
model : add hunyuan dense (#14878)
|
6 meses atrás |
Aman Gupta
|
8a4a856277
Add LLaDA 8b Diffusion model (#14771)
|
6 meses atrás |
Xuan-Son Nguyen
|
00fa15fedc
mtmd : add support for Voxtral (#14862)
|
6 meses atrás |
Dongliang Wei
|
6c6e397aff
model : add support for SmallThinker series (#14898)
|
6 meses atrás |
Shunta Saito
|
1dc9614e06
llama : fix kq_scale for the attention layers of PLaMo2 (#14892)
|
6 meses atrás |
jacekpoplawski
|
a12363bbf0
convert : text-only support for GLM-4.1V-9B-Thinking (#14823)
|
6 meses atrás |
lgai-exaone
|
e0cb5c5cb8
model : add EXAONE 4.0 support (#14630)
|
6 meses atrás |