Junwon Hwang | 60591f01d4 | model : add EXAONE MoE (#18543) | 2 weeks ago
Xuan-Son Nguyen | 506bb6e010 | model: try to improve Qwen3 Next (#18683) | 2 weeks ago
Tarek Dakhran | 73d284a250 | model : add LFM2-ColBert-350M (#18607) | 3 weeks ago
Prabod | 5755e52d15 | model : Maincoder-1B support (#18534) | 3 weeks ago
momonga | 9c675c7140 | model : Plamo3 support (#17304) | 1 month ago
Xuan-Son Nguyen | 4cbafad4f0 | model: support MiMo-V2-Flash (#18328) | 1 month ago
Saba Fallah | 54132f1b1f | model : support for LlamaBidirectionalModel architecture (#18220) | 1 month ago
Ryan Mangeno | dfc959b886 | model : Granite Embedding support (#15641) | 1 month ago
Tarek Dakhran | 982060fadc | model: fix LFM2_MOE missing tensors (#18132) | 1 month ago
Xuan-Son Nguyen | 7f2b2f3c77 | arch: refactor LLM_TENSOR_NAMES (#18051) | 1 month ago
Daniel Bevenius | 2995341730 | llama : add support for NVIDIA Nemotron 3 Nano (#18058) | 1 month ago
Piotr Wilkin (ilintar) | 746f9ee889 | Override SSM_A op for Qwen3 Next to reduce splits (#17587) | 1 month ago
Xuan-Son Nguyen | cd3c118908 | model: support Ministral3 (#17644) | 1 month ago
Piotr Wilkin (ilintar) | ff55414c42 | model : Qwen3 Next (#16095) | 2 months ago
Georgi Gerganov | c386114922 | arch : add description about LLM_TENSOR_INFOS (#17550) | 2 months ago
Georgi Gerganov | 6783b11fb0 | models : fix LFM2 tensors (#17548) | 2 months ago
Aaron Teo | 877566d512 | llama: introduce support for model-embedded sampling parameters (#17120) | 2 months ago
william pan | 4902eebe33 | models : Added support for RND1 Diffusion Language Model (#17433) | 2 months ago
Bartowski | e1fcf8b09b | model : add AfmoeForCausalLM support (#16477) | 2 months ago
Li Pengzhan | 9f052478c2 | model : add openPangu-Embedded (#16941) | 2 months ago
Piotr Wilkin (ilintar) | 0de0a01576 | model : Minimax M2 (#16831) | 3 months ago
JJJYmmm | d261223d24 | model: add support for qwen3vl series (#16780) | 3 months ago
Tianyue-Zhao | bacddc049a | model: Add support for CogVLM model (#15002) | 3 months ago
Sigbjørn Skjæret | 84bf3c6778 | model : add BailingMoeV2 support (#16063) | 3 months ago
Xuan-Son Nguyen | 3e3cb19f64 | llama-quant: add support for mmproj (#16592) | 3 months ago
Saba Fallah | e08db42595 | model: EmbeddingGemma Adding Support for SentenceTransformers Dense Modules (#16367) | 3 months ago
Tarek Dakhran | aeaf8a36f0 | llama : support LiquidAI LFM2-MoE hybrid model (#16464) | 3 months ago
Piotr Wilkin (ilintar) | 34fcc5a4ac | model : Apertus model implementation (#15852) | 3 months ago
Sigbjørn Skjæret | 835b2b915c | model : add GroveMoE support (#15510) | 4 months ago
Douglas Hanley | b5bd037832 | llama : add support for qwen3 reranker (#15824) | 4 months ago