Junwon Hwang
|
60591f01d4
model : add EXAONE MoE (#18543)
|
3 viikkoa sitten |
Tarek Dakhran
|
73d284a250
model : add LFM2-ColBert-350M (#18607)
|
1 kuukausi sitten |
Prabod
|
5755e52d15
model : Maincoder-1B support (#18534)
|
1 kuukausi sitten |
momonga
|
9c675c7140
model : Plamo3 support (#17304)
|
1 kuukausi sitten |
Xuan-Son Nguyen
|
4cbafad4f0
model: support MiMo-V2-Flash (#18328)
|
1 kuukausi sitten |
Saba Fallah
|
54132f1b1f
model : support for LlamaBidirectionalModel architecture (#18220)
|
1 kuukausi sitten |
Ryan Mangeno
|
dfc959b886
model : Granite Embedding support (#15641)
|
1 kuukausi sitten |
Xuan-Son Nguyen
|
7f2b2f3c77
arch: refactor LLM_TENSOR_NAMES (#18051)
|
1 kuukausi sitten |
Daniel Bevenius
|
2995341730
llama : add support for NVIDIA Nemotron 3 Nano (#18058)
|
1 kuukausi sitten |
Piotr Wilkin (ilintar)
|
746f9ee889
Override SSM_A op for Qwen3 Next to reduce splits (#17587)
|
2 kuukautta sitten |
Xuan-Son Nguyen
|
cd3c118908
model: support Ministral3 (#17644)
|
2 kuukautta sitten |
Piotr Wilkin (ilintar)
|
ff55414c42
model : Qwen3 Next (#16095)
|
2 kuukautta sitten |
Aaron Teo
|
877566d512
llama: introduce support for model-embedded sampling parameters (#17120)
|
2 kuukautta sitten |
william pan
|
4902eebe33
models : Added support for RND1 Diffusion Language Model (#17433)
|
2 kuukautta sitten |
Bartowski
|
e1fcf8b09b
model : add AfmoeForCausalLM support (#16477)
|
2 kuukautta sitten |
Li Pengzhan
|
9f052478c2
model : add openPangu-Embedded (#16941)
|
3 kuukautta sitten |
Piotr Wilkin (ilintar)
|
0de0a01576
model : Minimax M2 (#16831)
|
3 kuukautta sitten |
JJJYmmm
|
d261223d24
model: add support for qwen3vl series (#16780)
|
3 kuukautta sitten |
Tianyue-Zhao
|
bacddc049a
model: Add support for CogVLM model (#15002)
|
3 kuukautta sitten |
Sigbjørn Skjæret
|
84bf3c6778
model : add BailingMoeV2 support (#16063)
|
3 kuukautta sitten |
Xuan-Son Nguyen
|
3e3cb19f64
llama-quant: add support for mmproj (#16592)
|
3 kuukautta sitten |
Saba Fallah
|
e08db42595
model: EmbeddingGemma Adding Support for SentenceTransformers Dense Modules (#16367)
|
4 kuukautta sitten |
Tarek Dakhran
|
aeaf8a36f0
llama : support LiquidAI LFM2-MoE hybrid model (#16464)
|
4 kuukautta sitten |
Piotr Wilkin (ilintar)
|
34fcc5a4ac
model : Apertus model implementation (#15852)
|
4 kuukautta sitten |
Sigbjørn Skjæret
|
835b2b915c
model : add GroveMoE support (#15510)
|
4 kuukautta sitten |
Aman Gupta
|
6d758839ff
Add LLaDA-7b-MoE diffusion model (#16003)
|
4 kuukautta sitten |
Sigbjørn Skjæret
|
b8e09f08b9
model : add grok-2 support (#15539)
|
4 kuukautta sitten |
Jie Fu (傅杰)
|
4f658855fa
llama : support T5 models with unequal number of encoder-decoder layers (#15909)
|
5 kuukautta sitten |
Gabe Goodhart
|
fd621880f3
aLoRA Support (#15327)
|
5 kuukautta sitten |
Daniel Bevenius
|
fb15d649ed
llama : add support for EmbeddingGemma 300m (#15798)
|
5 kuukautta sitten |