Georgi Gerganov | fd1234cb46 | llama : add gpt-oss (#15091) | 5 months ago
Sam | ef0144c087 | model: support GLM 4.5 family of models (#14939) | 5 months ago
Dongliang Wei | 6c6e397aff | model : add support for SmallThinker series (#14898) | 6 months ago
Gabriel Larson | 4762ad7316 | model : make rope_yarn_log_mul optional for deepseek2 (#14896) | 6 months ago
Georgi Gerganov | 225e7a1438 | llama : add high-throughput mode (#14363) | 6 months ago
Gabriel Larson | 4a4f426944 | model : add Kimi-K2 support (#14654) | 6 months ago
Tarek Dakhran | f5e96b368f | model : support LiquidAI LFM2 hybrid family (#14620) | 6 months ago
compilade | 5d46babdc2 | llama : initial Mamba-2 support (#9126) | 6 months ago
Xuan-Son Nguyen | 8846aace49 | model : gemma3n text-only (#14400) | 7 months ago
Georgi Gerganov | 4c9fdfbe15 | ubatch : new splitting logic (#14217) | 7 months ago
Gabe Goodhart | edc4a29eff | memory : Hybrid recurrent cache (#13979) | 7 months ago
Sigbjørn Skjæret | 6385b843a8 | llama : add RobertaForSequenceClassification reranker support (#13875) | 8 months ago
Georgi Gerganov | d13d0f6135 | hparams : initialize arrays (#13728) | 8 months ago
Xuan-Son Nguyen | 8a2afb7520 | llama : allow custom list of swa_layers (#13726) | 8 months ago
Georgi Gerganov | 8e186ef0e7 | hparams : support models for which all layers use SWA (#13682) | 8 months ago
Georgi Gerganov | e298d2fbd0 | kv-cache : add SWA support (#13194) | 8 months ago
AT | 5f5e39e1ba | model : Nomic Embed Text V2 with Mixture-of-Experts (MoE) architecture (#12466) | 9 months ago
Juk Armstrong | daa422881a | llama : DeepSeek V2/V3 MLA implementation (#12801) | 9 months ago
Xuan-Son Nguyen | 1466621e73 | llama : Support llama 4 text-only (#12791) | 9 months ago
Molly Sophia | 7dfad387e3 | llama: Add support for RWKV v7 architecture (#12412) | 10 months ago
Georgi Gerganov | 081bee8c64 | hparams : add SWA rope parameters (#12374) | 10 months ago
Georgi Gerganov | 84d5475541 | llama : fix Gemma3 SWA KV cache shift (#12373) | 10 months ago
Georgi Gerganov | afa8a9ec9b | llama : add `llama_vocab`, functions -> methods, naming (#11110) | 1 year ago
Molly Sophia | ee7136c6d1 | llama: add support for QRWKV6 model architecture (#11001) | 1 year ago
fairydreaming | 9394bbd484 | llama : Add support for DeepSeek V3 (#11049) | 1 year ago
Georgi Gerganov | f66f582927 | llama : refactor `src/llama.cpp` (#10902) | 1 year ago