Xuan-Son Nguyen
|
cb9178f885
llama : remove llm_graph_input_one (#14603)
|
6 months ago |
compilade
|
4a5686da22
llama : support Jamba hybrid Transformer-Mamba models (#7531)
|
6 months ago |
Sigbjørn Skjæret
|
105554595f
llama : remove unintended whitespace (#14592)
|
6 months ago |
ibrahim khadraoui
|
04655063c4
model : add support for Falcon-H1 family (#14534)
|
6 months ago |
Xuan-Son Nguyen
|
08382869a2
model : add SmolLM3 (#14581)
|
6 months ago |
Xuan-Son Nguyen
|
8f22dc0a53
model : add hunyuan moe (#14425)
|
6 months ago |
Sigbjørn Skjæret
|
e1a7059053
llama : fix incorrect minicpm3 v_states shape (#14571)
|
6 months ago |
Sigbjørn Skjæret
|
12f55c302b
llama : remove ggml_cont where possible (#14568)
|
6 months ago |
compilade
|
5d46babdc2
llama : initial Mamba-2 support (#9126)
|
6 months ago |
Weizhao Ouyang
|
566c16fcce
model : add support for ERNIE 4.5 0.3B model (#14408)
|
6 months ago |
Xuan-Son Nguyen
|
8846aace49
model : gemma3n text-only (#14400)
|
6 months ago |
Sigbjørn Skjæret
|
b25346221d
llama : return mistral-v7-tekken as default template only (#14390)
|
6 months ago |
Georgi Gerganov
|
692e3cdd0a
memory : rename interface to llama_memory_context_i (#14296)
|
7 months ago |
Georgi Gerganov
|
812939a9e9
model : more uniform output id handling (#14275)
|
7 months ago |
Gabe Goodhart
|
edc4a29eff
memory : Hybrid recurrent cache (#13979)
|
7 months ago |
Đinh Trọng Huy
|
ad590be98c
model : add NeoBERT (#14164)
|
7 months ago |
Bartowski
|
d7da8dc83a
model : Add support for Arcee AI's upcoming AFM model (#14185)
|
7 months ago |
Mikko Juola
|
9ae4143bc6
model : add dots.llm1 architecture support (#14044) (#14118)
|
7 months ago |
compilade
|
dad5c44398
kv-cache : avoid modifying recurrent cells when setting inputs (#13834)
|
7 months ago |
Sigbjørn Skjæret
|
3678b838bb
llama : support GEGLU for jina-bert-v2 (#14090)
|
7 months ago |
Sigbjørn Skjæret
|
0974ad7a7c
llama : fix llama_model_chat_template with template name (LLM_KV with suffix) (#14050)
|
7 months ago |
Sigbjørn Skjæret
|
d17a809ef0
llama : support multiple classifier outputs and labels (#13940)
|
7 months ago |
Georgi Gerganov
|
5582c49c39
gemma : more consistent attention scaling for v2 and v3 (#13951)
|
7 months ago |
Georgi Gerganov
|
0fc16b42e8
kv-cache : split implementation in separate sources (#13920)
|
7 months ago |
Georgi Gerganov
|
3600cc2886
llama : use n_swa + n_ubatch cells for SWA cache (#13833)
|
7 months ago |
Georgi Gerganov
|
12d0188c0d
kv-cache : refactor + add llama_memory_state_i (#13746)
|
7 months ago |
Đinh Trọng Huy
|
291f2b6913
llama : add support for DistilBert (#13907)
|
7 months ago |
zhangkaihuo
|
2c90da4c7e
llama : use llm_build_granite for minicpm (#13911)
|
7 months ago |
Sigbjørn Skjæret
|
e83ba3e460
llama : add support for jina-reranker-v2 (#13900)
|
7 months ago |
Sigbjørn Skjæret
|
6385b843a8
llama : add RobertaForSequenceClassification reranker support (#13875)
|
7 months ago |