compilade | 5d46babdc2 | llama : initial Mamba-2 support (#9126) | 6 months ago
Weizhao Ouyang | 566c16fcce | model : add support for ERNIE 4.5 0.3B model (#14408) | 6 months ago
Xuan-Son Nguyen | 8846aace49 | model : gemma3n text-only (#14400) | 7 months ago
Mikko Juola | 9ae4143bc6 | model : add dots.llm1 architecture support (#14044) (#14118) | 7 months ago
Sigbjørn Skjæret | d17a809ef0 | llama : support multiple classifier outputs and labels (#13940) | 7 months ago
Georgi Gerganov | e298d2fbd0 | kv-cache : add SWA support (#13194) | 8 months ago
Johannes Gäßler | 10d2af0eaa | llama/ggml: add LLM training support (#10544) | 8 months ago
ymcki | 3bf785f3ef | llama : Llama-3_1-Nemotron-Ultra-253B-v1 support (#12843) | 8 months ago
Georgi Gerganov | c642bc014c | kv-cache : separate recurrent vs non-recurrent impl (#12799) | 8 months ago
Jared Van Bortel | a70183eb00 | llama-model : fix the reported size class for nomic-embed-text-v2-moe (#13223) | 8 months ago
Sigbjørn Skjæret | 7d3af70b08 | llama : llm_type order by size (#13177) | 8 months ago
Sigbjørn Skjæret | e98b3692be | llama : set qwen3 model type sizes (#13175) | 8 months ago
Juk Armstrong | daa422881a | llama : DeepSeek V2/V3 MLA implementation (#12801) | 9 months ago
Xuan-Son Nguyen | 1466621e73 | llama : Support llama 4 text-only (#12791) | 9 months ago
Diego Devesa | e0e912f49b | llama : add option to override model tensor buffers (#11397) | 9 months ago
Sigbjørn Skjæret | 2c3f8b850a | llama : support BailingMoE (Ling) (#12634) | 9 months ago
Si1w | f125b8dccf | llama : add PLM GGUF Conversion & Inference Support (#12457) | 10 months ago
Molly Sophia | 7dfad387e3 | llama: Add support for RWKV v7 architecture (#12412) | 10 months ago
Georgi Gerganov | e0dbec0bc6 | llama : refactor llama_context, llama_kv_cache, llm_build_context (#12181) | 10 months ago
Radoslav Gerganov | 667d72846c | rpc : early register backend devices (#11262) | 1 year ago
Georgi Gerganov | afa8a9ec9b | llama : add `llama_vocab`, functions -> methods, naming (#11110) | 1 year ago
Molly Sophia | ee7136c6d1 | llama: add support for QRWKV6 model architecture (#11001) | 1 year ago
Pierrick Hymbert | f8feb4b01a | model: Add support for PhiMoE arch (#11003) | 1 year ago
fairydreaming | 9394bbd484 | llama : Add support for DeepSeek V3 (#11049) | 1 year ago
Georgi Gerganov | f66f582927 | llama : refactor `src/llama.cpp` (#10902) | 1 year ago