piDack
|
0cec062a63
llama : add support for GLM-Edge and GLM-Edge-V series models (#10573)
|
1 year ago |
Georgi Gerganov
|
08f10f69c3
llama : remove notion of CLS token (#11064)
|
1 year ago |
Molly Sophia
|
ee7136c6d1
llama: add support for QRWKV6 model architecture (#11001)
|
1 year ago |
Pierrick Hymbert
|
f8feb4b01a
model: Add support for PhiMoE arch (#11003)
|
1 year ago |
fairydreaming
|
9394bbd484
llama : Add support for DeepSeek V3 (#11049)
|
1 year ago |
DAN™
|
46be942214
llama : add support for the cohere2 model architecture (#10900)
|
1 year ago |
ymcki
|
6f0c9e034b
llama : support for Llama-3_1-Nemotron-51B (#10669)
|
1 year ago |
Georgi Gerganov
|
0bf2d10c55
tts : add OuteTTS support (#10784)
|
1 year ago |
Valentin Mamedov
|
a0974156f3
llama : add Deepseek MoE v1 & GigaChat models (#10827)
|
1 year ago |
HimariO
|
ba1cb19cdd
llama : add Qwen2VL support + multimodal RoPE (#10361)
|
1 year ago |
Robert Collins
|
62e84d9848
llama : add 128k yarn context for Qwen (#10698)
|
1 year ago |
Djip007
|
19d8762ab6
ggml : refactor online repacking (#10446)
|
1 year ago |
JFLFY2255
|
8d0cfd554a
llama: Support MiniCPM-1B (with & w/o longrope) (#10559)
|
1 year ago |
Shane A
|
80acb7b430
Rename Olmo1124 to Olmo2 (#10500)
|
1 year ago |
Shane A
|
a88ad007de
llama : add OLMo November 2024 support (#10394)
|
1 year ago |
Brian
|
a0ec17b32e
metadata: Detailed Dataset Authorship Metadata (#8875)
|
1 year ago |
Georgi Gerganov
|
11ac9800af
llama : improve infill support and special token detection (#9798)
|
1 year ago |
compilade
|
1927378bcc
convert : refactor rope_freqs generation (#9396)
|
1 year ago |
Georgi Gerganov
|
f4d2b8846a
llama : add reranking support (#9510)
|
1 year ago |
nopperl
|
9a913110cf
llama : add support for Chameleon (#8543)
|
1 year ago |
Gabe Goodhart
|
3d6bf6919f
llama : add IBM Granite MoE architecture (#9438)
|
1 year ago |
Gabe Goodhart
|
0d2ec43833
llama : support IBM Granite architecture (#9412)
|
1 year ago |
Shane A
|
0aadac10c7
llama : support OLMoE (#9462)
|
1 year ago |
CarryFun
|
95ca85168b
llama : support MiniCPM3 (#9322)
|
1 year ago |
compilade
|
9bc6db28d0
ggml-quants : ternary packing for TriLMs and BitNet b1.58 (#8151)
|
1 year ago |
Molly Sophia
|
8f1d81a0b6
llama : support RWKV v6 models (#8980)
|
1 year ago |
Younes Belkada
|
b40eb84895
llama : support for `falcon-mamba` architecture (#9074)
|
1 year ago |
Minsoo Cheong
|
c679e0cb5c
llama : add EXAONE model support (#9025)
|
1 year ago |
Yoshi Suhara
|
2a24c8caa6
Add Nemotron/Minitron GGUF Conversion & Inference Support (#8922)
|
1 year ago |
fairydreaming
|
7c3f55c100
Add support for encoder-only T5 models (#8900)
|
1 year ago |