Mikko Juola
|
971f245b3b
llama : recognize IBM Granite 3.3 FIM tokens (#12988)
|
9 months ago |
Yuxuan Zhang
|
06bb53ad9b
llama-model : add Glm4Model implementation for GLM-4-0414 (#12867)
|
9 months ago |
Xuan-Son Nguyen
|
1466621e73
llama : Support llama 4 text-only (#12791)
|
9 months ago |
yumeyao
|
5dd5d1ab00
vocab : use string_view::find() to avoid unnecessary looking up beyond the fragment range (#12706)
|
9 months ago |
Sigbjørn Skjæret
|
83a88bd6af
vocab : BailingMoE : change possessive quantifiers to greedy (#12677)
|
9 months ago |
Daniel Bevenius
|
c80a7759da
vocab : add special infill tokens for CodeLlama (#11850)
|
9 months ago |
Sigbjørn Skjæret
|
2c3f8b850a
llama : support BailingMoE (Ling) (#12634)
|
9 months ago |
Juyoung Suk
|
b3de7cac73
llama : add Trillion 7B model support (#12556)
|
9 months ago |
compilade
|
00d53800e0
llama-vocab : add SuperBPE pre-tokenizer (#12532)
|
10 months ago |
mgroeber9110
|
5bbe6a9fe9
ggml : portability fixes for VS 2017 (#12150)
|
10 months ago |
Xuan-Son Nguyen
|
c43a3e7996
llama : add Phi-4-mini support (supersede #12099) (#12108)
|
10 months ago |
mgroeber9110
|
ffd0821c57
vocab : correctly identify LF token for GPT-2 style BPE tokenizer (#11496)
|
11 months ago |
lexasub
|
a5203b4465
llama : minor fixes for up llama load model speed (#11448)
|
11 months ago |
Xuan Son Nguyen
|
ec7f3ac9ab
llama : add support for Deepseek-R1-Qwen distill model (#11310)
|
1 year ago |
Georgi Gerganov
|
a133566d34
vocab : fix double-eos check (#11273)
|
1 year ago |
Georgi Gerganov
|
bbf3e55e35
vocab : add dummy tokens for "no_vocab" type (#11231)
|
1 year ago |
Daniel Bevenius
|
8f70fc3d1b
llama : remove 'd' from bad special token log (#11212)
|
1 year ago |
Georgi Gerganov
|
08f10f69c3
llama : remove notion of CLS token (#11064)
|
1 year ago |
Georgi Gerganov
|
afa8a9ec9b
llama : add `llama_vocab`, functions -> methods, naming (#11110)
|
1 year ago |
Georgi Gerganov
|
727368c60f
llama : use LLAMA_TOKEN_NULL (#11062)
|
1 year ago |
fairydreaming
|
9394bbd484
llama : Add support for DeepSeek V3 (#11049)
|
1 year ago |
Georgi Gerganov
|
f66f582927
llama : refactor `src/llama.cpp` (#10902)
|
1 year ago |
Georgi Gerganov
|
30caac3a68
llama : the WPM vocabs use the CLS token as BOS (#10930)
|
1 year ago |
Georgi Gerganov
|
0bf2d10c55
tts : add OuteTTS support (#10784)
|
1 year ago |
Georgi Gerganov
|
08ea539df2
unicode : improve naming style (#10838)
|
1 year ago |
Riccardo Orlando
|
6fe6247831
llama : add Minerva 7B model support (#10673)
|
1 year ago |
wwoodsTM
|
ff252ea48e
llama : add DRY sampler (#9702)
|
1 year ago |
Georgi Gerganov
|
99bd4ac28c
llama : infill sampling handle very long tokens (#9924)
|
1 year ago |
Daniel Bevenius
|
9e04102448
llama : suppress conversion from 'size_t' to 'int' (#9046)
|
1 year ago |
Georgi Gerganov
|
755a9b2bf0
llama : add infill sampler (#9896)
|
1 year ago |