slaren
|
49006c67b4
llama : move random seed generation to the samplers (#9398)
|
1 年之前 |
Xuan Son Nguyen
|
bfe76d4a17
common : move arg parser code to `arg.cpp` (#9388)
|
1 年之前 |
Xuan Son Nguyen
|
3f7ccfd649
common : bring back missing args, add env var duplication check (#9375)
|
1 年之前 |
Xuan Son Nguyen
|
1b9ae5189c
common : refactor arg parser (#9308)
|
1 年之前 |
Georgi Gerganov
|
df270ef745
llama : refactor sampling v2 (#9294)
|
1 年之前 |
Zhenwei Jin
|
4af8420afb
common : remove duplicate function llama_should_add_bos_token (#8778)
|
1 年之前 |
Liu Jia
|
0a4ce78681
common : Changed tuple to struct (TODO fix) (#8823)
|
1 年之前 |
Georgi Gerganov
|
6f0dbf6ab0
infill : assert prefix/suffix tokens + remove old space logic (#8351)
|
1 年之前 |
Clint Herron
|
07a3fc0608
Removes multiple newlines at the end of files that is breaking the editorconfig step of CI. (#8258)
|
1 年之前 |
Sigbjørn Skjæret
|
38373cfbab
Add SPM infill support (#8016)
|
1 年之前 |
Sigbjørn Skjæret
|
91c188d6c2
Only use FIM middle token if it exists (#7648)
|
1 年之前 |
Georgi Gerganov
|
1442677f92
common : refactor cli arg parsing (#7675)
|
1 年之前 |
Georgi Gerganov
|
6ff13987ad
common : normalize naming style (#7462)
|
1 年之前 |
Pedro Cuenca
|
b97bc3966e
llama : support Llama 3 HF conversion (#6745)
|
1 年之前 |
Jared Van Bortel
|
1b67731e18
BERT tokenizer fixes (#6498)
|
1 年之前 |
Jared Van Bortel
|
4d4d2366fc
convert : automatically fall back to HfVocab if tokenizer.model doesn't exist (#5821)
|
1 年之前 |
Georgi Gerganov
|
bf08e00643
llama : refactor k-shift implementation + KV defragmentation (#5691)
|
1 年之前 |
bmwl
|
f486f6e1e5
ggml : add numa options (#5377)
|
1 年之前 |
Michael Klimenko
|
35a2ee9143
Remove unused data and add fixes (#5154)
|
2 年之前 |
Seb C
|
881800d1f0
main : Add ChatML functionality to main example (#4046)
|
2 年之前 |
Kerfuffle
|
91f6499393
Respect tokenizer.ggml.add_bos_token value when tokenizing (#4040)
|
2 年之前 |
cebtenzzre
|
b12fa0d1c1
build : link against build info instead of compiling against it (#3879)
|
2 年之前 |
Marcus Dunn
|
5be6c803fa
llama : remove token functions with `context` args in favor of `model` (#3720)
|
2 年之前 |
Georgi Gerganov
|
d1031cf49c
sampling : refactor init to use llama_sampling_params (#3696)
|
2 年之前 |
Georgi Gerganov
|
0e89203b51
speculative : add tree-based sampling example (#3624)
|
2 年之前 |
Kerfuffle
|
70c29da118
common : fix mirostat state when using multiple sequences (#3543)
|
2 年之前 |
vvhg1
|
11ea5c7d96
infill. : fix tokenization (#3508)
|
2 年之前 |
vvhg1
|
c97f01c362
infill : add new example + extend server API (#3296)
|
2 年之前 |