xaedes
|
44c117f41e
train : mem usage and other improvements (#2439)
|
2 years ago |
Georgi Gerganov
|
edd4c14817
llama : more tokenizer fixes (#2810)
|
2 years ago |
Georgi Gerganov
|
ef3f333d37
ggml : sync latest (SAM + SD operators, CUDA alibi) (#2709)
|
2 years ago |
Georgi Gerganov
|
6381d4e110
gguf : new file format with flexible meta data (beta) (#2398)
|
2 years ago |
Kawrakow
|
eb542d3932
Add LLAMA_DEFAULT_RMS_EPS so we can change the default (#2384)
|
2 years ago |
slaren
|
41c674161f
make rms_norm_eps a parameter (#2374)
|
2 years ago |
Georgi Gerganov
|
513f861953
ggml : fix rope args order + assert (#2054)
|
2 years ago |
Spencer Sutton
|
5bf2a27718
ggml : remove src0 and src1 from ggml_tensor and rename opt to src (#2178)
|
2 years ago |
Qingyou Meng
|
1d656d6360
ggml : change ggml_graph_compute() API to not require context (#1999)
|
2 years ago |
Georgi Gerganov
|
04606a1599
train : fix compile warning
|
2 years ago |
Howard Su
|
b8c8dda75f
Use unsigned for random seed (#2006)
|
2 years ago |
Georgi Gerganov
|
181e8d9755
llama : fix rope usage after ChatGLM change
|
2 years ago |
David Yang
|
eaa6ca5a61
ggml : increase max tensor name + clean up compiler warnings in train-text (#1988)
|
2 years ago |
Didzis Gosko
|
527b6fba1d
llama : make model stateless and context stateful (llama_state) (#1797)
|
2 years ago |
Borislav Stanimirov
|
9cbf50c041
build : fix and ignore MSVC warnings (#1889)
|
2 years ago |
xaedes
|
e32089b2c2
train : improved training-from-scratch example (#1652)
|
2 years ago |