Georgi Gerganov
|
4760e7cc0b
sync : ggml (backend v2) (#3912)
|
2 år sedan |
cebtenzzre
|
898aeca90a
llama : implement YaRN RoPE scaling (#2268)
|
2 år sedan |
slaren
|
a5e8c1d8c7
train-text-from-scratch : fix assert failure in ggml-alloc (#3618)
|
2 år sedan |
Georgi Gerganov
|
bc34dd4f5b
train : fix KQ_pos allocation (#3392)
|
2 år sedan |
Cebtenzzre
|
bc39553c90
build : enable more non-default compiler warnings (#3200)
|
2 år sedan |
slaren
|
16bc66d947
llama.cpp : split llama_context_params into model and context params (#3301)
|
2 år sedan |
xaedes
|
0e76a8992c
train : finetune LORA (#2632)
|
2 år sedan |
Georgi Gerganov
|
ec893798b7
llama : custom attention mask + parallel decoding + no context swaps (#3228)
|
2 år sedan |
goerch
|
b08e75baea
Fixing the last deviations from sentencepiece indicated by test-tokenizer-1 (#3170)
|
2 år sedan |
Cebtenzzre
|
00d62adb79
fix some warnings from gcc and clang-tidy (#3038)
|
2 år sedan |
xaedes
|
44c117f41e
train : mem usage and other improvements (#2439)
|
2 år sedan |
Georgi Gerganov
|
edd4c14817
llama : more tokenizer fixes (#2810)
|
2 år sedan |
Georgi Gerganov
|
ef3f333d37
ggml : sync latest (SAM + SD operators, CUDA alibi) (#2709)
|
2 år sedan |
Georgi Gerganov
|
6381d4e110
gguf : new file format with flexible meta data (beta) (#2398)
|
2 år sedan |
Kawrakow
|
eb542d3932
Add LLAMA_DEFAULT_RMS_EPS so we can change the default (#2384)
|
2 år sedan |
slaren
|
41c674161f
make rms_norm_eps a parameter (#2374)
|
2 år sedan |
Georgi Gerganov
|
513f861953
ggml : fix rope args order + assert (#2054)
|
2 år sedan |
Spencer Sutton
|
5bf2a27718
ggml : remove src0 and src1 from ggml_tensor and rename opt to src (#2178)
|
2 år sedan |
Qingyou Meng
|
1d656d6360
ggml : change ggml_graph_compute() API to not require context (#1999)
|
2 år sedan |
Georgi Gerganov
|
04606a1599
train : fix compile warning
|
2 år sedan |
Howard Su
|
b8c8dda75f
Use unsigned for random seed (#2006)
|
2 år sedan |
Georgi Gerganov
|
181e8d9755
llama : fix rope usage after ChatGLM change
|
2 år sedan |
David Yang
|
eaa6ca5a61
ggml : increase max tensor name + clean up compiler warnings in train-text (#1988)
|
2 år sedan |
Didzis Gosko
|
527b6fba1d
llama : make model stateless and context stateful (llama_state) (#1797)
|
2 år sedan |
Borislav Stanimirov
|
9cbf50c041
build : fix and ignore MSVC warnings (#1889)
|
2 år sedan |
xaedes
|
e32089b2c2
train : improved training-from-scratch example (#1652)
|
2 år sedan |