Georgi Gerganov
|
0c731ca403
prompts : fix editorconfig checks after #3416
|
2 years ago |
pudepiedj
|
a8777ad84e
parallel : add option to load external prompt file (#3416)
|
2 years ago |
Jhen-Jie Hong
|
97af49fa39
server : reuse llama_sample_token common util (#3494)
|
2 years ago |
l3utterfly
|
16820a5a0d
llama : correct hparams comparison (#3446)
|
2 years ago |
Jhen-Jie Hong
|
04b2f4386e
ci : fix xcodebuild destinations (#3491)
|
2 years ago |
cebtenzzre
|
48edda30ee
convert : update Falcon script for new HF config (#3448)
|
2 years ago |
Kenvix ⭐
|
45eba9369f
build : use std::make_tuple() for compatibility with older GCC versions (#3488)
|
2 years ago |
staviq
|
acec9eaaa9
common : process escape sequences in reverse prompts (#3461)
|
2 years ago |
shibe2
|
e2583cbc29
CLBlast: Fix handling of on-device tensor data
|
2 years ago |
Jhen-Jie Hong
|
e8b8d32e86
server : fix incorrect num_tokens_predicted (#3480)
|
2 years ago |
Jhen-Jie Hong
|
8f3a642ec1
swift : disable ACCELERATE_NEW_LAPACK (#3481)
|
2 years ago |
Jhen-Jie Hong
|
0745384449
ci : add swift build via xcodebuild (#3482)
|
2 years ago |
Kerfuffle
|
019ba1dcd0
convert : fix Baichuan2 models by using vocab size in config.json (#3299)
|
2 years ago |
Georgi Gerganov
|
beabc8cfb0
readme : add project status link
|
2 years ago |
Georgi Gerganov
|
0d152b37fe
ggml : fix build after #3329
|
2 years ago |
ds5t5
|
f8c90cdbaa
llm : add Refact model (#3329)
|
2 years ago |
Georgi Gerganov
|
f93af02488
sync : ggml (conv 1d + 2d updates, UB fixes) (#3468)
|
2 years ago |
Merrick Christensen
|
f72f8f22c9
finetune : readme fix typo (#3465)
|
2 years ago |
Tameem
|
79f34abddb
ggml : add RISC-V Vector Support for K-Quants and improved the existing intrinsics (#3453)
|
2 years ago |
h-h-h-h
|
8186242b6d
main : consistent prefix/suffix coloring (#3425)
|
2 years ago |
Georgi Gerganov
|
ac2219fef3
llama : fix session saving/loading (#3400)
|
2 years ago |
Alex Klinkhamer
|
48be797ffb
llama : expose model's rope_freq_scale in the API (#3418)
|
2 years ago |
Jiahao Li
|
f56e1baec3
metal : alibi for arbitrary number of heads (#3426)
|
2 years ago |
Eve
|
017efe899d
cmake : make LLAMA_NATIVE flag actually use the instructions supported by the processor (#3273)
|
2 years ago |
goerch
|
ff5a3f0c09
Work on the BPE tokenizer (#3252)
|
2 years ago |
cebtenzzre
|
1c84003c08
convert : fix vocab size when not defined in hparams (#3421)
|
2 years ago |
cebtenzzre
|
e78f0b0d05
cmake : increase minimum version for add_link_options (#3444)
|
2 years ago |
shibe2
|
665018c749
CLBlast: Add broadcast support for matrix multiplication (#3402)
|
2 years ago |
cebtenzzre
|
29a404a951
gguf : add BERT, MPT, and GPT-J arch info (#3408)
|
2 years ago |
cebtenzzre
|
0fe321031a
gguf : general usability improvements (#3409)
|
2 years ago |