Liu Jia
|
0a4ce78681
common : Changed tuple to struct (TODO fix) (#8823)
|
1 год назад |
Yann Follet
|
646ef4a9cf
embedding : more cli arguments (#7458)
|
1 год назад |
Douglas Hanley
|
80ea089d77
llama : allow pooled embeddings on any model (#7477)
|
1 год назад |
Georgi Gerganov
|
1442677f92
common : refactor cli arg parsing (#7675)
|
1 год назад |
Georgi Gerganov
|
6ff13987ad
common : normalize naming style (#7462)
|
1 год назад |
dm4
|
ea3b0590ee
embedding : free the batch after execution (#7297)
|
1 год назад |
Joan Fontanals
|
b83cc3f5b3
llama : add Jina Embeddings architecture (#6826)
|
1 год назад |
Jared Van Bortel
|
1b67731e18
BERT tokenizer fixes (#6498)
|
1 год назад |
howlger
|
1e13987fba
embedding : show full embedding for single prompt (#6342)
|
1 год назад |
Minsoo Cheong
|
deb7240100
embedding : adjust `n_ubatch` value (#6296)
|
1 год назад |
Georgi Gerganov
|
044ec4b2a5
embedding : add EOS token if not present (#899)
|
1 год назад |
Georgi Gerganov
|
68265ebfc6
embedding : print all resulting embeddings (#899)
|
1 год назад |
Georgi Gerganov
|
0fd6c1f015
embedding : print cosine similarity (#899)
|
1 год назад |
slaren
|
f30ea47a87
llama : add pipeline parallelism support (#6017)
|
1 год назад |
SeungWon Jeong
|
fb215c3832
server : normalize embeddings (#5956)
|
1 год назад |
Georgi Gerganov
|
29ae62d2ae
llama : fix embeddings (#5796)
|
1 год назад |
bmwl
|
f486f6e1e5
ggml : add numa options (#5377)
|
1 год назад |
Douglas Hanley
|
03bf161eb6
llama : support batched embeddings (#5466)
|
1 год назад |
Douglas Hanley
|
2891c8aa9a
Add support for BERT embedding models (#5423)
|
1 год назад |
cebtenzzre
|
b12fa0d1c1
build : link against build info instead of compiling against it (#3879)
|
2 лет назад |
slaren
|
16bc66d947
llama.cpp : split llama_context_params into model and context params (#3301)
|
2 лет назад |
Georgi Gerganov
|
ec893798b7
llama : custom attention mask + parallel decoding + no context swaps (#3228)
|
2 лет назад |
Cebtenzzre
|
8781013ef6
make : restore build-info.h dependency for several targets (#3205)
|
2 лет назад |
Cebtenzzre
|
e6616cf0db
examples : add compiler version and target to build info (#2998)
|
2 лет назад |
Cebtenzzre
|
e64f5b5578
examples : make n_ctx warning work again (#3066)
|
2 лет назад |
Cebtenzzre
|
00d62adb79
fix some warnings from gcc and clang-tidy (#3038)
|
2 лет назад |
Georgi Gerganov
|
edd4c14817
llama : more tokenizer fixes (#2810)
|
2 лет назад |
slaren
|
519c981f8b
embedding : evaluate prompt in batches (#2713)
|
2 лет назад |
Georgi Gerganov
|
6381d4e110
gguf : new file format with flexible meta data (beta) (#2398)
|
2 лет назад |
Evan Miller
|
5656d10599
mpi : add support for distributed inference via MPI (#2099)
|
2 лет назад |