cturan/llama.cpp

Author	SHA1 Message	Date
Joan Fontanals	b83cc3f5b3 llama : add Jina Embeddings architecture (#6826)	1 year ago
Jared Van Bortel	1b67731e18 BERT tokenizer fixes (#6498)	1 year ago
howlger	1e13987fba embedding : show full embedding for single prompt (#6342)	1 year ago
Minsoo Cheong	deb7240100 embedding : adjust `n_ubatch` value (#6296)	1 year ago
Georgi Gerganov	044ec4b2a5 embedding : add EOS token if not present (#899)	1 year ago
Georgi Gerganov	68265ebfc6 embedding : print all resulting embeddings (#899)	1 year ago
Georgi Gerganov	0fd6c1f015 embedding : print cosine similarity (#899)	1 year ago
slaren	f30ea47a87 llama : add pipeline parallelism support (#6017)	1 year ago
SeungWon Jeong	fb215c3832 server : normalize embeddings (#5956)	1 year ago
Georgi Gerganov	29ae62d2ae llama : fix embeddings (#5796)	1 year ago
bmwl	f486f6e1e5 ggml : add numa options (#5377)	1 year ago
Douglas Hanley	03bf161eb6 llama : support batched embeddings (#5466)	1 year ago
Douglas Hanley	2891c8aa9a Add support for BERT embedding models (#5423)	1 year ago
cebtenzzre	b12fa0d1c1 build : link against build info instead of compiling against it (#3879)	2 years ago
slaren	16bc66d947 llama.cpp : split llama_context_params into model and context params (#3301)	2 years ago
Georgi Gerganov	ec893798b7 llama : custom attention mask + parallel decoding + no context swaps (#3228)	2 years ago
Cebtenzzre	8781013ef6 make : restore build-info.h dependency for several targets (#3205)	2 years ago
Cebtenzzre	e6616cf0db examples : add compiler version and target to build info (#2998)	2 years ago
Cebtenzzre	e64f5b5578 examples : make n_ctx warning work again (#3066)	2 years ago
Cebtenzzre	00d62adb79 fix some warnings from gcc and clang-tidy (#3038)	2 years ago
Georgi Gerganov	edd4c14817 llama : more tokenizer fixes (#2810)	2 years ago
slaren	519c981f8b embedding : evaluate prompt in batches (#2713)	2 years ago
Georgi Gerganov	6381d4e110 gguf : new file format with flexible meta data (beta) (#2398)	2 years ago
Evan Miller	5656d10599 mpi : add support for distributed inference via MPI (#2099)	2 years ago
Judd	36680f6e40 convert : update for baichuan (#2081)	2 years ago
Howard Su	b8c8dda75f Use unsigned for random seed (#2006)	2 years ago
zrm	b853d45601 ggml : add NUMA support (#1556)	2 years ago
Didzis Gosko	527b6fba1d llama : make model stateless and context stateful (llama_state) (#1797)	2 years ago
Borislav Stanimirov	9cbf50c041 build : fix and ignore MSVC warnings (#1889)	2 years ago
Georgi Gerganov	ec2e10c444 llama : add llama_init_backend() API (close #1527)	2 years ago

Newer Older

Commit History Find

Commit History