cturan/llama.cpp

Author	SHA1 Message	Date
Georgi Gerganov	6562e5a4d6 context : allow cache-less context for embeddings (#13108)	8 months ago
Georgi Gerganov	226251ed56 embeddings : fix batch sizes (#13076)	8 months ago
Georgi Gerganov	e0dbec0bc6 llama : refactor llama_context, llama_kv_cache, llm_build_context (#12181)	10 months ago
mgroeber9110	5bbe6a9fe9 ggml : portability fixes for VS 2017 (#12150)	10 months ago
Georgi Gerganov	afa8a9ec9b llama : add `llama_vocab`, functions -> methods, naming (#11110)	1 year ago
Georgi Gerganov	f66f582927 llama : refactor `src/llama.cpp` (#10902)	1 year ago
Diego Devesa	7eee341bee common : use common_ prefix for common library functions (#9805)	1 year ago
Georgi Gerganov	f4d2b8846a llama : add reranking support (#9510)	1 year ago
Georgi Gerganov	6262d13e0b common : reimplement logging (#9418)	1 year ago
Georgi Gerganov	0abc6a2c25 llama : llama_perf + option to disable timings during decode (#9355)	1 year ago
slaren	49006c67b4 llama : move random seed generation to the samplers (#9398)	1 year ago
Xuan Son Nguyen	bfe76d4a17 common : move arg parser code to `arg.cpp` (#9388)	1 year ago
Xuan Son Nguyen	1b9ae5189c common : refactor arg parser (#9308)	1 year ago
Georgi Gerganov	df270ef745 llama : refactor sampling v2 (#9294)	1 year ago
fairydreaming	7c3f55c100 Add support for encoder-only T5 models (#8900)	1 year ago
Liu Jia	0a4ce78681 common : Changed tuple to struct (TODO fix) (#8823)	1 year ago
Yann Follet	646ef4a9cf embedding : more cli arguments (#7458)	1 year ago
Douglas Hanley	80ea089d77 llama : allow pooled embeddings on any model (#7477)	1 year ago
Georgi Gerganov	1442677f92 common : refactor cli arg parsing (#7675)	1 year ago
Georgi Gerganov	6ff13987ad common : normalize naming style (#7462)	1 year ago
dm4	ea3b0590ee embedding : free the batch after execution (#7297)	1 year ago
Joan Fontanals	b83cc3f5b3 llama : add Jina Embeddings architecture (#6826)	1 year ago
Jared Van Bortel	1b67731e18 BERT tokenizer fixes (#6498)	1 year ago
howlger	1e13987fba embedding : show full embedding for single prompt (#6342)	1 year ago
Minsoo Cheong	deb7240100 embedding : adjust `n_ubatch` value (#6296)	1 year ago
Georgi Gerganov	044ec4b2a5 embedding : add EOS token if not present (#899)	1 year ago
Georgi Gerganov	68265ebfc6 embedding : print all resulting embeddings (#899)	1 year ago
Georgi Gerganov	0fd6c1f015 embedding : print cosine similarity (#899)	1 year ago
slaren	f30ea47a87 llama : add pipeline parallelism support (#6017)	1 year ago
SeungWon Jeong	fb215c3832 server : normalize embeddings (#5956)	1 year ago

Newer Older

Commit History Find

Commit History