cturan/llama.cpp

Autor	SHA1 Zpráva	Datum
Georgi Gerganov	6262d13e0b common : reimplement logging (#9418)	před 1 rokem
Georgi Gerganov	0abc6a2c25 llama : llama_perf + option to disable timings during decode (#9355)	před 1 rokem
slaren	49006c67b4 llama : move random seed generation to the samplers (#9398)	před 1 rokem
Xuan Son Nguyen	bfe76d4a17 common : move arg parser code to `arg.cpp` (#9388)	před 1 rokem
Xuan Son Nguyen	1b9ae5189c common : refactor arg parser (#9308)	před 1 rokem
Georgi Gerganov	df270ef745 llama : refactor sampling v2 (#9294)	před 1 rokem
fairydreaming	7c3f55c100 Add support for encoder-only T5 models (#8900)	před 1 rokem
Liu Jia	0a4ce78681 common : Changed tuple to struct (TODO fix) (#8823)	před 1 rokem
Yann Follet	646ef4a9cf embedding : more cli arguments (#7458)	před 1 rokem
Douglas Hanley	80ea089d77 llama : allow pooled embeddings on any model (#7477)	před 1 rokem
Georgi Gerganov	1442677f92 common : refactor cli arg parsing (#7675)	před 1 rokem
Georgi Gerganov	6ff13987ad common : normalize naming style (#7462)	před 1 rokem
dm4	ea3b0590ee embedding : free the batch after execution (#7297)	před 1 rokem
Joan Fontanals	b83cc3f5b3 llama : add Jina Embeddings architecture (#6826)	před 1 rokem
Jared Van Bortel	1b67731e18 BERT tokenizer fixes (#6498)	před 1 rokem
howlger	1e13987fba embedding : show full embedding for single prompt (#6342)	před 1 rokem
Minsoo Cheong	deb7240100 embedding : adjust `n_ubatch` value (#6296)	před 1 rokem
Georgi Gerganov	044ec4b2a5 embedding : add EOS token if not present (#899)	před 1 rokem
Georgi Gerganov	68265ebfc6 embedding : print all resulting embeddings (#899)	před 1 rokem
Georgi Gerganov	0fd6c1f015 embedding : print cosine similarity (#899)	před 1 rokem
slaren	f30ea47a87 llama : add pipeline parallelism support (#6017)	před 1 rokem
SeungWon Jeong	fb215c3832 server : normalize embeddings (#5956)	před 1 rokem
Georgi Gerganov	29ae62d2ae llama : fix embeddings (#5796)	před 1 rokem
bmwl	f486f6e1e5 ggml : add numa options (#5377)	před 1 rokem
Douglas Hanley	03bf161eb6 llama : support batched embeddings (#5466)	před 1 rokem
Douglas Hanley	2891c8aa9a Add support for BERT embedding models (#5423)	před 1 rokem
cebtenzzre	b12fa0d1c1 build : link against build info instead of compiling against it (#3879)	před 2 roky
slaren	16bc66d947 llama.cpp : split llama_context_params into model and context params (#3301)	před 2 roky
Georgi Gerganov	ec893798b7 llama : custom attention mask + parallel decoding + no context swaps (#3228)	před 2 roky
Cebtenzzre	8781013ef6 make : restore build-info.h dependency for several targets (#3205)	před 2 roky

Novější Starší

Historie revizí Hledat

Historie revizí