cturan/llama.cpp

Author	SHA1 Message	Date
DAN™	5a51cc1bb4 main : support special tokens as reverse/anti prompt (#5847)	1 year ago
Georgi Gerganov	bf08e00643 llama : refactor k-shift implementation + KV defragmentation (#5691)	1 year ago
Jared Van Bortel	89febfed93 examples : do not assume BOS when shifting context (#5622)	1 year ago
bmwl	f486f6e1e5 ggml : add numa options (#5377)	1 year ago
Georgi Gerganov	85910c5b30 main : ctrl+C print timing in non-interactive mode (#3873)	1 year ago
Michael Klimenko	52bb63c708 refactor : switch to emplace_back to avoid extra object (#5291)	1 year ago
divinity76	813416991a main : allow empty --prompt-cache file (#5176)	1 year ago
Yann Follet	722d33f34e main : add parameter --no-display-prompt (#4541)	2 years ago
Georgi Gerganov	7edefbd79c main : better name for variable n_print (#4874)	2 years ago
Georgi Gerganov	3ca63b4538 main : disable token count by default (#4874)	2 years ago
pudepiedj	43f76bf1c3 main : print total token count and tokens consumed so far (#4874)	2 years ago
Georgi Gerganov	52531fdff8 main : add self-extend support (#4815)	2 years ago
MaggotHATE	52c8bc3cf3 sampling : custom samplers order (#4285)	2 years ago
Andrew Godfrey	8efa0f6ebe main : pass LOG_TEE callback to llama.cpp log (#4033)	2 years ago
Seb C	881800d1f0 main : Add ChatML functionality to main example (#4046)	2 years ago
Kerfuffle	91f6499393 Respect tokenizer.ggml.add_bos_token value when tokenizing (#4040)	2 years ago
cebtenzzre	b12fa0d1c1 build : link against build info instead of compiling against it (#3879)	2 years ago
Kerfuffle	6e08281e58 Extend llama_kv_cache_seq_rm to allow matching any sequence (#3843)	2 years ago
Marcus Dunn	5be6c803fa llama : remove token functions with `context` args in favor of `model` (#3720)	2 years ago
vvhg1	d3956aea53 main : escape prompt for cfg_negative_prompt and consecutive inputs in main with interactive (#3623)	2 years ago
Georgi Gerganov	d1031cf49c sampling : refactor init to use llama_sampling_params (#3696)	2 years ago
Georgi Gerganov	0e89203b51 speculative : add tree-based sampling example (#3624)	2 years ago
Georgi Gerganov	e1675d133c llama : avoid fprintf in favor of LLAMA_LOG (#3538)	2 years ago
staviq	1a159553f9 tokenizer : special token handling (#3538)	2 years ago
Georgi Gerganov	b8fe4b5cc9 main : fix session loading bug (#3400)	2 years ago
Kerfuffle	70c29da118 common : fix mirostat state when using multiple sequences (#3543)	2 years ago
h-h-h-h	8186242b6d main : consistent prefix/suffix coloring (#3425)	2 years ago
Georgi Gerganov	ac2219fef3 llama : fix session saving/loading (#3400)	2 years ago
Cebtenzzre	bc39553c90 build : enable more non-default compiler warnings (#3200)	2 years ago
slaren	16bc66d947 llama.cpp : split llama_context_params into model and context params (#3301)	2 years ago
