cturan/llama.cpp

Author	SHA1 Message	Date
Pedro Cuenca	b97bc3966e llama : support Llama 3 HF conversion (#6745)	1 year ago
Jared Van Bortel	1b67731e18 BERT tokenizer fixes (#6498)	1 year ago
Jared Van Bortel	4d4d2366fc convert : automatically fall back to HfVocab if tokenizer.model doesn't exist (#5821)	1 year ago
Georgi Gerganov	bf08e00643 llama : refactor k-shift implementation + KV defragmentation (#5691)	1 year ago
bmwl	f486f6e1e5 ggml : add numa options (#5377)	1 year ago
Michael Klimenko	35a2ee9143 Remove unused data and add fixes (#5154)	2 years ago
Seb C	881800d1f0 main : Add ChatML functionality to main example (#4046)	2 years ago
Kerfuffle	91f6499393 Respect tokenizer.ggml.add_bos_token value when tokenizing (#4040)	2 years ago
cebtenzzre	b12fa0d1c1 build : link against build info instead of compiling against it (#3879)	2 years ago
Marcus Dunn	5be6c803fa llama : remove token functions with `context` args in favor of `model` (#3720)	2 years ago
Georgi Gerganov	d1031cf49c sampling : refactor init to use llama_sampling_params (#3696)	2 years ago
Georgi Gerganov	0e89203b51 speculative : add tree-based sampling example (#3624)	2 years ago
Kerfuffle	70c29da118 common : fix mirostat state when using multiple sequences (#3543)	2 years ago
vvhg1	11ea5c7d96 infill. : fix tokenization (#3508)	2 years ago
vvhg1	c97f01c362 infill : add new example + extend server API (#3296)	2 years ago