cturan/llama.cpp

Autor	SHA1 Zpráva	Datum
Elaine	41b9260f18 convert : add Poro-34B-chat tokenizer support (#7713)	před 1 rokem
Clint Herron	ad675e1c67 Added support for . (any character) token in grammar engine. (#6467)	před 1 rokem
jaime-m-p	c90dbe026b Fix per token atrributes bits (#7749)	před 1 rokem
Georgi Gerganov	0cd6bd3483 llama : remove beam search (#7736)	před 1 rokem
jaime-m-p	3b38d48609 Per token attributes (#7685)	před 1 rokem
Georgi Gerganov	5921b8f089 llama : cache llama_token_to_piece (#7587)	před 1 rokem
Georgi Gerganov	eaf6e03174 llama : add comments about experimental flags (#7544)	před 1 rokem
Bartowski	c429b33beb llama : add Smaug 70B support (#7402)	před 1 rokem
Justine Tunney	00c6390793 main : don't print special tokens with --grammar (#6923)	před 1 rokem
Daniel Bevenius	3015851c5a llama : add getters for n_threads/n_threads_batch (#7464)	před 1 rokem
Anas Ahouzi	6aade19ee7 Add StableLM2 pre-tokenizer (#7349)	před 1 rokem
Radoslav Gerganov	5e31828d3e ggml : add RPC backend (#6829)	před 1 rokem
Ren Xuancheng	229ffff872 llama : add BPE pre-tokenization for Qwen2 (#7114)	před 1 rokem
DAN™	4cd621c26d convert : add BPE pre-tokenization for DBRX (#7132)	před 1 rokem
Justine Tunney	3855416027 ggml : introduce bfloat16 support (#6412)	před 1 rokem
nopperl	b6aa670203 Fix OLMo HF to GGUF conversion (#6910)	před 1 rokem
DAN™	889bdd7686 command-r : add BPE pre-tokenization (#7063)	před 1 rokem
Georgi Gerganov	92139b90af tests : add test-tokenizer-0.sh + fix some tokenizers (#7036)	před 1 rokem
Daniel Bevenius	433def286e llama : rename ctx to user_data in progress_callback (#7045)	před 1 rokem
Georgi Gerganov	9c67c2773d ggml : add Flash Attention (#5021)	před 1 rokem
Georgi Gerganov	f4ab2a4147 llama : fix BPE pre-tokenization (#6920)	před 1 rokem
Pierrick Hymbert	0c4d489e29 quantize: add imatrix and dataset metadata in GGUF (#6658)	před 1 rokem
slaren	017e6999b5 add basic tensor data validation function (#6884)	před 1 rokem
jiez	1966eb2615 quantize : add '--keep-split' to quantize model into shards (#6688)	před 1 rokem
Douglas Hanley	b4e4b8a935 llama : add llama_get_pooling_type function (#6862)	před 1 rokem
Johannes Gäßler	28103f4832 Server: fix seed for multiple slots (#6835)	před 1 rokem
Georgi Gerganov	40f74e4d73 llama : add option to render special/control tokens (#6807)	před 1 rokem
Pedro Cuenca	b97bc3966e llama : support Llama 3 HF conversion (#6745)	před 1 rokem
Olivier Chafik	cbaadc9294 grammars: 1.5x faster inference w/ complex grammars (vector reserves / reuses) (#6609)	před 1 rokem
Jared Van Bortel	1b67731e18 BERT tokenizer fixes (#6498)	před 1 rokem

Novější Starší

Historie revizí Hledat

Historie revizí