cturan/llama.cpp

Autor	SHA1 Mensaje	Fecha
Clint Herron	ad675e1c67 Added support for . (any character) token in grammar engine. (#6467)	hace 1 año
jaime-m-p	c90dbe026b Fix per token atrributes bits (#7749)	hace 1 año
Georgi Gerganov	0cd6bd3483 llama : remove beam search (#7736)	hace 1 año
jaime-m-p	3b38d48609 Per token attributes (#7685)	hace 1 año
Georgi Gerganov	5921b8f089 llama : cache llama_token_to_piece (#7587)	hace 1 año
Georgi Gerganov	eaf6e03174 llama : add comments about experimental flags (#7544)	hace 1 año
Bartowski	c429b33beb llama : add Smaug 70B support (#7402)	hace 1 año
Justine Tunney	00c6390793 main : don't print special tokens with --grammar (#6923)	hace 1 año
Daniel Bevenius	3015851c5a llama : add getters for n_threads/n_threads_batch (#7464)	hace 1 año
Anas Ahouzi	6aade19ee7 Add StableLM2 pre-tokenizer (#7349)	hace 1 año
Radoslav Gerganov	5e31828d3e ggml : add RPC backend (#6829)	hace 1 año
Ren Xuancheng	229ffff872 llama : add BPE pre-tokenization for Qwen2 (#7114)	hace 1 año
DAN™	4cd621c26d convert : add BPE pre-tokenization for DBRX (#7132)	hace 1 año
Justine Tunney	3855416027 ggml : introduce bfloat16 support (#6412)	hace 1 año
nopperl	b6aa670203 Fix OLMo HF to GGUF conversion (#6910)	hace 1 año
DAN™	889bdd7686 command-r : add BPE pre-tokenization (#7063)	hace 1 año
Georgi Gerganov	92139b90af tests : add test-tokenizer-0.sh + fix some tokenizers (#7036)	hace 1 año
Daniel Bevenius	433def286e llama : rename ctx to user_data in progress_callback (#7045)	hace 1 año
Georgi Gerganov	9c67c2773d ggml : add Flash Attention (#5021)	hace 1 año
Georgi Gerganov	f4ab2a4147 llama : fix BPE pre-tokenization (#6920)	hace 1 año
Pierrick Hymbert	0c4d489e29 quantize: add imatrix and dataset metadata in GGUF (#6658)	hace 1 año
slaren	017e6999b5 add basic tensor data validation function (#6884)	hace 1 año
jiez	1966eb2615 quantize : add '--keep-split' to quantize model into shards (#6688)	hace 1 año
Douglas Hanley	b4e4b8a935 llama : add llama_get_pooling_type function (#6862)	hace 1 año
Johannes Gäßler	28103f4832 Server: fix seed for multiple slots (#6835)	hace 1 año
Georgi Gerganov	40f74e4d73 llama : add option to render special/control tokens (#6807)	hace 1 año
Pedro Cuenca	b97bc3966e llama : support Llama 3 HF conversion (#6745)	hace 1 año
Olivier Chafik	cbaadc9294 grammars: 1.5x faster inference w/ complex grammars (vector reserves / reuses) (#6609)	hace 1 año
Jared Van Bortel	1b67731e18 BERT tokenizer fixes (#6498)	hace 1 año
Rick G	e3c337d87c llama : support negative ith in llama_get_ API (#6519)	hace 1 año

Posterior Anterior

Historial de Commits Buscar

Historial de Commits