cturan/llama.cpp

Autor	SHA1 Mensaje	Fecha
Georgi Gerganov	dba497e0c1 cmake : restore LLAMA_LLAMAFILE_DEFAULT	hace 1 año
slaren	d6e1d44f16 llama : synchronize before get/set session data (#6911)	hace 1 año
slaren	0ead1f1072 llama : check that all the tensor data is in the model file (#6885)	hace 1 año
Georgi Gerganov	aa750c1ede tests : minor bash stuff (#6902)	hace 1 año
jiez	1966eb2615 quantize : add '--keep-split' to quantize model into shards (#6688)	hace 1 año
Douglas Hanley	b4e4b8a935 llama : add llama_get_pooling_type function (#6862)	hace 1 año
Johannes Gäßler	28103f4832 Server: fix seed for multiple slots (#6835)	hace 1 año
Tristan Druyen	abd3314064 llama : add phi 3 chat template (#6857)	hace 1 año
liuwei-git	c8297c6af5 llama : add phi3 support (#6852)	hace 1 año
Georgi Gerganov	8960fe86ae llama : fix typo in <\|im_end\|> token text (#6745)	hace 1 año
Georgi Gerganov	40f74e4d73 llama : add option to render special/control tokens (#6807)	hace 1 año
Wouter	7dbdba5690 llama : add llama-3 chat template (#6751)	hace 1 año
Pedro Cuenca	b97bc3966e llama : support Llama 3 HF conversion (#6745)	hace 1 año
nopperl	9958c81b79 Implement the OLMo architecture (#6741)	hace 1 año
slaren	0d56246f4b ggml : group all experts in a single ggml_mul_mat_id (#6505)	hace 1 año
Ren Xuancheng	e11b2e6e1e Qwen2 : assume tied weights if lm_head/output weights is missing (#6738)	hace 1 año
slaren	c71bfd736e llama : fix compatibility with old 2 expert models (#6735)	hace 1 año
Georgi Gerganov	532c1737a1 llama : make general.name optional (#6709)	hace 1 año
Ashish	dbceec87c0 llama : add StableLM2 12B (#6635)	hace 1 año
Shijie	f4dea7da18 llama : add qwen2moe (#6074)	hace 1 año
Daniel Bevenius	4fbd8098e6 gguf : add special tokens metadata for FIM/Infill (#6689)	hace 1 año
compilade	132f55795e llama : fix restoring the number of outputs from state files (#6687)	hace 1 año
David Renshaw	1958f7e06c llama : add missing kv clear in llama_beam_search (#6664)	hace 1 año
Chao Jiang	04fbc5f23e Add Command R chat template (#6650)	hace 1 año
Pierrick Hymbert	4bd0f93e4a model: support arch `DbrxForCausalLM` (#6515)	hace 1 año
jiez	91c736015b llama : add gguf_remove_key + remove split meta during quantize (#6591)	hace 1 año
MasterYi1024	dee7f8d692 Correct free memory and total memory. (#6630)	hace 1 año
Clint Herron	04a5ac211e Optimization: eliminate addition of redundant stacks when advancing grammar. (#6616)	hace 1 año
Olivier Chafik	cbaadc9294 grammars: 1.5x faster inference w/ complex grammars (vector reserves / reuses) (#6609)	hace 1 año
Pierrick Hymbert	b804b1ef77 eval-callback: Example how to use eval callback for debugging (#6576)	hace 1 año

Posterior Anterior

Historial de Commits Buscar

Historial de Commits