cturan/llama.cpp

Author	SHA1 Message	Date
Xuan-Son Nguyen	fbdfefe74e llama : gemma3 : use output tensor if it exists in model weight (#12506)	10 months ago
Georgi Gerganov	af04481e6b model : do not repack if a GPU device is present (#12498)	10 months ago
Sigbjørn Skjæret	960e726077 chore : cleanup llama_model_loader::TENSOR_ usage (#12492)	10 months ago
Sigbjørn Skjæret	dbb3a4739e llama : make Qwen2MoE QKV bias optional (#12477)	10 months ago
Sigbjørn Skjæret	108e53c2f1 llama : add support for GPT2, Bloom and CodeShell tied word embeddings (#12456)	10 months ago
Georgi Gerganov	75422e8bc4 graph : normalize Q, K, V shapes + sync cross attention (#12449)	10 months ago
Xuan-Son Nguyen	99aa304fb9 llama : add support for EXAONE tied word embeddings (#12451)	10 months ago
Molly Sophia	7dfad387e3 llama: Add support for RWKV v7 architecture (#12412)	10 months ago
Sigbjørn Skjæret	8ba95dca20 llama : fix OLMo-2-0325-32B-Instruct K-norm size (#12400)	10 months ago
Georgi Gerganov	c522ce4143 graph : simplify attn input build for unified KV cache (#12381)	10 months ago
Georgi Gerganov	081bee8c64 hparams : add SWA rope parameters (#12374)	10 months ago
Georgi Gerganov	84d5475541 llama : fix Gemma3 SWA KV cache shift (#12373)	10 months ago
Georgi Gerganov	e0dbec0bc6 llama : refactor llama_context, llama_kv_cache, llm_build_context (#12181)	10 months ago
Xuan-Son Nguyen	7841fc723e llama : Add Gemma 3 support (+ experimental vision capability) (#12343)	10 months ago
Xuan-Son Nguyen	c43a3e7996 llama : add Phi-4-mini support (supersede #12099) (#12108)	10 months ago
Vitali Lovich	3e9a2860e9 llama : expose llama_model_n_head_kv in the API (#11997)	11 months ago
Georgi Gerganov	51f311e057 llama : skip loading unused tensors (#12004)	11 months ago
Georgi Gerganov	bdcf8b6a56 cont : fix mmap flag print (#11699)	11 months ago
Georgi Gerganov	9dd7a0390f llama : add log about loading model tensors (#11699)	11 months ago
piDack	0cec062a63 llama : add support for GLM-Edge and GLM-Edge-V series models (#10573)	11 months ago
Frank Mai	1d8ee06000 rpc: fix register position (#11424)	1 year ago
Olivier Chafik	6171c9d258 Add Jinja template support (#11016)	1 year ago
Georgi Gerganov	ef6dada60c cont : fix whitespaces (#11305)	1 year ago
Kyle Bruene	ae3c1db2f9 llama : re-add LLM_ARCH_PHIMOE (#11305)	1 year ago
Radoslav Gerganov	667d72846c rpc : early register backend devices (#11262)	1 year ago
Georgi Gerganov	afa8a9ec9b llama : add `llama_vocab`, functions -> methods, naming (#11110)	1 year ago
Molly Sophia	ee7136c6d1 llama: add support for QRWKV6 model architecture (#11001)	1 year ago
Pierrick Hymbert	f8feb4b01a model: Add support for PhiMoE arch (#11003)	1 year ago
Georgi Gerganov	47182dd03f llama : update llama_model API names (#11063)	1 year ago
Georgi Gerganov	727368c60f llama : use LLAMA_TOKEN_NULL (#11062)	1 year ago

Newer Older

Commit History Find

Commit History