cturan/llama.cpp

Author	SHA1 Message	Date
Georgi Gerganov	cf0e3ba150 model : avoid ggml_cont_3d for fused QKV weights (#15662)	4 months ago
Georgi Gerganov	c610b6c11b kv-cache : fix SWA checks + disable cacheless iSWA (#15811)	4 months ago
Daniel Bevenius	fb15d649ed llama : add support for EmbeddingGemma 300m (#15798)	4 months ago
Georgi Gerganov	8a4280ce43 kv-cache : remove LLAMA_SET_ROWS checks (#15505)	5 months ago
Georgi Gerganov	1bded5a3b3 kv-cache : better estimate of n_kv for multi-sequence batches (#15610)	5 months ago
Georgi Gerganov	b730706a49 kv-cache : support layer reuse (#15504)	5 months ago
Georgi Gerganov	9ebebef62f llama : remove KV cache defragmentation logic (#15473)	5 months ago
Georgi Gerganov	715a6db02c kv-cache : drop the "unified" prefix (#15467)	5 months ago
Georgi Gerganov	7f37b6cf1e memory : migrate from llama_kv_cache to more generic llama_memory (#14006)	7 months ago
Georgi Gerganov	3e63a58ef7 kv-cache : refactor the update/defrag mechanism (#13988)	7 months ago
Georgi Gerganov	0fc16b42e8 kv-cache : split implementation in separate sources (#13920)	8 months ago
Georgi Gerganov	3600cc2886 llama : use n_swa + n_ubatch cells for SWA cache (#13833)	8 months ago
Georgi Gerganov	12d0188c0d kv-cache : refactor + add llama_memory_state_i (#13746)	8 months ago
Georgi Gerganov	81713121ee kv-cells : track min/max used cells and per-sequence positions (#13808)	8 months ago
Georgi Gerganov	de2ef53a4b kv-cache : rework kv_cell (#13706)	8 months ago
Georgi Gerganov	797f2ac062 kv-cache : simplify the interface (#13660)	8 months ago
Georgi Gerganov	a4090d1174 llama : remove llama_kv_cache_view API + remove deprecated (#13653)	8 months ago
Georgi Gerganov	e298d2fbd0 kv-cache : add SWA support (#13194)	8 months ago
Georgi Gerganov	e3a9421b78 kv-cache : fix out-of-bounds view during reserve graph (#13547)	8 months ago
Georgi Gerganov	c642bc014c kv-cache : separate recurrent vs non-recurrent impl (#12799)	9 months ago
Georgi Gerganov	3e1d29348b kv-cache : simplify + fix warning for recurrent models (#12756)	9 months ago
Georgi Gerganov	a10b36c91a llama : refactor kv cache guard (#12695)	10 months ago
Georgi Gerganov	e0dbec0bc6 llama : refactor llama_context, llama_kv_cache, llm_build_context (#12181)	10 months ago
mgroeber9110	5bbe6a9fe9 ggml : portability fixes for VS 2017 (#12150)	11 months ago
Daniel Bevenius	3e69319772 llama : update llama_decode_internal ref [no ci] (#11840)	11 months ago
Georgi Gerganov	f66f582927 llama : refactor `src/llama.cpp` (#10902)	1 year ago

Commit History Find

Commit History