cturan/llama.cpp

Author	SHA1 Message	Date
Georgi Gerganov	c610b6c11b kv-cache : fix SWA checks + disable cacheless iSWA (#15811)	4 months ago
Daniel Bevenius	fb15d649ed llama : add support for EmbeddingGemma 300m (#15798)	4 months ago
Georgi Gerganov	b730706a49 kv-cache : support layer reuse (#15504)	5 months ago
Georgi Gerganov	715a6db02c kv-cache : drop the "unified" prefix (#15467)	5 months ago
Georgi Gerganov	d32e03f449 server : add SWA checkpoints (#15293)	5 months ago
compilade	11a3811164 memory : handle kv_unified for hybrid models (#15050)	5 months ago
Diner Burger	496957e1cb llama : fix parameter order for hybrid memory initialization (#14725)	6 months ago
Georgi Gerganov	225e7a1438 llama : add high-throughput mode (#14363)	6 months ago
Georgi Gerganov	67d1ef23c6 batch : add optional for sequential equal split (#14511)	6 months ago
Georgi Gerganov	c79184d2d1 batch : add n_used count (#14512)	6 months ago
Georgi Gerganov	a70c8a0c4b kv-cache : use ggml_set_rows (#14285)	6 months ago
Georgi Gerganov	745f11fed0 memory : correctly handle failure in apply() (#14438)	6 months ago
Georgi Gerganov	692e3cdd0a memory : rename interface to llama_memory_context_i (#14296)	7 months ago
Georgi Gerganov	4c9fdfbe15 ubatch : new splitting logic (#14217)	7 months ago
Gabe Goodhart	edc4a29eff memory : Hybrid recurrent cache (#13979)	7 months ago