cturan/llama.cpp

Author	SHA1 Message	Date
Georgi Gerganov	692e3cdd0a memory : rename interface to llama_memory_context_i (#14296)	7 months ago
Georgi Gerganov	4c9fdfbe15 ubatch : new splitting logic (#14217)	7 months ago
Georgi Gerganov	60c666347b batch : rework llama_batch_allocr (#14153)	7 months ago
Georgi Gerganov	7f37b6cf1e memory : migrate from llama_kv_cache to more generic llama_memory (#14006)	7 months ago
Georgi Gerganov	3e63a58ef7 kv-cache : refactor the update/defrag mechanism (#13988)	7 months ago
Georgi Gerganov	3f55f781f1 llama : auto-batch preparation (#13845)	7 months ago
Georgi Gerganov	12d0188c0d kv-cache : refactor + add llama_memory_state_i (#13746)	7 months ago
Johannes Gäßler	10d2af0eaa llama/ggml: add LLM training support (#10544)	8 months ago
Georgi Gerganov	51fb96b1ff context : remove logits_all flag (#13284)	8 months ago
Georgi Gerganov	c642bc014c kv-cache : separate recurrent vs non-recurrent impl (#12799)	8 months ago
Diego Devesa	295354ea68 llama : fix K-shift with quantized K and BLAS backend (#13113)	9 months ago
fairydreaming	8fcb563613 Load all MoE experts during warmup (#11571)	10 months ago
Georgi Gerganov	84d5475541 llama : fix Gemma3 SWA KV cache shift (#12373)	10 months ago
Georgi Gerganov	e0dbec0bc6 llama : refactor llama_context, llama_kv_cache, llm_build_context (#12181)	10 months ago
Georgi Gerganov	afa8a9ec9b llama : add `llama_vocab`, functions -> methods, naming (#11110)	1 year ago
Georgi Gerganov	f66f582927 llama : refactor `src/llama.cpp` (#10902)	1 year ago