cturan/llama.cpp

mirror of https://github.com/cturan/llama.cpp

Author	SHA1	Message	Date
Diego Devesa	295354ea68	llama : fix K-shift with quantized K and BLAS backend (#13113)	9 months ago
fairydreaming	8fcb563613	Load all MoE experts during warmup (#11571)	10 months ago
Georgi Gerganov	84d5475541	llama : fix Gemma3 SWA KV cache shift (#12373)	10 months ago
Georgi Gerganov	e0dbec0bc6	llama : refactor llama_context, llama_kv_cache, llm_build_context (#12181)	10 months ago
Georgi Gerganov	afa8a9ec9b	llama : add `llama_vocab`, functions -> methods, naming (#11110)	1 year ago
Georgi Gerganov	f66f582927	llama : refactor `src/llama.cpp` (#10902)	1 year ago