Commit History

| Author | SHA1 | Message | Date |
| --- | --- | --- | --- |
| Georgi Gerganov | 692e3cdd0a | memory : rename interface to llama_memory_context_i (#14296) | 7 months ago |
| Georgi Gerganov | 4c9fdfbe15 | ubatch : new splitting logic (#14217) | 7 months ago |
| Georgi Gerganov | 60c666347b | batch : rework llama_batch_allocr (#14153) | 7 months ago |
| Georgi Gerganov | 7f37b6cf1e | memory : migrate from llama_kv_cache to more generic llama_memory (#14006) | 7 months ago |
| Georgi Gerganov | 3e63a58ef7 | kv-cache : refactor the update/defrag mechanism (#13988) | 7 months ago |
| Georgi Gerganov | 3f55f781f1 | llama : auto-batch preparation (#13845) | 7 months ago |
| Georgi Gerganov | 12d0188c0d | kv-cache : refactor + add llama_memory_state_i (#13746) | 7 months ago |
| Johannes Gäßler | 10d2af0eaa | llama/ggml: add LLM training support (#10544) | 8 months ago |
| Georgi Gerganov | 51fb96b1ff | context : remove logits_all flag (#13284) | 8 months ago |
| Georgi Gerganov | c642bc014c | kv-cache : separate recurrent vs non-recurrent impl (#12799) | 8 months ago |
| Diego Devesa | 295354ea68 | llama : fix K-shift with quantized K and BLAS backend (#13113) | 9 months ago |
| fairydreaming | 8fcb563613 | Load all MoE experts during warmup (#11571) | 10 months ago |
| Georgi Gerganov | 84d5475541 | llama : fix Gemma3 SWA KV cache shift (#12373) | 10 months ago |
| Georgi Gerganov | e0dbec0bc6 | llama : refactor llama_context, llama_kv_cache, llm_build_context (#12181) | 10 months ago |
| Georgi Gerganov | afa8a9ec9b | llama : add `llama_vocab`, functions -> methods, naming (#11110) | 1 year ago |
| Georgi Gerganov | f66f582927 | llama : refactor `src/llama.cpp` (#10902) | 1 year ago |