커밋 기록

작성자 SHA1 메시지 날짜
  Georgi Gerganov c610b6c11b kv-cache : fix SWA checks + disable cacheless iSWA (#15811) 5 달 전
  Daniel Bevenius fb15d649ed llama : add support for EmbeddingGemma 300m (#15798) 5 달 전
  Georgi Gerganov b730706a49 kv-cache : support layer reuse (#15504) 5 달 전
  Dongliang Wei 6c6e397aff model : add support for SmallThinker series (#14898) 6 달 전
  Georgi Gerganov 225e7a1438 llama : add high-throughput mode (#14363) 6 달 전
  Tarek Dakhran f5e96b368f model : support LiquidAI LFM2 hybrid family (#14620) 7 달 전
  compilade 5d46babdc2 llama : initial Mamba-2 support (#9126) 7 달 전
  Georgi Gerganov 4c9fdfbe15 ubatch : new splitting logic (#14217) 7 달 전
  Gabe Goodhart edc4a29eff memory : Hybrid recurrent cache (#13979) 7 달 전
  Georgi Gerganov d13d0f6135 hparams : initialize arrays (#13728) 8 달 전
  Xuan-Son Nguyen 8a2afb7520 llama : allow custom list of swa_layers (#13726) 8 달 전
  Georgi Gerganov 8e186ef0e7 hparams : support models for which all layers use SWA (#13682) 8 달 전
  Georgi Gerganov 081bee8c64 hparams : add SWA rope parameters (#12374) 10 달 전
  Georgi Gerganov 84d5475541 llama : fix Gemma3 SWA KV cache shift (#12373) 11 달 전
  Molly Sophia ee7136c6d1 llama: add support for QRWKV6 model architecture (#11001) 1 년 전
  Georgi Gerganov f66f582927 llama : refactor `src/llama.cpp` (#10902) 1 년 전