Commit History

Author SHA1 Message Date
  compilade 5d46babdc2 llama : initial Mamba-2 support (#9126) 6 months ago
  Weizhao Ouyang 566c16fcce model : add support for ERNIE 4.5 0.3B model (#14408) 6 months ago
  Xuan-Son Nguyen 8846aace49 model : gemma3n text-only (#14400) 7 months ago
  Mikko Juola 9ae4143bc6 model : add dots.llm1 architecture support (#14044) (#14118) 7 months ago
  Sigbjørn Skjæret d17a809ef0 llama : support multiple classifier outputs and labels (#13940) 7 months ago
  Georgi Gerganov e298d2fbd0 kv-cache : add SWA support (#13194) 8 months ago
  Johannes Gäßler 10d2af0eaa llama/ggml: add LLM training support (#10544) 8 months ago
  ymcki 3bf785f3ef llama : Llama-3_1-Nemotron-Ultra-253B-v1 support (#12843) 8 months ago
  Georgi Gerganov c642bc014c kv-cache : separate recurrent vs non-recurrent impl (#12799) 8 months ago
  Jared Van Bortel a70183eb00 llama-model : fix the reported size class for nomic-embed-text-v2-moe (#13223) 8 months ago
  Sigbjørn Skjæret 7d3af70b08 llama : llm_type order by size (#13177) 8 months ago
  Sigbjørn Skjæret e98b3692be llama : set qwen3 model type sizes (#13175) 8 months ago
  Juk Armstrong daa422881a llama : DeepSeek V2/V3 MLA implementation (#12801) 9 months ago
  Xuan-Son Nguyen 1466621e73 llama : Support llama 4 text-only (#12791) 9 months ago
  Diego Devesa e0e912f49b llama : add option to override model tensor buffers (#11397) 9 months ago
  Sigbjørn Skjæret 2c3f8b850a llama : support BailingMoE (Ling) (#12634) 9 months ago
  Si1w f125b8dccf llama : add PLM GGUF Conversion & Inference Support (#12457) 10 months ago
  Molly Sophia 7dfad387e3 llama: Add support for RWKV v7 architecture (#12412) 10 months ago
  Georgi Gerganov e0dbec0bc6 llama : refactor llama_context, llama_kv_cache, llm_build_context (#12181) 10 months ago
  Radoslav Gerganov 667d72846c rpc : early register backend devices (#11262) 1 year ago
  Georgi Gerganov afa8a9ec9b llama : add `llama_vocab`, functions -> methods, naming (#11110) 1 year ago
  Molly Sophia ee7136c6d1 llama: add support for QRWKV6 model architecture (#11001) 1 year ago
  Pierrick Hymbert f8feb4b01a model: Add support for PhiMoE arch (#11003) 1 year ago
  fairydreaming 9394bbd484 llama : Add support for DeepSeek V3 (#11049) 1 year ago
  Georgi Gerganov f66f582927 llama : refactor `src/llama.cpp` (#10902) 1 year ago