Commit History

Author SHA1 Message Date
  Xuan Son Nguyen 46110e0630 split q_proj/gate 4 months ago
  Piotr Wilkin (ilintar) c78f9fce68 Merge branch 'ggml-org:master' into qwen3_next 4 months ago
  Piotr Wilkin 344331c2b6 First draft 4 months ago
  Xuan-Son Nguyen 8f8f2274ee convert : add Llama4ForCausalLM (#16042) 4 months ago
  Daniel Bevenius 2c8dac72eb llama : fix incorrect model type for Gemma 270M (#15764) 4 months ago
  Sigbjørn Skjæret 84ab83cc0b model : jina-embeddings-v3 support (#13693) 5 months ago
  Piotr Wilkin (ilintar) b1afcab804 model : add support for Seed-OSS (#15490) 5 months ago
  Georgi Gerganov 9ef6b0b835 model : add gpt-oss type strings (#15424) 5 months ago
  Daniel Bevenius 7a0de96045 llama : add 18-layer model type for Gemma 3-270m (#15319) 5 months ago
  Georgi Gerganov fd1234cb46 llama : add gpt-oss (#15091) 5 months ago
  Sam ef0144c087 model: support GLM 4.5 family of models (#14939) 5 months ago
  Piotr Wilkin (ilintar) cb887f1bc1 model: add Ernie 4.5 MoE support (#14658) 6 months ago
  Georgi Gerganov 01612b7409 llama : reuse compute graphs (#14482) 6 months ago
  Tarek Dakhran f5e96b368f model : support LiquidAI LFM2 hybrid family (#14620) 6 months ago
  Ryan Mangeno 4bb625b713 Smoldocling support (#14597) 6 months ago
  compilade 4a5686da22 llama : support Jamba hybrid Transformer-Mamba models (#7531) 6 months ago
  Xuan-Son Nguyen 8f22dc0a53 model : add hunyuan moe (#14425) 6 months ago
  compilade 5d46babdc2 llama : initial Mamba-2 support (#9126) 7 months ago
  Weizhao Ouyang 566c16fcce model : add support for ERNIE 4.5 0.3B model (#14408) 7 months ago
  Xuan-Son Nguyen 8846aace49 model : gemma3n text-only (#14400) 7 months ago
  Mikko Juola 9ae4143bc6 model : add dots.llm1 architecture support (#14044) (#14118) 7 months ago
  Sigbjørn Skjæret d17a809ef0 llama : support multiple classifier outputs and labels (#13940) 7 months ago
  Georgi Gerganov e298d2fbd0 kv-cache : add SWA support (#13194) 8 months ago
  Johannes Gäßler 10d2af0eaa llama/ggml: add LLM training support (#10544) 8 months ago
  ymcki 3bf785f3ef llama : Llama-3_1-Nemotron-Ultra-253B-v1 support (#12843) 9 months ago
  Georgi Gerganov c642bc014c kv-cache : separate recurrent vs non-recurrent impl (#12799) 9 months ago
  Jared Van Bortel a70183eb00 llama-model : fix the reported size class for nomic-embed-text-v2-moe (#13223) 9 months ago
  Sigbjørn Skjæret 7d3af70b08 llama : llm_type order by size (#13177) 9 months ago
  Sigbjørn Skjæret e98b3692be llama : set qwen3 model type sizes (#13175) 9 months ago
  Juk Armstrong daa422881a llama : DeepSeek V2/V3 MLA implementation (#12801) 9 months ago