Commit History

Author SHA1 Message Date
  piDack 0cec062a63 llama : add support for GLM-Edge and GLM-Edge-V series models (#10573) 1 year ago
  Georgi Gerganov 08f10f69c3 llama : remove notion of CLS token (#11064) 1 year ago
  Molly Sophia ee7136c6d1 llama: add support for QRWKV6 model architecture (#11001) 1 year ago
  Pierrick Hymbert f8feb4b01a model: Add support for PhiMoE arch (#11003) 1 year ago
  fairydreaming 9394bbd484 llama : Add support for DeepSeek V3 (#11049) 1 year ago
  DAN™ 46be942214 llama : add support for the cohere2 model architecture (#10900) 1 year ago
  ymcki 6f0c9e034b llama : support for Llama-3_1-Nemotron-51B (#10669) 1 year ago
  Georgi Gerganov 0bf2d10c55 tts : add OuteTTS support (#10784) 1 year ago
  Valentin Mamedov a0974156f3 llama : add Deepseek MoE v1 & GigaChat models (#10827) 1 year ago
  HimariO ba1cb19cdd llama : add Qwen2VL support + multimodal RoPE (#10361) 1 year ago
  Robert Collins 62e84d9848 llama : add 128k yarn context for Qwen (#10698) 1 year ago
  Djip007 19d8762ab6 ggml : refactor online repacking (#10446) 1 year ago
  JFLFY2255 8d0cfd554a llama: Support MiniCPM-1B (with & w/o longrope) (#10559) 1 year ago
  Shane A 80acb7b430 Rename Olmo1124 to Olmo2 (#10500) 1 year ago
  Shane A a88ad007de llama : add OLMo November 2024 support (#10394) 1 year ago
  Brian a0ec17b32e metadata: Detailed Dataset Authorship Metadata (#8875) 1 year ago
  Georgi Gerganov 11ac9800af llama : improve infill support and special token detection (#9798) 1 year ago
  compilade 1927378bcc convert : refactor rope_freqs generation (#9396) 1 year ago
  Georgi Gerganov f4d2b8846a llama : add reranking support (#9510) 1 year ago
  nopperl 9a913110cf llama : add support for Chameleon (#8543) 1 year ago
  Gabe Goodhart 3d6bf6919f llama : add IBM Granite MoE architecture (#9438) 1 year ago
  Gabe Goodhart 0d2ec43833 llama : support IBM Granite architecture (#9412) 1 year ago
  Shane A 0aadac10c7 llama : support OLMoE (#9462) 1 year ago
  CarryFun 95ca85168b llama : support MiniCPM3 (#9322) 1 year ago
  compilade 9bc6db28d0 ggml-quants : ternary packing for TriLMs and BitNet b1.58 (#8151) 1 year ago
  Molly Sophia 8f1d81a0b6 llama : support RWKV v6 models (#8980) 1 year ago
  Younes Belkada b40eb84895 llama : support for `falcon-mamba` architecture (#9074) 1 year ago
  Minsoo Cheong c679e0cb5c llama : add EXAONE model support (#9025) 1 year ago
  Yoshi Suhara 2a24c8caa6 Add Nemotron/Minitron GGUF Conversion & Inference Support (#8922) 1 year ago
  fairydreaming 7c3f55c100 Add support for encoder-only T5 models (#8900) 1 year ago