Commit History

Author SHA1 Message Date
  Mikko Juola 971f245b3b llama : recognize IBM Granite 3.3 FIM tokens (#12988) 9 months ago
  Yuxuan Zhang 06bb53ad9b llama-model : add Glm4Model implementation for GLM-4-0414 (#12867) 9 months ago
  Xuan-Son Nguyen 1466621e73 llama : Support llama 4 text-only (#12791) 9 months ago
  yumeyao 5dd5d1ab00 vocab : use string_view::find() to avoid unnecessary looking up beyond the fragment range (#12706) 9 months ago
  Sigbjørn Skjæret 83a88bd6af vocab : BailingMoE : change possessive quantifiers to greedy (#12677) 9 months ago
  Daniel Bevenius c80a7759da vocab : add special infill tokens for CodeLlama (#11850) 9 months ago
  Sigbjørn Skjæret 2c3f8b850a llama : support BailingMoE (Ling) (#12634) 9 months ago
  Juyoung Suk b3de7cac73 llama : add Trillion 7B model support (#12556) 9 months ago
  compilade 00d53800e0 llama-vocab : add SuperBPE pre-tokenizer (#12532) 10 months ago
  mgroeber9110 5bbe6a9fe9 ggml : portability fixes for VS 2017 (#12150) 10 months ago
  Xuan-Son Nguyen c43a3e7996 llama : add Phi-4-mini support (supersede #12099) (#12108) 10 months ago
  mgroeber9110 ffd0821c57 vocab : correctly identify LF token for GPT-2 style BPE tokenizer (#11496) 11 months ago
  lexasub a5203b4465 llama : minor fixes for up llama load model speed (#11448) 11 months ago
  Xuan Son Nguyen ec7f3ac9ab llama : add support for Deepseek-R1-Qwen distill model (#11310) 1 year ago
  Georgi Gerganov a133566d34 vocab : fix double-eos check (#11273) 1 year ago
  Georgi Gerganov bbf3e55e35 vocab : add dummy tokens for "no_vocab" type (#11231) 1 year ago
  Daniel Bevenius 8f70fc3d1b llama : remove 'd' from bad special token log (#11212) 1 year ago
  Georgi Gerganov 08f10f69c3 llama : remove notion of CLS token (#11064) 1 year ago
  Georgi Gerganov afa8a9ec9b llama : add `llama_vocab`, functions -> methods, naming (#11110) 1 year ago
  Georgi Gerganov 727368c60f llama : use LLAMA_TOKEN_NULL (#11062) 1 year ago
  fairydreaming 9394bbd484 llama : Add support for DeepSeek V3 (#11049) 1 year ago
  Georgi Gerganov f66f582927 llama : refactor `src/llama.cpp` (#10902) 1 year ago
  Georgi Gerganov 30caac3a68 llama : the WPM vocabs use the CLS token as BOS (#10930) 1 year ago
  Georgi Gerganov 0bf2d10c55 tts : add OuteTTS support (#10784) 1 year ago
  Georgi Gerganov 08ea539df2 unicode : improve naming style (#10838) 1 year ago
  Riccardo Orlando 6fe6247831 llama : add Minerva 7B model support (#10673) 1 year ago
  wwoodsTM ff252ea48e llama : add DRY sampler (#9702) 1 year ago
  Georgi Gerganov 99bd4ac28c llama : infill sampling handle very long tokens (#9924) 1 year ago
  Daniel Bevenius 9e04102448 llama : suppress conversion from 'size_t' to 'int' (#9046) 1 year ago
  Georgi Gerganov 755a9b2bf0 llama : add infill sampler (#9896) 1 year ago