Commit History

Author SHA1 Message Date
  Dibakar Gope 0f1a39f343 ggml : add AArch64 optimized GEMV and GEMM Q4 kernels (#5780) 1 year ago
  toyer 905942abdb llama : support glm3 and glm4 (#8031) 1 year ago
  jaime-m-p 213701b51a Detokenizer fixes (#8039) 1 year ago
  Douglas Hanley d12f781074 llama : streamline embeddings from "non-embedding" models (#8087) 1 year ago
  fairydreaming 807b0c49ff Inference support for T5 and FLAN-T5 model families (#5763) 1 year ago
  Faisal Zaghloul 968967376d Add `JAIS` model(s) (#8118) 1 year ago
  kustaaya f675b20a3b Added support for Viking pre-tokenizer (#8135) 1 year ago
  Georgi Gerganov f3f65429c4 llama : reorganize source code + improve CMake (#8006) 1 year ago