Historique des commits

Auteur SHA1 Message Date
  Xuan-Son Nguyen 4d3726278b model: add llama 4 scaling for mistral-large (deepseek arch) (#17744) il y a 1 mois
  Herman Semenoff 37adc9c6ba ggml, llama : use defaulted constructors/destructors (#17649) il y a 2 mois
  Piotr Wilkin (ilintar) 746f9ee889 Override SSM_A op for Qwen3 Next to reduce splits (#17587) il y a 2 mois
  Gilad S. 00c361fe53 fix: llama arch implementation (#17665) il y a 2 mois
  Xuan-Son Nguyen cd3c118908 model: support Ministral3 (#17644) il y a 2 mois
  Piotr Wilkin (ilintar) ff55414c42 model : Qwen3 Next (#16095) il y a 2 mois
  Georgi Gerganov 6783b11fb0 models : fix LFM2 tensors (#17548) il y a 2 mois
  Aaron Teo 877566d512 llama: introduce support for model-embedded sampling parameters (#17120) il y a 2 mois
  william pan 4902eebe33 models : Added support for RND1 Diffusion Language Model (#17433) il y a 2 mois
  ubergarm 23bc779a6e model : detect GigaChat3-10-A1.8B as deepseek lite (#17420) il y a 2 mois
  Bartowski e1fcf8b09b model : add AfmoeForCausalLM support (#16477) il y a 2 mois
  Sigbjørn Skjæret 9008027aa3 hparams : add n_embd_inp() to support extended embed (#16928) il y a 2 mois
  Li Pengzhan 9f052478c2 model : add openPangu-Embedded (#16941) il y a 2 mois
  Georgi Gerganov cd5e3b5754 server : support unified cache across slots (#16736) il y a 3 mois
  Piotr Wilkin (ilintar) bea04522ff refactor : llama-model.cpp (#16252) il y a 3 mois
  Piotr Wilkin (ilintar) 0de0a01576 model : Minimax M2 (#16831) il y a 3 mois
  Giuseppe Scrivano e58d585604 model : add Granite Hybrid nano types (#16896) il y a 3 mois
  JJJYmmm d261223d24 model: add support for qwen3vl series (#16780) il y a 3 mois
  Tianyue-Zhao bacddc049a model: Add support for CogVLM model (#15002) il y a 3 mois
  Georgi Gerganov 85a7d8677b memory : remove KV cache size padding (#16812) il y a 3 mois
  Johannes Gäßler 7a0e900e36 llama: consistent ctx <-> buf order for KV cache (#16746) il y a 3 mois
  Johannes Gäßler 945501f5ea llama: fix leaked buffers for mmap + split files (#16765) il y a 3 mois
  Sigbjørn Skjæret 73a48c9790 convert : enable expert group selection for all models with it (#16691) il y a 3 mois
  Sigbjørn Skjæret 7cce4f8158 model : set res->t_embd in SmallThinker models (#16782) il y a 3 mois
  Shunta Saito 226f295f4d model : set res->t_embd in PLaMo2 models (#16766) il y a 3 mois
  Max Krasnyansky 63d2fc46e1 Add experimental ggml-hexagon backend for the Hexagon NPU (#16547) il y a 3 mois
  Sigbjørn Skjæret 84bf3c6778 model : add BailingMoeV2 support (#16063) il y a 3 mois
  Giuseppe Scrivano 0398752dd4 model : add Granite Hybrid types (#16635) il y a 3 mois
  Johannes Gäßler 66b0dbcb2d llama-model: fix insonsistent ctxs <-> bufs order (#16581) il y a 3 mois
  Xuan-Son Nguyen 3e3cb19f64 llama-quant: add support for mmproj (#16592) il y a 3 mois