Historique des commits

Auteur SHA1 Message Date
  Georgi Gerganov 9067487c44 ggml : fix FA mask dim 2 and 3 (#14505) il y a 6 mois
  Aman Gupta 55c2646b45 CUDA: add dynamic shared mem to softmax, refactor general usage (#14497) il y a 6 mois
  compilade 5d46babdc2 llama : initial Mamba-2 support (#9126) il y a 6 mois
  Georgi Gerganov ec68e84c32 ggml : support bcast ggml_soft_max_ext, ggml_flash_attn_ext (#14435) il y a 6 mois
  Jeff Bolz 6a746cf9c4 vulkan: Split large mul_mat_id to fit in shared memory (#14451) il y a 6 mois
  Acly 431b2c24f3 ggml-cpu : "align corners" for bilinear upscale/downscale (ggml/1285) il y a 6 mois
  Diego Devesa eb3fa2913e test-backend-ops : disable llama test (#14461) il y a 6 mois
  Sigbjørn Skjæret a0535ffa0d ggml : implement REGLU/GEGLU/SWIGLU ops (#14158) il y a 6 mois
  Jeff Bolz bd9c981d72 vulkan: Add fusion support for RMS_NORM+MUL (#14366) il y a 6 mois
  Aman Gupta 27208bf657 CUDA: add bf16 and f32 support to cublas_mul_mat_batched (#14361) il y a 6 mois
  Radoslav Gerganov 8d94219a4a ggml : add ggml_set_rows (#14274) il y a 6 mois
  Georgi Gerganov e8215dbb96 metal : add special-case mat-vec mul for ne00 == 4 (#14385) il y a 6 mois
  Aman Gupta aa064b2eb7 CUDA: add mean operation (#14313) il y a 7 mois
  Aman Gupta c959f462a0 CUDA: add conv_2d_transpose (#14287) il y a 7 mois
  Ervin Áron Tasnádi 0d3984424f ggml-vulkan: adds support for op CONV_TRANSPOSE_1D (#13813) il y a 7 mois
  Johannes Gäßler 10d2af0eaa llama/ggml: add LLM training support (#10544) il y a 8 mois
  Georgi Gerganov b34443923c sync : ggml (#13268) il y a 8 mois
  Johannes Gäßler b0ecbd434b test: non-cont. b in test-backend-ops -o MUL_MAT (#13187) il y a 8 mois
  Johannes Gäßler e1e8e0991f CUDA: batched+noncont MMQ, refactor bs>1 MoE code (#13199) il y a 8 mois
  Xuan-Son Nguyen edb18b6e8f clip : fix pixtral on some GPU backends (#13097) il y a 8 mois
  Johannes Gäßler 658987cfc9 CUDA: noncont MMVQ + batched bs1 MUL_MAT_ID (#13014) il y a 9 mois
  Georgi Gerganov 2f74c354c0 graph : make FA compatible with MLA + add initial Metal kernels (#12953) il y a 9 mois
  Jeff Bolz 015022bb53 vulkan: enable coopmat2 FA gqa and split_k optimizations more often (#12931) il y a 9 mois
  Georgi Gerganov 1d2b613445 tests : fix init order (#0) il y a 9 mois
  Diego Devesa fe92821ea9 ggml : add bilinear upscale support (ggml/1185) il y a 9 mois
  Jeff Bolz f01bd02376 vulkan: Implement split_k for coopmat2 flash attention. (#12627) il y a 9 mois
  Georgi Gerganov b4ae50810e metal : improve FA + improve MoE (#12612) il y a 9 mois
  Jeff Bolz 9b169a4d4e vulkan: fix mul_mat_vec failure in backend tests (#12529) il y a 10 mois
  Georgi Gerganov ba932dfb50 ggml : fix quantized cpy op (#12310) il y a 10 mois
  Jeff Bolz eddfb43850 vulkan: Optimize mul_mat_vec p021 and nc shaders (#12505) il y a 10 mois