Historique des commits

Auteur SHA1 Message Date
  Johannes Gäßler 1425f587a8 CUDA: attention sinks for mma FlashAttention (#15157) il y a 5 mois
  lhez aaa3d07ae7 opencl: support sink in `soft_max` (attn sinks) (#15152) il y a 5 mois
  Xuan-Son Nguyen 50aa938901 convert : support non-mxfp4 HF model (#15153) il y a 5 mois
  Jeff Bolz c4f53563df vulkan: support fattn sinks (#15126) il y a 5 mois
  Jeff Bolz a0552c8bee vulkan: Add env var to disable host visible vidmem (#15109) il y a 5 mois
  RunningLeon 99acbc9921 llama : Support intern-s1 (#14875) il y a 5 mois
  uvos 7ad67ba9fe HIP: add cmake option to enable compiler output of kernel resource usage metrics (#15103) il y a 5 mois
  Christian Kastner 9a96389544 ggml: Skip backend library linking code when GGML_BACKEND_DL=ON (#15094) il y a 5 mois
  Johannes Gäßler 1d72c84188 CUDA: GEMM for FP32/FP16/BF16 and ne11 <= 16 (#15131) il y a 5 mois
  Johannes Gäßler 20638e4f16 scripts: fix crash when --tool is not set (#15133) il y a 5 mois
  Daniel Bevenius 36d3f00e14 requirements : fix PyTorch uint64 compatibility (#15134) il y a 5 mois
  Reese Levine 5fd160bbd9 ggml: Add basic SET_ROWS support in WebGPU (#15137) il y a 5 mois
  rmatif 756cfea826 fix profiling crash (#15072) il y a 5 mois
  lhez e725a1a982 opencl: add `swiglu_oai` and `add_id` (#15121) il y a 5 mois
  Sachin Desai 3db4da56a5 chat : support Granite model reasoning and tool call (#14864) il y a 5 mois
  Juk Armstrong 476aa3fd57 Fixed name `-override-tensors` to `-override-tensor` (#15129) il y a 5 mois
  Diego Devesa 0d8831543c ggml : fix fallback to CPU for ununsupported ops (#15118) il y a 5 mois
  Sigbjørn Skjæret 65c797c4fa chat : fix yandex chat template (#15116) il y a 5 mois
  stevenkuang 25726898e8 chat : fix hunyuan auto-detection (#15114) il y a 5 mois
  Chenguang Li 2241453252 CANN: add support for ACL Graph (#15065) il y a 5 mois
  Reese Levine 9515c6131a ggml: WebGPU disable SET_ROWS for now (#15078) il y a 5 mois
  Georgi Gerganov fd1234cb46 llama : add gpt-oss (#15091) il y a 5 mois
  Sigbjørn Skjæret f324a3b715 chat : only remove double bos/eos if added (#15086) il y a 5 mois
  Georgi Gerganov be42642581 readme : update hot topics (#15097) il y a 6 mois
  Romain Biessy 3306ceabf0 sycl: fix mul_mat selection (#15092) il y a 6 mois
  Juk Armstrong c81de6e107 Fix `glm4moe` bug (#15088) il y a 6 mois
  Alex Wu 22f060c9c4 webui: fix markdown table (#15081) il y a 6 mois
  compilade ee3a9fcf88 context : fix index overflow on huge outputs (#15080) il y a 6 mois
  Diego Devesa ec428b02c3 llama : add --n-cpu-moe option (#15077) il y a 6 mois
  compilade 19f68fa5a4 imatrix : warn when GGUF imatrix is saved without .gguf suffix (#15076) il y a 6 mois