Commit History

Author SHA1 Message Date
  Xuan-Son Nguyen 00fa15fedc mtmd : add support for Voxtral (#14862) 6 months ago
  Johannes Gäßler 946b1f6859 CUDA: fix pointer incrementation in FA (#14916) 6 months ago
  Dongliang Wei 6c6e397aff model : add support for SmallThinker series (#14898) 6 months ago
  Alberto Cabrera Pérez afc0e89698 sycl: refactor quantization to q8_1 (#14815) 6 months ago
  Georgi Gerganov a5771c9eea ops : update BLAS (#14914) 6 months ago
  Georgi Gerganov c35f9eaf09 ops : update Metal (#14912) 6 months ago
  Georgi Gerganov 1f45f2890e sync : ggml 6 months ago
  Kai Pastor 613c5095c3 cmake : Indent ggml-config.cmake (ggml/1310) 6 months ago
  Ed Addario 7f97599581 quantize : update README.md (#14905) 6 months ago
  Ruben Ortlam bf78f5439e vulkan: add ops docs (#14900) 6 months ago
  Akarshan Biswas bbfc849274 SYCL: add ops doc (#14901) 6 months ago
  Daniel Bevenius ca0ef2dddb llama : clarify comment about pp and tg graphs [no ci] (#14895) 6 months ago
  Erik Scholz 89d1029559 vulkan : add fp16 support for the conv_2d kernel (#14872) 6 months ago
  Jeff Bolz f1a4e72de5 vulkan: skip empty set_rows to avoid invalid API usage (#14860) 6 months ago
  Gabriel Larson 4762ad7316 model : make rope_yarn_log_mul optional for deepseek2 (#14896) 6 months ago
  Shunta Saito 1dc9614e06 llama : fix kq_scale for the attention layers of PLaMo2 (#14892) 6 months ago
  Aman Gupta 446595b9b3 Docs: add instructions for adding backends (#14889) 6 months ago
  deepsek 66906cd82a HIP: Enable Matrix cores for MMQ Kernels, Enable stream-K for CDNA 3 (#14624) 6 months ago
  hipudding 11dd5a44eb CANN: Implement GLU ops (#14884) 6 months ago
  R0CKSTAR 9b8f3c6c77 musa: fix build warnings (unused variable) (#14869) 6 months ago
  Aaron Teo c7f3169cd5 ggml-cpu : disable GGML_NNPA by default due to instability (#14880) 6 months ago
  Gabe Goodhart 793c0d7f46 metal: SSM_SCAN performance (#14743) 6 months ago
  lhez ce111d39d6 opencl: add fused `rms_norm_mul` (#14841) 6 months ago
  wooksong e7fecba934 docs : update HOWTO‑add‑model.md for ModelBase and new model classes (#14874) 6 months ago
  Oliver Simons e2b7621e7c ggml : remove invalid portPos specifiers from dot files (#14838) 6 months ago
  Georgi Gerganov c1dbea752a context : restore preemptive sched reset when LLAMA_SET_ROWS=0 (#14870) 6 months ago
  kiwi 749e0d27f0 mtmd : fix 32-bit narrowing issue in export-lora and mtmd clip (#14503) 6 months ago
  Chris Rohlf 64bf1c3744 rpc : check for null buffers in get/set/copy tensor endpoints (#14868) 6 months ago
  Diego Devesa c12bbde372 sched : fix multiple evaluations of the same graph with pipeline parallelism (#14855) 6 months ago
  R0CKSTAR 3f4fc97f1d musa: upgrade musa sdk to rc4.2.0 (#14498) 6 months ago