Commit History

Author SHA1 Message Date
  Giuseppe Scrivano 0398752dd4 model : add Granite Hybrid types (#16635) 3 months ago
  Aaron Teo 4f73d0a951 ci : fix binaries release failure for s390x (binaries may not work yet) (#16664) 3 months ago
  Sigbjørn Skjæret cec5edbcae ci : avoid manual updates of docs/ops.md (#16663) 3 months ago
  Aaron Teo fcb235b466 ci: include s390x release binaries (#16648) 3 months ago
  Aman Gupta 55754bebd5 CODEOWNERS: update for ggml-cuda/mmf (#16660) 3 months ago
  Johannes Gäßler ee09828cb0 HIP: fix GPU_TARGETS (#16642) 3 months ago
  Jeff Bolz e56abd2098 vulkan: Implement topk_moe fused shader, ported from CUDA (#16641) 3 months ago
  Aman Gupta 38355c6c8e CUDA: use registers instead of smem in topk-moe (#16647) 3 months ago
  Shawn Gu 81387858f1 opencl: transposed gemm/gemv moe kernel with mxfp4,f32 (#16602) 3 months ago
  Johannes Gäßler 66b0dbcb2d llama-model: fix insonsistent ctxs <-> bufs order (#16581) 3 months ago
  Radoslav Gerganov 41386cf365 rpc : report actual free memory (#16616) 3 months ago
  Giuseppe Scrivano 3d4e86bbeb vulkan: Add State Space Model (SSM) Operations Support (#16463) 3 months ago
  muggle-stack 342c728d03 ggml : fix SpaceMit IME array out-of-bounds in task assignment (#16629) 3 months ago
  Pascal ababae7e1e webui: reorganize settings layout (#16607) 3 months ago
  Jeff Bolz b19491599d vulkan: fix debug build (add_rms_len/data not found) (#16624) 3 months ago
  Ilia Ilmer 9ad4f1931e metal : add `CONV_TRANSPOSE_2D` (#16542) 3 months ago
  Olivier Chafik 79967ec596 grammar : use int64_t to avoid int overflows in int schema to grammar conversion logic (#16626) 3 months ago
  GittyBurstein ceff6bb253 SYCL SET operator optimized for F32 tensors (#16350) 3 months ago
  Xuan-Son Nguyen 1bb4f43380 mtmd : support home-cooked Mistral Small Omni (#14928) 3 months ago
  Pascal 683fa6ba4e fix: added a normalization step for MathJax-style \[\] and \(\) delimiters (#16599) 3 months ago
  GittyBurstein b22572e97d sycl : add ARANGE operator (#16362) 3 months ago
  Chenguang Li 7a50cf388a CANN: format code using .clang-format (#15863) 3 months ago
  takasurazeem 6f5d924637 common : Update the docs on -t --threads (#16236) 3 months ago
  takuya kodama adc9b60f19 ggml-cpu: replace putenv with setenv for const-correctness (#16573) 3 months ago
  yael-works ee50ee1ead SYCL: Add GGML_OP_MEAN operator support (#16009) 3 months ago
  Aleksei Nikiforov 7adc79c032 gguf-py : add support for endian conversion of BF16 data (#16594) 3 months ago
  safranowith 466c1911ab cpu : add FLOOR, CEIL, ROUND and TRUNC unary operators (#16083) 3 months ago
  lhez 0cb7a0683b opencl: add q8_0 mm support (#16469) 3 months ago
  lhez d93f8439b0 opencl: fix FA for f32 (#16584) 3 months ago
  Aleksander Grygier f9fb33f263 Add server-driven parameter defaults and syncing (#16515) 3 months ago