Commit History

作者 SHA1 備註 提交日期
  Daniel Bevenius bb77764c2d convert : clarify sentence-transformers-dense-modules help [no ci] (#18662) 3 周之前
  Sigbjørn Skjæret 9dfa8ee950 ci : run cann build unconditionally [no ci] (#18659) 3 周之前
  Jeff Bolz ca4a8370bc vulkan: reject ops when a tensor is too large to allocate (#18646) 3 周之前
  virajwad 03023296cf vulkan: Warptile tuning for Intel Xe2/Xe3 (#18178) 3 周之前
  Eve 8c77a04cc7 vulkan: more mul mat optimizations (#18533) 3 周之前
  Daniel Bevenius ffba4f29e6 examples : add debug utility/example (#18464) 3 周之前
  hipudding 3333951d86 CANN: Fix rename for get_env (#18652) 3 周之前
  Raul Torres 193ee38a1b CANN: Rename `get_env` to `get_env_as_lowercase` (#18624) 3 周之前
  Max Krasnyansky 95ea9e0861 Hexagon add support for f16/f32 flash attention, scale, set-rows and improve f16/32 matmul (#18611) 3 周之前
  Tarek Dakhran ccbc84a537 mtmd: mtmd_audio_streaming_istft (#18645) 3 周之前
  Johannes Gäßler 68b4d516c3 llama-params-fit: fix last devices with low VRAM (#18494) 3 周之前
  Aadeshveer Singh 24af22fc36 ggml : optimize cuda ssm_scan using warp-level reduction (#18505) 3 周之前
  Xuan-Son Nguyen 07fbe19f1f arg: use CSV escape style for multiple-value args (#18643) 3 周之前
  Jeff Bolz ea13cba850 vulkan: support buffer_from_host_ptr (#18467) 3 周之前
  Aman Gupta 090b137e56 ggml-cuda: refactor cuda graph usage (#18637) 3 周之前
  Beinsezii 968929528c mmq.cu: tune mmq/rocblas switching for RDNA (#18537) 3 周之前
  R 3d26a09dc7 server : add thinking content blocks to Anthropic Messages API (#18551) 3 周之前
  Christian Kastner bd2a93d475 gguf-py : add requests to dependencies (#18629) 3 周之前
  Adrien Gallouët e75ee11024 ggml : fix avx512bf16 build (#18623) 3 周之前
  Raul Torres da9b8d3300 CANN: Make `valid_values` variable `static const` (#18627) 3 周之前
  nwyin e443fbcfa5 ggml webgpu: add CEIL operation support (#18605) 3 周之前
  Tarek Dakhran 73d284a250 model : add LFM2-ColBert-350M (#18607) 3 周之前
  Johannes Gäßler df17a4c94f CUDA: fix FA FP16 accumulator overflow for Granite (#18614) 3 周之前
  tt 1871f0ba56 add YoutuVLForConditionalGeneration architectures (#18620) 3 周之前
  Aman Gupta f47edb8c19 ggml-cuda: check for srcs outside the cgraph (#18583) 3 周之前
  Vladislav Sayapin da143b9940 server : fix router child env in containerized environments (#18562) 3 周之前
  Jeff Bolz f1768d8f03 vulkan: fix topk_moe_sigmoid_norm_bias failures in GLM-4.6 (#18582) 3 周之前
  Georgi Gerganov 2da64a2f8a models : fix backend assignment for Granite/Nemotron graphs (#18599) 3 周之前
  Jeff Bolz b37124d2d2 vulkan: handle quantize_q8_1 overflowing the max workgroup count (#18515) 3 周之前
  Sigbjørn Skjæret eadc4184ca llama : refactor rope_freq_base/scale_swa conversion and init (#18553) 3 周之前