Commit History

Author SHA1 Message Date
  Jeff Bolz c10ed6cbcc vulkan: Disable coopmat/coopmat2/bfloat extensions if glslc doesn't support it (#13696) 8 months ago
  Judd a127ff1780 use LOG_WARN to replace `std::cerr` (#13657) 8 months ago
  Diego Devesa 3079e9ac8e release : fix windows hip release (#13707) 8 months ago
  Georgi Gerganov 8a1d206f1d tts : fix n_ubatch + make WavTokenizer cache-less (#13713) 8 months ago
  Xuan-Son Nguyen 797990c4bc mtmd : add ultravox audio input (#13623) 8 months ago
  Aaron Teo ab86335760 common: Include torch package for s390x (#13699) 8 months ago
  Georgi Gerganov cc74d5be99 server : pad small embedding batches (#13692) 8 months ago
  Sigbjørn Skjæret 5be24af73d gguf-py : correct charsmap parameter typing (#13701) 8 months ago
  Nicolò Scipione d394a9aedc sycl : Remove waits from function calls (#13702) 8 months ago
  Ewan Crawford 6b56a64690 SYCL: Avoid using with SYCL-Graph for unsupported nodes (#13587) 8 months ago
  Henry Linjamäki a4e8912dfd opencl: Add support for multiple devices (#12622) 8 months ago
  Henry Linjamäki edbf42edfd opencl: fix couple crashes (#12795) 8 months ago
  Diego Devesa d643bb2c79 releases : build CPU backend separately (windows) (#13642) 8 months ago
  Georgi Gerganov 8e186ef0e7 hparams : support models for which all layers use SWA (#13682) 8 months ago
  Georgi Gerganov 5fbfe384d4 server : improve error reporting (#13680) 8 months ago
  antichristHater c76532e7ba convert : add qwen2vl support for unsloth merges (#13686) 8 months ago
  Sigbjørn Skjæret 2aa777d86d examples : switch retrieval to llama_encode (#13685) 8 months ago
  Emmanuel Ferdman eb0f5c28d3 gguf-py : display the invalid gguf type (#13687) 8 months ago
  Xuan-Son Nguyen cf4cb59e64 ggml : add ggml_gelu_erf() (#13667) 8 months ago
  Robin Davidsson 0d5c742161 server : Add the endpoints /api/tags and /api/chat (#13659) 8 months ago
  Dorin-Andrei Geman 42158ae2e8 server : fix first message identification (#13634) 8 months ago
  Georgi Gerganov 797f2ac062 kv-cache : simplify the interface (#13660) 8 months ago
  Georgi Gerganov b44890df2e model : disable SWA for Phi models (#13676) 8 months ago
  R0CKSTAR 33983057d0 musa: Upgrade MUSA SDK version to rc4.0.1 and use mudnn::Unary::IDENTITY op to accelerate D2D memory copy (#13647) 8 months ago
  Eve fb1cab201c vulkan: fix warnings (#13626) 8 months ago
  l3utterfly b7a17463ec mtmd-helper : bug fix to token batching in mtmd (#13650) 8 months ago
  Georgi Gerganov be0239693c model : fix llama4 graph (#13663) 8 months ago
  Georgi Gerganov a4090d1174 llama : remove llama_kv_cache_view API + remove deprecated (#13653) 8 months ago
  Johannes Gäßler b69f1647f9 CUDA: skip fully masked-out KV in FA vec kernel (#13584) 8 months ago
  Sigbjørn Skjæret 759e37b0d8 tests : avoid github urls due to throttling (#13654) 8 months ago