Commit History

Author SHA1 Message Date
  Noah 1f5accb8d0 Fix garbled output with REPACK at high thread counts (#16956) 2 months ago
  Aman Gupta 2759ccdb4a CUDA: avoid mul + bias fusion when doing fusion (#16935) 2 months ago
  lhez c5023daf60 opencl: support imrope (#16914) 2 months ago
  Aleksander Grygier e7da30b584 fix: Viewing multiple PDF attachments (#16974) 2 months ago
  Daniel Bevenius ed8aa63320 model-conversion : pass config to from_pretrained (#16963) 2 months ago
  Georgi Gerganov 48bd26501b server : add props.model_alias (#16943) 2 months ago
  theo77186 622cd010ff ggml: CUDA: add head size 72 for flash-attn (#16962) 2 months ago
  Xuan-Son Nguyen 070ff4d535 mtmd: add --image-min/max-tokens (#16921) 2 months ago
  Xuan-Son Nguyen bf7b0c9725 mtmd: pad mask for qwen2.5vl (#16954) 2 months ago
  Jinyang He fcfce040e8 ggml : LoongArch fixes (#16958) 2 months ago
  Olivier Chafik ee3a5a10ad sync: minja (glm 4.6 & minmax m2 templates) (#16949) 2 months ago
  shani-f 7e994168b1 SYCL: optimized repeat_back kernel (3× fewer asm instructions, 2× faster)Feature/sycl repeat back opt (#16869) 2 months ago
  Sascha Rogmann bcfa87622a feat(webui): improve LaTeX rendering with currency detection (#16508) 2 months ago
  Shagun Bera a2054e3a8f test-backend-ops : fix segfault in moe-expert-reduce test in support mode and coverage (#16936) 2 months ago
  Sigbjørn Skjæret dd52868050 ci : disable failing riscv cross build (#16952) 2 months ago
  Zhiyong Wang 6b9a52422b model: add Janus Pro for image understanding (#16906) 2 months ago
  Georgi Gerganov 2f966b8ed8 clip : use FA (#16837) 2 months ago
  Georgi Gerganov cd5e3b5754 server : support unified cache across slots (#16736) 2 months ago
  Aldehir Rojas 87c9efc3b2 common : move gpt-oss reasoning processing to init params (#16937) 2 months ago
  Adrian Lundberg 76af40aaaa docs: remove llama_sampler_accept reference in sampling sample usage (#16920) 2 months ago
  mnehete32 7db35a7958 CUDA: add FLOOR, CEIL, ROUND, TRUNC unary ops (#16917) 2 months ago
  Aaron Teo a864132ba5 devops: fix failing s390x docker build (#16918) 2 months ago
  Aaron Teo d38d9f0877 ggml: add s390x cpu-feats (#16774) 2 months ago
  Georgi Gerganov 7fd205a8e8 scripts : add script to bench models (#16894) 2 months ago
  Pascal 2f68ce7cfd webui: auto-refresh /props on inference start to resync model metadata (#16784) 2 months ago
  Pascal e4a71599e5 webui: add HTML/JS preview support to MarkdownContent with sandboxed iframe (#16757) 2 months ago
  Adrien Gallouët dd5e8cab51 vendor : update cpp-httplib to 0.27.0 (#16846) 2 months ago
  Xuan-Son Nguyen cf659bbb8e mtmd: refactor preprocessing + support max/min pixels (#16878) 2 months ago
  Aleksander Grygier d8b860a219 Add a setting to display message generation statistics (#16901) 2 months ago
  Jaromír Hradílek 1ae74882f8 webui: recognize AsciiDoc files as valid text files (#16850) 2 months ago