Johannes Gäßler 7474e00b34 CUDA: fix crash with partial offloading of MoE (#13439) 8 months ago
..
cmake 771d84371c scripts : update sync + fix cmake merge 10 months ago
include 7f323a589f Add `--no-op-offload` to improve `-ot` pp perf in MoE models like llama4 400B (#13386) 8 months ago
src 7474e00b34 CUDA: fix crash with partial offloading of MoE (#13439) 8 months ago
.gitignore 17eb6aa8a9 vulkan : cmake integration (#8119) 1 year ago
CMakeLists.txt 13b0a04597 whisper: remove MSVC warnings pragmas (whisper/3090) 8 months ago