| .. |
|
cmake
|
771d84371c
scripts : update sync + fix cmake merge
|
10 months ago |
|
include
|
7f323a589f
Add `--no-op-offload` to improve `-ot` pp perf in MoE models like llama4 400B (#13386)
|
8 months ago |
|
src
|
7474e00b34
CUDA: fix crash with partial offloading of MoE (#13439)
|
8 months ago |
|
.gitignore
|
17eb6aa8a9
vulkan : cmake integration (#8119)
|
1 year ago |
|
CMakeLists.txt
|
13b0a04597
whisper: remove MSVC warnings pragmas (whisper/3090)
|
8 months ago |