cturan/llama.cpp

同期ミラー https://github.com/cturan/llama.cpp

作者	SHA1 メッセージ	日付
David Huang	7f323a589f Add `--no-op-offload` to improve `-ot` pp perf in MoE models like llama4 400B (#13386)	8 ヶ月前
fairydreaming	8fcb563613 Load all MoE experts during warmup (#11571)	10 ヶ月前
Georgi Gerganov	f66f582927 llama : refactor `src/llama.cpp` (#10902)	1 年間前