cturan/llama.cpp @ 0a5eebb45d5697127b84418576dc479c400c4b3d

同期ミラー https://github.com/cturan/llama.cpp

FK 84e723653c speculative: add --n-gpu-layers-draft option (#3063)		2 年前
..
CMakeLists.txt	47068e5170 speculative : PoC for speeding-up inference via speculative sampling (#2926)	2 年前
speculative.cpp	84e723653c speculative: add --n-gpu-layers-draft option (#3063)	2 年前