Trang chủ Khám phá Trợ giúp

cturan

/

llama.cpp

mirror of https://github.com/cturan/llama.cpp

1

0

Các tập tin Các vấn đề 0 Wiki

Branch: master

k2v2

master

minimax

qwen3_next

qwen3_next_optimized

toolinjection

test

b6814

llama.cpp

/

examples

/

speculative

/

README.md

README.md 282 B

Permalink Lịch sử Raw

llama.cpp/examples/speculative

Demonstration of speculative decoding and tree-based speculative decoding techniques

More info:

https://github.com/ggml-org/llama.cpp/pull/2926
https://github.com/ggml-org/llama.cpp/pull/3624
https://github.com/ggml-org/llama.cpp/pull/5625

© 2026 Git

Trang: 437ms Mẫu: 30ms

Vietnamese

Vietnamese English 简体中文繁體中文（香港）繁體中文（臺灣） Deutsch français Nederlands latviešu русский 日本語 español português do Brasil polski български italiano suomi Türkçe čeština српски svenska 한국어 galego українська English (United Kingdom) Magyar Slovenčina Indonesian Persian Português Монгол Română

Javascript Licenses Website