Some checks are pending
Copilot Setup Steps / copilot-setup-steps (push) Waiting to run
Check Pre-Tokenizer Hashes / pre-tokenizer-hashes (push) Waiting to run
Python check requirements.txt / check-requirements (push) Waiting to run
Python Type-Check / python type-check (push) Waiting to run
Update Operations Documentation / update-ops-docs (push) Waiting to run
10 lines
282 B
Markdown
10 lines
282 B
Markdown
# llama.cpp/examples/speculative
|
|
|
|
Demonstration of speculative decoding and tree-based speculative decoding techniques
|
|
|
|
More info:
|
|
|
|
- https://github.com/ggml-org/llama.cpp/pull/2926
|
|
- https://github.com/ggml-org/llama.cpp/pull/3624
|
|
- https://github.com/ggml-org/llama.cpp/pull/5625
|