llama.cpp verification source 2026-05-22
11
tools/results/README.md
Normal file
@@ -0,0 +1,11 @@
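# Results

The `llama-results` tool writes a model's outputs for a given prompt to a file. Run again with `--check`, it compares the current outputs against that file, which makes it possible to detect whether the outputs have changed relative to a previous commit.

Example usage: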
``` sh
llama-results --model model.gguf --output results.gguf --prompt "People die when they are killed." # writes results to file
llama-results --model model.gguf --output results.gguf --prompt "People die when they are killed." --check # compares results vs file
```
The metric by which the results are compared is the normalized mean squared error (NMSE) with a tolerance of $10^{-6}$.
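
For intuition, the comparison can be sketched in a few lines of Python. This is only an illustration and not the tool's implementation: it assumes the NMSE is the sum of squared differences divided by the sum of squares of the reference values, and that a check passes when the NMSE is at or below the tolerance. The names `nmse`, `outputs_match`, and `TOLERANCE` are hypothetical.

```python
TOLERANCE = 1e-6  # tolerance stated above


def nmse(reference, candidate):
    """Normalized mean squared error of candidate relative to reference.

    Assumed definition: sum((r - c)^2) / sum(r^2).
    """
    if len(reference) != len(candidate):
        raise ValueError("reference and candidate must have the same length")
    squared_error = sum((r - c) ** 2 for r, c in zip(reference, candidate))
    reference_power = sum(r * r for r in reference)
    if reference_power == 0.0:
        # All-zero reference: only an exact match counts as zero error.
        return 0.0 if squared_error == 0.0 else float("inf")
    return squared_error / reference_power


def outputs_match(reference, candidate, tolerance=TOLERANCE):
    """Return True if the outputs agree within the tolerance (assumed to be inclusive)."""
    return nmse(reference, candidate) <= tolerance
```

Because the error is normalized by the magnitude of the reference values, the same tolerance can be applied to outputs of very different scales.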