Files
admin 8e5a449007
Some checks are pending
Copilot Setup Steps / copilot-setup-steps (push) Waiting to run
Check Pre-Tokenizer Hashes / pre-tokenizer-hashes (push) Waiting to run
Python check requirements.txt / check-requirements (push) Waiting to run
Python Type-Check / python type-check (push) Waiting to run
Update Operations Documentation / update-ops-docs (push) Waiting to run
llama.cpp verification source 2026-05-22
2026-05-22 16:44:08 +08:00
..

Results

The llama-results tool can be used to --check the outputs of a model vs. a previous commit to detect whether they have changed. Example usage:

llama-results --model model.gguf --output results.gguf --prompt "People die when they are killed."  # writes results to file
llama-results --model model.gguf --output results.gguf --prompt "People die when they are killed." --check  # compares results vs file

The metric by which the results are compared is the normalized mean squared error (NMSE) with a tolerance of 10^{-6}.