reward-kit run (Recommended for datasets/examples)
The reward-kit run CLI command uses Hydra for configuration, allowing you to define your dataset, model, and reward logic in YAML files.
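As a rough illustration of the Hydra-style layout described above, the sketch below models a configuration as nested groups and applies a single command-line-style key.path=value override. The group names follow the text (dataset, model, reward). Every key and value inside them is a hypothetical placeholder and not reward-kit's actual schema. The apply_override and parse_value helpers are simplified stand-ins for what Hydra does, written with the standard library only.

```python
import copy

# Hypothetical configuration mirroring the three areas the text mentions.
# None of these keys are taken from reward-kit's real YAML files.
BASE_CONFIG = {
    "dataset": {"path": "data/sample.jsonl", "max_samples": 10},
    "model": {"name": "example-model", "temperature": 0.0},
    "reward": {"function": "my_package.my_reward"},
}


def parse_value(raw):
    """Interpret an override value as int, then float, else keep the string."""
    for cast in (int, float):
        try:
            return cast(raw)
        except ValueError:
            pass
    return raw


def apply_override(config, override):
    """Return a copy of config with one 'a.b=value' override applied.

    Only keys that already exist can be overridden; an unknown key raises
    KeyError instead of being created silently.
    """
    dotted_key, sep, raw_value = override.partition("=")
    if not sep or not dotted_key:
        raise ValueError(f"expected 'key.path=value', got {override!r}")

    updated = copy.deepcopy(config)  # leave the base config untouched
    node = updated
    *parents, leaf = dotted_key.split(".")
    for part in parents:
        node = node[part]  # KeyError if an intermediate group is missing
    if leaf not in node:
        raise KeyError(dotted_key)
    node[leaf] = parse_value(raw_value)
    return updated


if __name__ == "__main__":
    cfg = apply_override(BASE_CONFIG, "dataset.max_samples=5")
    print(cfg["dataset"])
```

Keeping the base configuration immutable and returning a modified copy makes it easy to compare several runs that differ by one setting. Hydra's own override grammar is considerably richer than this sketch.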
Examples are located in the examples/ directory at the root of the repository. The main Examples README provides an overview and guidance on their structure. Each example (e.g., examples/math_example/) has its own README explaining how to run it.
reward-kit preview
After running reward-kit run, a preview_input_output_pairs.jsonl file is typically generated in the output directory. You can use reward-kit preview to inspect these pairs or re-evaluate them with different metrics: