diff --git a/README.md b/README.md
index 5567d5ec..620a8f7a 100644
--- a/README.md
+++ b/README.md
@@ -51,7 +51,9 @@ metadata: {'animals': {'sheep': 2, 'dog': 2}, 'total_legs': 16}
 
 ## 🔍 Evaluation
 
-Evaluation of the performance of different reasoning models will be tracked in the [reasoning-gym-eval](https://github.com/open-thought/reasoning-gym-eval) repo.
+Instructions for running the evaluation scripts are provided in [eval/README.md](https://github.com/open-thought/reasoning-gym/blob/main/eval/README.md).
+
+Evaluation results of different reasoning models will be tracked in the [reasoning-gym-eval](https://github.com/open-thought/reasoning-gym-eval) repo.
 
 ## 👷 Contributing
 