docs: Update installation instructions in eval README

2026-04-19 12:58:07 +00:00 · 2025-02-25 15:28:12 +01:00 · e48c1f82cd
commit e48c1f82cd
parent a1b0a0414e
2 changed files with 12 additions and 5 deletions
--- a/eval/README.md
+++ b/eval/README.md
@@ -18,17 +18,22 @@ This framework provides tools to evaluate language models on the reasoning_gym d

 ## Setup

-1. Install the required dependencies:
+1. Install reasoning-gym in development mode:
 ```bash
-pip install -r requirements.txt
+pip install -e ..
 ```

-2. Set your OpenRouter API key as an environment variable:
+2. Install the additional dependencies required for evaluation:
+```bash
+pip install -r requirements-eval.txt
+```
+
+3. Set your OpenRouter API key as an environment variable:
 ```bash
 export OPENROUTER_API_KEY=your-api-key
 ```

-3. Prepare your dataset configuration in JSON format (e.g., `eval_basic.json`):
+4. Prepare your dataset configuration in JSON format (e.g., `eval_basic.json`):
 ```json
 [
  {
@@ -47,9 +52,11 @@ You can run evaluations in two ways:

 1. Using the provided bash script:
 ```bash
-./run_eval.sh
+./eval.sh
 ```

+   Before running, you may want to edit the `eval.sh` script to configure which models to evaluate by modifying the `MODELS` array.
+
 2. Running the Python script directly:
 ```bash
 python eval.py --model "model-name" --config "eval_basic.json" --output-dir "results"
--- a/eval/requirements-eval.txt
+++ b/eval/requirements-eval.txt
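
Review note: the updated setup steps (export `OPENROUTER_API_KEY`, prepare a JSON config such as `eval_basic.json`) could be checked before launching an evaluation. The following is a minimal sketch of such a check, not part of this commit. The function name `preflight` is hypothetical, and the assumption that the config is a top-level JSON array is taken only from the README example shown in the diff.

```python
import json
import os
from pathlib import Path


def preflight(config_path, env=None):
    """Return a list of problems found; an empty list means the setup looks ready.

    Hypothetical helper, not part of the eval scripts in this repository.
    """
    env = os.environ if env is None else env
    problems = []

    # Setup step 3: the OpenRouter API key must be exported.
    if not env.get("OPENROUTER_API_KEY"):
        problems.append("OPENROUTER_API_KEY is not set")

    # Setup step 4: the dataset configuration must exist and parse as JSON.
    path = Path(config_path)
    if not path.is_file():
        problems.append(f"config file not found: {path}")
    else:
        try:
            config = json.loads(path.read_text(encoding="utf-8"))
        except json.JSONDecodeError as exc:
            problems.append(f"config is not valid JSON: {exc}")
        else:
            # Assumption: the README example shows a top-level JSON array.
            if not isinstance(config, list):
                problems.append("config should be a JSON array")

    return problems
```

Calling `preflight("eval_basic.json")` from the `eval` directory would return an empty list when both steps are done, and a list of messages otherwise. Passing `env` explicitly keeps the check easy to test without touching the real environment.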