diff --git a/eval/README.md b/eval/README.md index b64f9926..d8bbe3a0 100644 --- a/eval/README.md +++ b/eval/README.md @@ -18,12 +18,17 @@ This framework provides tools to evaluate language models on the reasoning_gym d ## Setup -1. Set your OpenRouter API key as an environment variable: +1. Install the required dependencies: +```bash +pip install -r requirements.txt +``` + +2. Set your OpenRouter API key as an environment variable: ```bash export OPENROUTER_API_KEY=your-api-key ``` -2. Prepare your dataset configuration in JSON format (e.g., `eval_basic.json`): +3. Prepare your dataset configuration in JSON format (e.g., `eval_basic.json`): ```json [ { diff --git a/eval/eval.sh b/eval/eval.sh old mode 100644 new mode 100755 diff --git a/eval/requirements.txt b/eval/requirements.txt new file mode 100644 index 00000000..7bdefc17 --- /dev/null +++ b/eval/requirements.txt @@ -0,0 +1 @@ +openai>=1.64.0