[eval-basic] initial scripts for evaluating models on reasoning gym

rishabhranawat 2025-02-09 22:36:27 -08:00
parent 8c4400b18a
commit 75cfd31ec2
11 changed files with 1306 additions and 0 deletions

@@ -5,3 +5,4 @@ isort>=5.13.2
 flake8>=7.1.1
 mypy>=1.14.1
 pre-commit>=4.1.0
+openai>=1.61.1