reasoning-gym/eval/yaml
Andreas Köpf 850c1cf6f4
Eval script consolidation (#238)
The script now supports:
   - YAML and JSON configurations
   - Dataset-specific parameters
   - Overriding configuration via command line
   - Detailed logging and error handling
2025-02-27 17:39:14 +01:00
..
claude-3.5-sonnet.yaml Eval script consolidation (#238) 2025-02-27 17:39:14 +01:00
deepseek-r1.yaml Eval script consolidation (#238) 2025-02-27 17:39:14 +01:00
llama-3.3-70b-instruct.yaml Eval script consolidation (#238) 2025-02-27 17:39:14 +01:00
openai-o3.yaml Eval script consolidation (#238) 2025-02-27 17:39:14 +01:00