mirror of
https://github.com/open-thought/reasoning-gym.git
synced 2026-04-19 12:58:07 +00:00
* feat: Add support for generating multiple completions per prompt * feat: Track best and mean scores for multiple completions per prompt * feat: Add checkpoint and resume functionality to evaluation script |
||
|---|---|---|
| .. | ||
| claude-3.5-sonnet.yaml | ||
| claude-3.7-sonnet_thinking.yaml | ||
| deepseek-r1.yaml | ||
| llama-3.3-70b-instruct.yaml | ||
| openai-o1.yaml | ||
| openai-o3-mini.yaml | ||