Mirror of https://github.com/open-thought/reasoning-gym.git (synced 2026-05-01 17:45:24 +00:00)
* cleaned up examples
* updated failing hooks
* updated readme
* corrected linting checks

| Name |
|---|
| .. |
| grpo_trainer.yaml |