mirror of
https://github.com/open-thought/reasoning-gym.git
synced 2026-04-19 12:58:07 +00:00
* added open-instruct * fixed hooks * GRPO --------- Co-authored-by: Andreas Koepf <andreas.koepf@provisio.com> |
||
|---|---|---|
| .. | ||
| __init__.py | ||
| grpo_trainer.py | ||
| utils.py | ||