mirror of
https://github.com/open-thought/reasoning-gym.git
synced 2026-04-30 17:40:45 +00:00
3 commits
| Author | SHA1 | Date | |
|---|---|---|---|
|
|
9f9f816902 | ||
|
|
7368d6d313 | ||
|
|
6fa76f11b5 |
Renamed from training/configs/llama3.1_1b_grpo.yaml (Browse further)