Mirror of https://github.com/open-thought/reasoning-gym.git, synced 2026-04-22 16:49:06 +00:00
* fixes for latest verl * add balance_batch config * 1 -> 2 gpu * tweaks * also add raw ids to server script
| Name |
|---|
| OpenRLHF |
| trl |
| unsloth |
| veRL |
| word_ladder |