Commit graph

3 commits

Author SHA1 Message Date
Oliver Stanley
bd13b1b92a
Fix chain sum veRL example for latest veRL (#371)
* fixes for latest verl

* add balance_batch config

* 1 -> 2 gpu

* tweaks

* also add raw ids to server script
2025-03-14 20:15:54 +01:00
Andreas Koepf
2ae21c6548
update config to latest veRL version
2025-02-17 18:43:51 +00:00
Andreas Koepf
3f24df31dc
add deps for veRL experiment in README
2025-02-01 21:27:33 +00:00