Commit graph

4 commits

Author SHA1 Message Date
joesharratt1229
1c98584f28
Feat/unsloth example (#482)
* cleaned up examples

* updated failing hooks

* updated readme

* corrected linting checks
2025-06-28 17:04:38 +01:00
Oliver Stanley
bd13b1b92a
Fix chain sum veRL example for latest veRL (#371)
* fixes for latest verl

* add balance_batch config

* 1 -> 2 gpu

* tweaks

* also add raw ids to server script
2025-03-14 20:15:54 +01:00
Andreas Koepf
2ae21c6548
update config to latest veRL version
2025-02-17 18:43:51 +00:00
Andreas Koepf
3f24df31dc
add deps for veRL experiment in README
2025-02-01 21:27:33 +00:00