Commit graph

2 commits

Author SHA1 Message Date
joesharratt1229
1c98584f28
Feat/unsloth example (#482)
* cleaned up examples

* updated failing hooks

* updated readme

* corrected linting checks
2025-06-28 17:04:38 +01:00
joesharratt1229
51c2afc1fc
Fix/verl example (#465)
* updated verl ex

* updated script

* removed curriculum verl and updated

* updatied linting errors

* renamed

* updated config
2025-06-09 09:53:43 +01:00
Renamed from examples/veRL/chain_sum/main_ppo_custom_reward.py (Browse further)