reasoning-gym/examples
joesharratt1229 51c2afc1fc
Fix/verl example (#465)
* updated verl ex

* updated script

* removed curriculum verl and updated

* updatied linting errors

* renamed

* updated config
2025-06-09 09:53:43 +01:00
..
open-instruct Feat/open instruct example (#381) 2025-03-17 23:20:11 +01:00
OpenRLHF use native types List->list, Dict->dict, Set->set, Tuple->tuple 2025-02-21 15:15:38 +01:00
trl docs: Update TRL README with GRPO example details and usage instructions (#76) 2025-02-07 07:56:22 +01:00
unsloth Better progress tracking 2025-02-20 23:32:54 +00:00
veRL Fix/verl example (#465) 2025-06-09 09:53:43 +01:00
word_ladder more native type hints 2025-02-21 21:23:14 +01:00