reasoning-gym/examples
joesharratt1229 9234aa77bf
Feat/open instruct example (#381)
* added open-instruct

* fixed hooks

* GRPO

---------

Co-authored-by: Andreas Koepf <andreas.koepf@provisio.com>
2025-03-17 23:20:11 +01:00
open-instruct  Feat/open instruct example (#381)                                               2025-03-17 23:20:11 +01:00
OpenRLHF       use native types List->list, Dict->dict, Set->set, Tuple->tuple                 2025-02-21 15:15:38 +01:00
trl            docs: Update TRL README with GRPO example details and usage instructions (#76)  2025-02-07 07:56:22 +01:00
unsloth        Better progress tracking                                                        2025-02-20 23:32:54 +00:00
veRL           use StatefulDataLoader in veRL examples (#378)                                  2025-03-17 07:28:10 +01:00
word_ladder    more native type hints                                                          2025-02-21 21:23:14 +01:00