reasoning-gym/examples/trl/config
joesharratt1229 1c98584f28
Feat/unsloth example (#482)
* cleaned up examples

* updated failing hooks

* updated readme

* corrected linting checks
2025-06-28 17:04:38 +01:00
ds_zero2.yaml  tutorial(training): Add a minimal example with trl (#473)  2025-06-21 00:01:31 +02:00
grpo.yaml      Feat/unsloth example (#482)                                2025-06-28 17:04:38 +01:00