reasoning-gym/training/trainers
joesharratt1229 4b60c32978
Curr exp (#487)
* began curr exp

* added holdout words

* updated config

* added context

* updated base curriculum

* updaed

* updated curriculum

* updated

* updated

* updated automatic flag

* updated ray trainer

* update
2025-07-25 20:38:47 +01:00
..
__init__.py initial verl training codebase (#389) 2025-03-20 15:04:57 +00:00
ray_grpo_trainer.py Curr exp (#487) 2025-07-25 20:38:47 +01:00