reasoning-gym/training/utils
Oliver Stanley eb69916c1b
initial verl training codebase (#389)
* fixes for latest verl
* composite dataset training experiment
* use stateful dataloaders to match verl changes
* training readme
* add formatting reward
* length reward impl
* standalone reasoning_gym config section
* curriculum learning, new length reward, more config
2025-03-20 15:04:57 +00:00
__init__.py initial verl training codebase (#389) 2025-03-20 15:04:57 +00:00
datasets.py initial verl training codebase (#389) 2025-03-20 15:04:57 +00:00