reasoning-gym

mirror of https://github.com/open-thought/reasoning-gym.git synced 2026-04-23 16:55:05 +00:00

History

Oliver Stanley eb69916c1b initial verl training codebase (#389 ) * fixes for latest verl * composite dataset training experiment * use stateful dataloaders to match verl changes * training readme * add formatting reward * length reward impl * standalone reasoning_gym config section * curriculum learning, new length reward, more config	2025-03-20 15:04:57 +00:00
..
__init__.py	initial verl training codebase (#389 )	2025-03-20 15:04:57 +00:00
datasets.py	initial verl training codebase (#389 )	2025-03-20 15:04:57 +00:00

* fixes for latest verl
* composite dataset training experiment
* use stateful dataloaders to match verl changes
* training readme
* add formatting reward
* length reward impl
* standalone reasoning_gym config section
* curriculum learning, new length reward, more config

2025-03-20 15:04:57 +00:00

__init__.py

initial verl training codebase (#389 )

2025-03-20 15:04:57 +00:00

datasets.py

initial verl training codebase (#389 )

2025-03-20 15:04:57 +00:00