mirror of
https://github.com/open-thought/reasoning-gym.git
synced 2026-04-26 17:13:17 +00:00
initial verl training codebase (#389)
* fixes for latest verl * composite dataset training experiment * use stateful dataloaders to match verl changes * training readme * add formatting reward * length reward impl * standalone reasoning_gym config section * curriculum learning, new length reward, more config
This commit is contained in:
parent
ce0a6c4878
commit
eb69916c1b
8 changed files with 910 additions and 0 deletions
3
training/utils/__init__.py
Normal file
3
training/utils/__init__.py
Normal file
|
|
@ -0,0 +1,3 @@
|
|||
from .datasets import ReasoningGymDataset, make_dataset
|
||||
|
||||
__all__ = ["ReasoningGymDataset", "make_dataset"]
|
||||
Loading…
Add table
Add a link
Reference in a new issue