reasoning-gym/training/trainers/__init__.py at 7fdba320460c125f4e113efb1cb7c418b228d3af

open-thought/reasoning-gym

mirror of https://github.com/open-thought/reasoning-gym.git synced 2026-04-24 17:05:03 +00:00

Oliver Stanley eb69916c1b

initial verl training codebase (#389)

* fixes for latest verl
* composite dataset training experiment
* use stateful dataloaders to match verl changes
* training readme
* add formatting reward
* length reward impl
* standalone reasoning_gym config section
* curriculum learning, new length reward, more config

2025-03-20 15:04:57 +00:00

3 lines

75 B

Python


	`from .ray_grpo_trainer import RayGRPOTrainer`

	`__all__ = ["RayGRPOTrainer"]`
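The two lines above re-export `RayGRPOTrainer` at package level and declare it as the package's only public name. As a rough illustration of what that pattern does, the sketch below builds a throwaway package with the same layout and checks what a star import exposes. The package name `demo_trainers`, the empty stand-in class, and the `_private_helper` name are hypothetical; the real `ray_grpo_trainer` module is not reproduced here.

```python
import importlib
import sys
import tempfile
from pathlib import Path


def build_demo_package(root: Path) -> None:
    """Create a throwaway package that mirrors the re-export pattern."""
    pkg = root / "demo_trainers"  # hypothetical package name
    pkg.mkdir()
    # Stand-in for ray_grpo_trainer.py: an empty class plus a private name.
    (pkg / "ray_grpo_trainer.py").write_text(
        "class RayGRPOTrainer:\n"
        "    pass\n"
        "\n"
        "_private_helper = object()\n"
    )
    # Same two statements as the __init__.py shown above.
    (pkg / "__init__.py").write_text(
        "from .ray_grpo_trainer import RayGRPOTrainer\n"
        "\n"
        '__all__ = ["RayGRPOTrainer"]\n'
    )


def load_demo():
    """Import the demo package and collect what `import *` brings in."""
    tmp = tempfile.mkdtemp()
    build_demo_package(Path(tmp))
    sys.path.insert(0, tmp)
    importlib.invalidate_caches()
    package = importlib.import_module("demo_trainers")
    namespace = {}
    exec("from demo_trainers import *", namespace)
    return package, namespace


package, star_namespace = load_demo()
# Ignore dunder entries such as __builtins__ that exec() adds on its own.
exported = sorted(name for name in star_namespace if not name.startswith("__"))
print(package.__all__)
print(exported)
```

Because `__all__` is defined, the star import binds only `RayGRPOTrainer`. The `ray_grpo_trainer` submodule is still reachable as an attribute of the package, but it is not part of the declared public surface.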