reasoning-gym/examples/veRL/chain_sum
Andreas Köpf c69bc5d4e6
Basic curriculum (#198)
* feat: Add optional curriculum support to dataset registration and creation
* docs: Add docstrings to create_curriculum() and register_dataset()
* feat: Add curriculum configuration classes for CurriculumExperiment
* feat: Add weight parameter to CurriculumAttributeConfig and use in DatasetSpec
* refactor: Simplify CurriculumAttributeConfig with "*" attribute level support
* test: Add unit tests for CurriculumExperiment class
* feat: Add from_yaml() method to CurriculumExperimentConfig with unit test
2025-03-07 11:22:12 +01:00
..
config Basic curriculum (#198) 2025-03-07 11:22:12 +01:00
launch_on_2gpu_server.sh Basic curriculum (#198) 2025-03-07 11:22:12 +01:00
launch_on_4gpu.sh Basic curriculum (#198) 2025-03-07 11:22:12 +01:00
main_ppo_custom_reward.py Basic curriculum (#198) 2025-03-07 11:22:12 +01:00
main_ppo_custom_reward_server.py Basic curriculum (#198) 2025-03-07 11:22:12 +01:00
train_grpo.sh Basic curriculum (#198) 2025-03-07 11:22:12 +01:00
train_grpo_server.sh Basic curriculum (#198) 2025-03-07 11:22:12 +01:00
train_ppo.sh Basic curriculum (#198) 2025-03-07 11:22:12 +01:00