reasoning-gym/training/configs/inter_generalisation
2025-04-24 19:36:30 +01:00
..
algebra_qwen_3b.yaml impl conditional reward 2025-04-24 19:36:30 +01:00
algorithmic_qwen_3b.yaml impl conditional reward 2025-04-24 19:36:30 +01:00
games_qwen_3b.yaml impl conditional reward 2025-04-24 19:36:30 +01:00
logic_qwen_3b.yaml impl conditional reward 2025-04-24 19:36:30 +01:00