reasoning-gym/training/configs/intra_generalisation
2025-04-24 19:36:30 +01:00
algebra_qwen_3b.yaml      impl conditional reward  2025-04-24 19:36:30 +01:00
algorithmic_qwen_3b.yaml  impl conditional reward  2025-04-24 19:36:30 +01:00
arithmetic_qwen_3b.yaml   impl conditional reward  2025-04-24 19:36:30 +01:00
cognition_qwen_3b.yaml    impl conditional reward  2025-04-24 19:36:30 +01:00
games_qwen_3b.yaml        impl conditional reward  2025-04-24 19:36:30 +01:00
graphs_qwen_3b.yaml       impl conditional reward  2025-04-24 19:36:30 +01:00