reasoning-gym/training/qwen-math/recipes
Zafir Stojanovski 0cda6b1205
qwen math training code (#435)
* qwen math training code

* pre-commit
2025-05-16 13:19:19 +02:00
..
accelerate_ds_cfgs qwen math training code (#435) 2025-05-16 13:19:19 +02:00
DeepSeek-R1-Distill-Qwen-1.5B/grpo qwen math training code (#435) 2025-05-16 13:19:19 +02:00
Qwen2.5-3B-Instruct/grpo qwen math training code (#435) 2025-05-16 13:19:19 +02:00