reasoning-gym/training/qwen-math/scripts/training
Zafir Stojanovski 0cda6b1205
qwen math training code (#435)
* qwen math training code

* pre-commit
2025-05-16 13:19:19 +02:00
..
post_train_eval_baselines.sh qwen math training code (#435) 2025-05-16 13:19:19 +02:00
post_train_eval_local.sh qwen math training code (#435) 2025-05-16 13:19:19 +02:00
post_train_grpo.sh qwen math training code (#435) 2025-05-16 13:19:19 +02:00
run_post_train_eval.py qwen math training code (#435) 2025-05-16 13:19:19 +02:00
run_post_train_merge.py qwen math training code (#435) 2025-05-16 13:19:19 +02:00