Mirror of https://github.com/open-thought/reasoning-gym.git
synced 2026-04-26 17:13:17 +00:00
| Name |
|---|
| accelerate_ds_cfgs |
| DeepSeek-R1-Distill-Qwen-1.5B/grpo |
| Qwen2.5-3B-Instruct/grpo |