Mirror of https://github.com/open-thought/reasoning-gym.git
Synced 2026-05-02 17:45:58 +00:00
| Name |
|---|
| accelerate_ds_cfgs |
| DeepSeek-R1-Distill-Qwen-1.5B/grpo |
| Qwen2.5-3B-Instruct/grpo |