mirror of https://github.com/open-thought/reasoning-gym.git
synced 2026-04-19 12:58:07 +00:00
updated failing hooks
parent 799eb51800
commit c44ff8c542
1 changed file with 2 additions and 2 deletions
@@ -23,8 +23,8 @@ python >= 3.10
python train_grpo_lora.py
```
-To customise/override any default arguments you can simply:
+To customise/override any default arguments you can simply:
```bash
python train_grpo_lora.py --dataset-name chain_sum --max-seq-length 512 --model-id Qwen/Qwen2.5-7B-Instruct
-**Note** the free open-source version of unsloth is currently built to train models in single GPU environments only.
+**Note** the free open-source version of unsloth is currently built to train models in single GPU environments only.