updated failing hooks

This commit is contained in:
joesharratt1229 2025-06-27 07:59:58 +00:00
parent 799eb51800
commit c44ff8c542

View file

@ -23,8 +23,8 @@ python >= 3.10
python train_grpo_lora.py
```
To customise/override any default arguments you can simply:
To customise/override any default arguments you can simply:
```bash
python train_grpo_lora.py --dataset-name chain_sum --max-seq-length 512 --model-id Qwen/Qwen2.5-7B-Instruct
**Note** the free open-source version of unsloth is currently built to train models in single GPU environments only.
**Note** the free open-source version of unsloth is currently built to train models in single GPU environments only.