mirror of
https://github.com/open-thought/reasoning-gym.git
synced 2026-04-22 16:49:06 +00:00
* first trl grpo implementation * added config yaml file * added read me and dependencies * updated reward format func
5 lines
85 B
Markdown
5 lines
85 B
Markdown
1. Install the requirements in the txt file
|
|
|
|
```
|
|
pip install -r requirements.txt
|
|
```
|