mirror of
https://github.com/open-thought/reasoning-gym.git
synced 2026-04-19 12:58:07 +00:00
Test training with trl (#70)
* first trl grpo implementation * added config yaml file * added read me and dependencies * updated reward format func
This commit is contained in:
parent
3f6b2fc807
commit
a8e11e71be
5 changed files with 287 additions and 0 deletions
5
examples/trl/README.md
Normal file
5
examples/trl/README.md
Normal file
|
|
@ -0,0 +1,5 @@
|
|||
1. Install the requirements in the txt file
|
||||
|
||||
```
|
||||
pip install -r requirements.txt
|
||||
```
|
||||
Loading…
Add table
Add a link
Reference in a new issue