Test training with trl (#70)

* first trl grpo implementation
* added config yaml file
* added read me and dependencies
* updated reward format func
This commit is contained in:
joesharratt1229 2025-02-07 06:42:32 +00:00 committed by GitHub
parent a607db79f7
commit d61db3772a
5 changed files with 287 additions and 0 deletions

5
examples/trl/README.md Normal file
View file

@ -0,0 +1,5 @@
1. Install the requirements in the txt file
```
pip install -r requirements.txt
```