mirror of https://github.com/NousResearch/atropos.git (synced 2026-04-19 12:57:58 +00:00)
Update README.md
parent a282604baa
commit 3863ece98b
1 changed file with 2 additions and 0 deletions
@@ -2,6 +2,8 @@
This directory contains an example script (`grpo.py`) demonstrating how to integrate a custom training loop with the Atropos API for reinforcement learning using the GRPO (Group Relative Policy Optimization) algorithm.
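
As a rough illustration of the "group relative" part of GRPO, the sketch below normalizes the rewards of several completions sampled for the same prompt against that group's own mean and standard deviation. It is not taken from `grpo.py`: the function name `group_relative_advantages`, the `eps` parameter, and the use of the population standard deviation are assumptions made for this example, and the script itself may compute advantages differently.

```python
from statistics import fmean, pstdev


def group_relative_advantages(rewards, eps=1e-8):
    """Return one advantage per completion in a single group.

    Hypothetical helper, not part of grpo.py. `rewards` holds the scalar
    rewards of all completions sampled for the same prompt. Each reward is
    centered on the group mean and scaled by the group's population
    standard deviation; `eps` avoids division by zero when every
    completion in the group received the same reward.
    """
    mean = fmean(rewards)
    std = pstdev(rewards)
    return [(r - mean) / (std + eps) for r in rewards]


if __name__ == "__main__":
    # Four completions for one prompt: two rewarded, two not.
    print(group_relative_advantages([1.0, 0.0, 0.0, 1.0]))
```

Completions that score above their group's average get a positive advantage and those below get a negative one, so no separate value model is needed to provide a baseline.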
**Note: The example trainer does not support multimodal training out of the box. As other trainers add support for Atropos, we will list them in the main README. Some of them may support multimodal RL, so please check the main repo README for updates.**
This example uses `vLLM` for efficient inference during the (simulated) data generation phase and `transformers` for the training phase.
**Note:** This script is intended as a *reference example* for API integration and basic training setup. It is not optimized for large-scale, efficient training.