diff --git a/README.md b/README.md index 0ae8b8ae..15ae2475 100644 --- a/README.md +++ b/README.md @@ -22,7 +22,24 @@ -Atropos is a Language Model Reinforcement Learning Environments framework for collecting and evaluating LLM trajectories through diverse environments including: +--- + +## What is Atropos? +Atropos is an environments provider and framework for RL Training. Atropos encompasses both the environments used during the RL Training and sets them up as services and a trajectory API for the environments to send data created by the environments to and collate them for the trainer to pull batches created by the environments. + + + +Atropos is a robust, scalable framework for **Reinforcement Learning Environments with LLMs**. Key features: + +- **Multi-Turn & Asynchronous RL:** Efficiently supports complex, multi-turn, and asynchronous interactions, decoupling environment steps from policy updates. +- **Inference Agnostic:** Integrates with standard inference APIs (e.g., OpenAI, vLLM, SGLang), enabling easy switching between LLM providers and frameworks. +- **Trainer Independent:** Offers a standardized training interface for experimenting with different RL algorithms and frameworks without major code changes. +- **Scalable & Decentralized:** Easily scale by launching more environment instances (locally or across decentralized resources) that contribute rollouts to a central service. +- **Diverse Environment Integration:** Manages many varied environment types concurrently for heterogeneous, multi-modal training. + +The goal: provide a flexible, scalable, and standardized platform to accelerate LLM-based RL research across diverse, interactive settings. + +The framework supports collecting, distributing and evaluating LLM trajectories through diverse environments including: