# Minimal Diplomacy Environment

A simplified Diplomacy RL training environment for Atropos that integrates with AI_Diplomacy.

## Overview

This minimal implementation provides:
- Basic game integration via the AI_Diplomacy submodule
- Parallel rollouts with configurable group_size
- LLM request interception through the `AtroposClientMinimal` proxy
- Simple supply-center-based scoring
- No complex features (no GRPO, memory systems, or advanced scoring)

## Architecture

```
Atropos Policy Server
        ↓
AtroposClientMinimal (proxy)
        ↓
AI_Diplomacy Game Engine
        ↓
Game Execution
```
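The proxy layer in the diagram can be pictured with a minimal sketch. It is not the real `AtroposClientMinimal`: the class `RecordingProxy`, its `generate` method, and the `fake_policy` backend are hypothetical names used only to illustrate the idea of forwarding each LLM request to the policy server while recording the exchange.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List


@dataclass
class RecordingProxy:
    """Hypothetical stand-in for a client proxy; not the real AtroposClientMinimal."""

    # Callable that forwards a prompt to the policy server and returns its reply.
    backend: Callable[[str], str]
    # Every intercepted prompt/response pair, in call order.
    interactions: List[Dict[str, str]] = field(default_factory=list)

    def generate(self, prompt: str) -> str:
        # Forward the request, then record the exchange before returning it.
        response = self.backend(prompt)
        self.interactions.append({"prompt": prompt, "response": response})
        return response


def fake_policy(prompt: str) -> str:
    # Placeholder for a real policy-server call (assumption for this sketch).
    return "A PAR H"


proxy = RecordingProxy(backend=fake_policy)
reply = proxy.generate("FRANCE, spring 1901: submit orders.")
```

After the call, `proxy.interactions` holds one recorded exchange, which is the kind of data a trajectory could be built from.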

## Quick Start

1. Install dependencies:
```bash
pip install -r requirements.txt
cd AI_Diplomacy
pip install -e .
```

2. Start your Atropos policy server on port 8000

3. Run the environment:
```bash
python diplomacy_env_minimal.py serve
```

## Configuration

Key settings in `DiplomacyEnvMinimalConfig`:
- `max_game_turns`: Number of game turns (default: 10)
- `training_power`: Which power the RL agent controls (default: "FRANCE")
- `group_size`: Number of parallel games run per training step (default: 4)
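As an illustration of these settings and their defaults, here is a stand-in dataclass. The name `MinimalConfigSketch` is hypothetical; the real `DiplomacyEnvMinimalConfig` may use a different base class and carry additional fields.

```python
from dataclasses import dataclass


@dataclass
class MinimalConfigSketch:
    """Hypothetical mirror of the three documented settings; not the real config class."""

    max_game_turns: int = 10         # number of game turns
    training_power: str = "FRANCE"   # power controlled by the RL agent
    group_size: int = 4              # number of parallel games


# Defaults as documented above.
default_cfg = MinimalConfigSketch()

# Overriding selected fields for a shorter game played as another power.
short_cfg = MinimalConfigSketch(max_game_turns=5, training_power="ENGLAND")
```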

## How It Works

1. **Parallel Rollouts**: Each training step runs `group_size` games with the same initial seed
2. **LLM Interception**: AtroposClientMinimal intercepts all LLM calls from AI_Diplomacy
3. **Trajectory Collection**: Game interactions are collected and scored
4. **Best Selection**: The highest-scoring trajectory is returned for training
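The steps above can be sketched roughly as follows. This is a toy model under stated assumptions, not the environment's actual code: `supply_center_score`, `run_stub_game`, and `rollout_group` are hypothetical helpers, the score is assumed to be the net supply centers gained, and each game is replaced by a stub that returns a predetermined final center count.

```python
import asyncio
from typing import List, Tuple

# France starts a standard Diplomacy game with three supply centers.
INITIAL_CENTERS = 3


def supply_center_score(final_centers: int) -> float:
    """Assumed scoring rule for this sketch: net supply centers gained."""
    return float(final_centers - INITIAL_CENTERS)


async def run_stub_game(game_index: int, final_centers: int) -> Tuple[str, float]:
    """Stand-in for one full game; returns a trajectory label and its score."""
    # Yield control once, as a real game would while awaiting LLM replies.
    await asyncio.sleep(0)
    return f"game-{game_index}", supply_center_score(final_centers)


async def rollout_group(outcomes: List[int]) -> Tuple[str, float]:
    """Run one stub game per outcome concurrently and keep the best-scoring one."""
    results = await asyncio.gather(
        *(run_stub_game(i, centers) for i, centers in enumerate(outcomes))
    )
    return max(results, key=lambda item: item[1])


# A group of four games (group_size = 4) with made-up final center counts.
best_trajectory, best_score = asyncio.run(rollout_group([3, 5, 4, 2]))
```

Here the second game ends with the most supply centers, so its trajectory is the one selected.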