Fix BLEUBERI environment server integration

This commit is contained in:
Allan Niemerg 2025-06-09 10:28:00 -05:00
parent 3109fe349b
commit 64a82c4b4f
3 changed files with 59 additions and 23 deletions

View file

@ -10,6 +10,22 @@ BLEUBERI uses BLEU scores (a simple n-gram matching metric) directly as rewards
2. Computes BLEU scores by comparing model outputs to these references
3. Uses these scores as rewards to train models through GRPO
## Installation
Before using the BLEUBERI environment, you need to install its dependencies:
```bash
# Install the required dependencies
pip install -r environments/bleuberi/requirements.txt
```
The key dependencies include:
- `model2vec`: For embedding-based similarity metrics
- `bert-score`: For semantic similarity evaluation
- `sacrebleu`: For BLEU score calculation
- `evaluate`: For evaluation metrics
- `datasets`: For dataset handling
## Features
- BLEU-based reward functions (with support for multiple reference models)