Mirror of https://github.com/NousResearch/atropos.git (synced 2026-04-28 17:29:30 +00:00)
Fix BLEUBERI environment server integration
Parent: 3109fe349b
Commit: 64a82c4b4f
3 changed files with 59 additions and 23 deletions
@@ -10,6 +10,22 @@ BLEUBERI uses BLEU scores (a simple n-gram matching metric) directly as rewards
2. Computes BLEU scores by comparing model outputs to these references
3. Uses these scores as rewards to train models through GRPO
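
Steps 2 and 3 can be sketched in plain Python. This is a minimal illustration under stated assumptions, not the environment's implementation: `simple_bleu` and `group_advantages` are hypothetical helper names, the BLEU variant is a simplified one with whitespace tokenization and add-one smoothing on higher-order n-grams (the environment itself lists `sacrebleu` as a dependency for BLEU), and the advantage step shows only the GRPO-style normalization of rewards within one group of completions for the same prompt.

```python
import math
from collections import Counter


def ngrams(tokens, n):
    """Return a Counter of the n-grams (as tuples) in a token list."""
    return Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))


def simple_bleu(hypothesis, references, max_n=4):
    """Simplified sentence-level BLEU in [0, 1] (illustration, not sacrebleu)."""
    hyp = hypothesis.split()
    refs = [r.split() for r in references]
    if not hyp or not refs:
        return 0.0
    log_precisions = []
    for n in range(1, max_n + 1):
        hyp_counts = ngrams(hyp, n)
        # Clip each n-gram count by its maximum count in any single reference.
        max_ref = Counter()
        for ref in refs:
            for gram, count in ngrams(ref, n).items():
                max_ref[gram] = max(max_ref[gram], count)
        matched = sum(min(c, max_ref[g]) for g, c in hyp_counts.items())
        total = sum(hyp_counts.values())
        if n == 1:
            if matched == 0:
                return 0.0  # no word overlap at all
            precision = matched / total
        else:
            # Add-one smoothing keeps the log defined for sparse matches.
            precision = (matched + 1) / (total + 1)
        log_precisions.append(math.log(precision))
    # Brevity penalty uses the reference length closest to the hypothesis.
    ref_len = min((len(r) for r in refs),
                  key=lambda n_ref: (abs(n_ref - len(hyp)), n_ref))
    bp = 1.0 if len(hyp) >= ref_len else math.exp(1 - ref_len / len(hyp))
    return bp * math.exp(sum(log_precisions) / max_n)


def group_advantages(rewards, eps=1e-6):
    """Normalize rewards within one group: (r - mean) / (std + eps)."""
    mean = sum(rewards) / len(rewards)
    std = math.sqrt(sum((r - mean) ** 2 for r in rewards) / len(rewards))
    return [(r - mean) / (std + eps) for r in rewards]


# Example: three sampled completions for one prompt, one reference answer.
references = ["the cat sat on the mat"]
completions = ["the cat sat on the mat", "a cat is on the mat", "dogs bark loudly"]
rewards = [simple_bleu(c, references) for c in completions]
advantages = group_advantages(rewards)
```

Completions that overlap more with the reference receive a higher reward, and after normalization the advantages within a group sum to roughly zero, so only the relative ranking inside the group drives the update.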
## Installation
Before using the BLEUBERI environment, you need to install its dependencies:
```bash
# Install the required dependencies
pip install -r environments/bleuberi/requirements.txt
```
The key dependencies include:
- `model2vec`: For embedding-based similarity metrics
- `bert-score`: For semantic similarity evaluation
- `sacrebleu`: For BLEU score calculation
- `evaluate`: Hugging Face library for loading and computing evaluation metrics
- `datasets`: Hugging Face library for loading and processing datasets
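
After running the install command, a quick check along the following lines can confirm that the listed packages are present. It is a sketch rather than part of the repository: `installed_versions` is a hypothetical helper, and the distribution names are assumed to match the names in the list above.

```python
from importlib import metadata

# Distribution names taken from the dependency list above (assumed to match PyPI).
REQUIRED = ["model2vec", "bert-score", "sacrebleu", "evaluate", "datasets"]


def installed_versions(packages):
    """Map each package name to its installed version, or None if missing."""
    versions = {}
    for name in packages:
        try:
            versions[name] = metadata.version(name)
        except metadata.PackageNotFoundError:
            versions[name] = None
    return versions


if __name__ == "__main__":
    for name, version in installed_versions(REQUIRED).items():
        print(f"{name}: {version or 'MISSING'}")
```

Any package reported as `MISSING` suggests the requirements file was not installed into the active Python environment.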
## Features
- BLEU-based reward functions (with support for multiple reference models)