updated for config by dataset (#257)

* updated for config by dataset

* updated read me
This commit is contained in:
joesharratt1229 2025-03-03 21:58:32 +01:00 committed by GitHub
parent c0cf237474
commit 6770ee3eef
No known key found for this signature in database
GPG key ID: B5690EEEBB952194
2 changed files with 40 additions and 7 deletions

View file

@ -126,11 +126,24 @@ Options:
- `--size`: Default dataset size (default: 100)
- `--seed`: Default dataset seed (default: 42)
- `--include-params`: Include all configuration parameters (default: False)
- `--category`: Only include datasets from this category (default: None)
#### Generating Config for a Specific Category
To generate a configuration file containing only datasets from a specific category:
```bash
python generate_config.py --category algorithmic --output algorithmic_datasets.yaml --model "anthropic/claude-3.5-sonnet"
```
This will create a configuration file that includes only datasets in the "algorithmic" category. This is useful when you want to focus your evaluation on a specific type of reasoning tasks.
Example categories include: math, arithmetic, reasoning, algorithmic, etc. The category is automatically extracted from the dataset's module name (e.g., from `reasoning_gym.math.dataset_name`, it extracts "math").
You can see all available categories by running the script without the `--category` option, as it will print all categories at the end of execution.
### Running Evaluations
To run evaluations:
```bash
python eval.py --config configs/your_config.yaml
```