[pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci
pre-commit-ci[bot] 2025-07-30 15:10:32 +00:00
parent 75f1cf6d2a
commit 65aea8bb21

@@ -281,4 +281,4 @@ python arena_hard_environment.py evaluate \
- **Robust Parsing**: Multiple regex patterns for judgment extraction ([[A>B]], [[B>A]], [[A=B]])
- **Thinking Validation**: Strict validation of thinking tag format and content extraction
- **Error Handling**: Comprehensive retry logic with exponential backoff
- **Arena-Hard Compatibility**: Scores and metrics match original Arena-Hard methodology
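
As an illustration of the judgment extraction described in the list above, here is a minimal sketch. It assumes only the three labels named there (`[[A>B]]`, `[[B>A]]`, `[[A=B]]`); the names `JUDGMENT_RE` and `extract_judgment` are hypothetical and are not taken from `arena_hard_environment.py`, whose actual patterns may differ.

```python
import re

# Hypothetical pattern: tolerates whitespace inside the double brackets.
# Only the three labels mentioned in the README are recognized.
JUDGMENT_RE = re.compile(r"\[\[\s*([AB])\s*(>|=)\s*([AB])\s*\]\]")


def extract_judgment(text):
    """Return 'A>B', 'B>A', 'A=B', or None when no valid label is present."""
    matches = JUDGMENT_RE.findall(text)
    if not matches:
        return None
    # Assumption: when several labels appear, the last one is the final verdict.
    left, op, right = matches[-1]
    if left == right:
        # Labels such as [[A>A]] are malformed and rejected.
        return None
    if op == "=":
        # Normalize [[B=A]] to the canonical tie label.
        return "A=B"
    return f"{left}>{right}"
```

A caller could treat a `None` result as a parse failure and pass it to whatever retry logic the environment uses.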