feat(run_eval): add checkpoint resume functionality and update example documentation;

- update new bootcamp benchmark dataset
This commit is contained in:
chenyongkang 2025-08-26 16:50:52 +08:00
parent 125a7818e0
commit 1a8477c8d8
166 changed files with 8877 additions and 5047 deletions

View file

@ -60,7 +60,8 @@ python examples/unittests/run_eval.py \
--max_concurrent_requests 128 \
--timeout 6000 \
--max_retries 16 \
--max_retrying_delay 60
--max_retrying_delay 60 \
--resume
```
Note: When `api_mode` is set to `completion`, ensure that the corresponding `template` is correctly configured (supported options include `r1`, `qwen`, `internthinker`, and `chatml` (no system prompt)). For more details, refer to the [Evaluation Manual](/examples/unittests/README.md).