update to tech report version (#10)

* feat(run_eval): add checkpoint resume functionality and update example documentation
  - update new bootcamp benchmark dataset
* refactor(data_pipeline): optimize data generation pipeline; add multiple preset configurations for data generation
* docs: update bootcamp list and add new scripts
  - Update Fulllist_InternBootcamp.md with new bootcamps and categories
  - Add new scripts to .gitignore:
    - examples/pipelines/filter_autogen_configs.py
    - examples/pipelines/quickgen_data_configs_from_eval_meta.py
  - Update dependencies in setup.py:
    - Add scipy and scikit-learn
* refactor(internbootcamp): update bootcamp modules and improve error handling
  - Update import statements in __init__.py files
  - Add timestamp to target directory name in verl_data_preprocess.py
  - Improve error handling and scoring logic in bootcamp_judger.py
  - Remove unnecessary comments and update puzzle descriptions in multiple files
2026-04-22 16:49:04 +00:00 · 2025-08-28 12:39:47 +08:00 · 2025-08-28 12:39:47 +08:00 · a8249acc18
commit a8249acc18
parent 125a7818e0
2952 changed files with 105460 additions and 17649 deletions
--- a/examples/unittests/README.md
+++ b/examples/unittests/README.md
@@ -22,7 +22,8 @@ python examples/unittests/run_eval.py \
    --timeout 6000 \
    --api_mode completion \
    --max_retries 16 \
-    --max_retrying_delay 60
+    --max_retrying_delay 60 \
+    --resume
 ```

 ---
@@ -46,7 +47,8 @@ Here are the main parameters supported by the script and their meanings:
 | `--sys_prompt`             | str    | `"You are an expert reasoner..."` | System prompt content; only effective when `api_mode` is `chat_completion`. |
 | `--max_retries`            | int    | `16`                            | Number of retries per failed request.                                      |
 | `--max_retrying_delay`     | int    | `60`                            | Maximum delay between retries in seconds.                                  |
-
+| `--resume`                 | bool   | `true`                          | Resume from previous run.                                                  |
+| `--check_model_url`     | bool        | `true`                                     | Check if the model service URL is available before starting the evaluation. |
 ##### Parameter Relationships
 - `--sys_prompt` is only effective if `--api_mode` is set to `chat_completion`.
 - `--template` is only effective if `--api_mode` is set to `completion`.
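
The parameter relationships above can be illustrated with a minimal sketch. It is not taken from `run_eval.py`: the helper names `build_parser` and `effective_prompt_options`, the `completion` default for `--api_mode`, and the `None` default for `--template` are assumptions made for illustration only.

```python
import argparse


def build_parser() -> argparse.ArgumentParser:
    # Hypothetical parser covering only the three options discussed above;
    # the real script defines many more, and its defaults may differ.
    parser = argparse.ArgumentParser(description="Sketch of api_mode-dependent options")
    parser.add_argument(
        "--api_mode",
        choices=["completion", "chat_completion"],
        default="completion",  # assumed default, not confirmed by the README
    )
    parser.add_argument("--sys_prompt", type=str, default="You are an expert reasoner...")
    parser.add_argument("--template", type=str, default=None)  # assumed default
    return parser


def effective_prompt_options(args: argparse.Namespace) -> dict:
    """Return only the prompt-related options that apply to the chosen api_mode."""
    if args.api_mode == "chat_completion":
        # --sys_prompt is only effective in chat_completion mode.
        return {"sys_prompt": args.sys_prompt}
    # --template is only effective in completion mode.
    return {"template": args.template}


if __name__ == "__main__":
    cli_args = build_parser().parse_args()
    print(effective_prompt_options(cli_args))
```

In this sketch, an option that does not apply to the selected mode is dropped silently. Whether the actual script ignores it, warns, or raises an error is not stated in the README.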