update to tech report version (#10)

* feat(run_eval): add checkpoint resume functionality and update example documentation; - update new bootcamp benchmark dataset * refactor(data_pipeline): optimize data generation pipeline; add multiple preset configurations for data generation * docs: update bootcamp list and add new scripts - Update Fulllist_InternBootcamp.md with new bootcamps and categories - Add new scripts to .gitignore: - examples/pipelines/filter_autogen_configs.py - examples/pipelines/quickgen_data_configs_from_eval_meta.py - Update dependencies in setup.py: - Add scipy and scikit-learn * refactor(internbootcamp): update bootcamp modules and improve error handling - Update import statements in __init__.py files - Add timestamp to target directory name in verl_data_preprocess.py - Improve error handling and scoring logic in bootcamp_judger.py - Remove unnecessary comments and update puzzle descriptions in multiple files
2026-04-19 12:58:04 +00:00 · 2025-08-28 12:39:47 +08:00 · 2025-08-28 12:39:47 +08:00 · a8249acc18
commit a8249acc18
parent 125a7818e0
2952 changed files with 105460 additions and 17649 deletions
--- a/examples/unittests/README_zh.md
+++ b/examples/unittests/README_zh.md
@ -22,7 +22,8 @@ python examples/unittests/run_eval.py \
    --timeout 6000 \
    --api_mode completion \
    --max_retries 16 \
-    --max_retrying_delay 60
+    --max_retrying_delay 60 \
+    --resume
 ```

 ---
@ -46,6 +47,8 @@ python examples/unittests/run_eval.py \
 | `--sys_prompt`          | str        | `"You are an expert reasoner..."`        | 系统提示内容，仅在 `api_mode` 为 `chat_completion` 时生效。          |
 | `--max_retries`         | int        | `16`                                     | 单个请求失败重试次数。                                              |
 | `--max_retrying_delay`  | int        | `60`                                     | 最大重试延迟时间（秒）。                           |
+| `--resume`              | bool        | `true`                                     | 是否从上次中断的位置继续执行。                                     |
+| `--check_model_url`     | bool        | `true`                                     | 在开始评测前检查模型服务的 URL 是否可用。                             |

 ##### 参数关系
 - `--api_mode`为`chat_completion`时，`--sys_prompt`参数才有效。