update to tech report version (#10)

* feat(run_eval): add checkpoint resume functionality and update example documentation;
- update new bootcamp benchmark dataset

* refactor(data_pipeline): optimize data generation pipeline; add multiple preset configurations for data generation

* docs: update bootcamp list and add new scripts

- Update Fulllist_InternBootcamp.md with new bootcamps and categories
- Add new scripts to .gitignore:
  - examples/pipelines/filter_autogen_configs.py
  - examples/pipelines/quickgen_data_configs_from_eval_meta.py
- Update dependencies in setup.py:
  - Add scipy and scikit-learn

* refactor(internbootcamp): update bootcamp modules and improve error handling

- Update import statements in __init__.py files
- Add timestamp to target directory name in verl_data_preprocess.py
- Improve error handling and scoring logic in bootcamp_judger.py
- Remove unnecessary comments and update puzzle descriptions in multiple files
This commit is contained in:
Yongkang Chen 2025-08-28 12:39:47 +08:00 committed by GitHub
parent 125a7818e0
commit a8249acc18
No known key found for this signature in database
GPG key ID: B5690EEEBB952194
2952 changed files with 105460 additions and 17649 deletions

View file

@ -22,7 +22,8 @@ python examples/unittests/run_eval.py \
--timeout 6000 \
--api_mode completion \
--max_retries 16 \
--max_retrying_delay 60
--max_retrying_delay 60 \
--resume
```
---
@ -46,6 +47,8 @@ python examples/unittests/run_eval.py \
| `--sys_prompt` | str | `"You are an expert reasoner..."` | 系统提示内容,仅在 `api_mode``chat_completion` 时生效。 |
| `--max_retries` | int | `16` | 单个请求失败重试次数。 |
| `--max_retrying_delay` | int | `60` | 最大重试延迟时间(秒)。 |
| `--resume` | bool | `true` | 是否从上次中断的位置继续执行。 |
| `--check_model_url` | bool | `true` | 在开始评测前检查模型服务的 URL 是否可用。 |
##### 参数关系
- `--api_mode``chat_completion`时,`--sys_prompt`参数才有效。