Commit graph

9 commits

Author SHA1 Message Date
pre-commit-ci[bot]
7907ffd0ad [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
2026-01-12 05:05:11 +00:00
balyan.sid@gmail.com
24b4488c60 clean up eval, pin verifiers version 2026-01-12 10:34:05 +05:30
pre-commit-ci[bot]
d98bc6d9fc [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
2026-01-12 10:34:05 +05:30
balyan.sid@gmail.com
cf636595d2 rework server and eval for rl rollout. add in asyncmanagedserver for
verifiers
2026-01-12 10:34:05 +05:30
pre-commit-ci[bot]
3449a4c23d [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
2026-01-12 10:34:05 +05:30
balyan.sid@gmail.com
5b09ad86f4 update readme, add sft-datagen to verifiers_server 2026-01-09 19:20:41 +05:30
balyan.sid@gmail.com
636715bb08 add wandb to eval 2026-01-09 16:51:19 +05:30
balyan.sid@gmail.com
dda85430da fix docstrings 2026-01-09 16:25:44 +05:30
balyan.sid@gmail.com
9d5cd2b593 fix: improve verifiers environments consistency and correctness
- verifiers_server.py: consistent dataset column selection for train/test,
  remove redundant comments, preserve float precision for scores
- verifiers_eval.py: add env_config_cls, fix constructor signature to match
  BaseEnv (slurm bool), make stub methods raise NotImplementedError
2026-01-09 16:21:12 +05:30