balyan.sid@gmail.com | 4ba69d3a80 | revert to using evalbase | 2026-01-23 23:41:32 +05:30
balyan.sid@gmail.com | 5a20abdce7 | switch eval to use managed server adapter impl. moved managed server adapter | 2026-01-23 23:26:29 +05:30
balyan.sid@gmail.com | c56af35eaa | switch to evalbase for verifiers_eval.py | 2026-01-15 11:34:40 +05:30
pre-commit-ci[bot] | 7907ffd0ad | [pre-commit.ci] auto fixes from pre-commit.com hooks; for more information, see https://pre-commit.ci | 2026-01-12 05:05:11 +00:00
balyan.sid@gmail.com | 24b4488c60 | clean up eval, pin verifiers version | 2026-01-12 10:34:05 +05:30
pre-commit-ci[bot] | d98bc6d9fc | [pre-commit.ci] auto fixes from pre-commit.com hooks; for more information, see https://pre-commit.ci | 2026-01-12 10:34:05 +05:30
balyan.sid@gmail.com | cf636595d2 | rework server and eval for rl rollout. add in asyncmanagedserver for verifiers | 2026-01-12 10:34:05 +05:30
pre-commit-ci[bot] | 3449a4c23d | [pre-commit.ci] auto fixes from pre-commit.com hooks; for more information, see https://pre-commit.ci | 2026-01-12 10:34:05 +05:30
balyan.sid@gmail.com | 5b09ad86f4 | update readme, add sft-datagen to verifiers_server | 2026-01-09 19:20:41 +05:30
balyan.sid@gmail.com | 636715bb08 | add wandb to eval | 2026-01-09 16:51:19 +05:30
balyan.sid@gmail.com | dda85430da | fix docstrings | 2026-01-09 16:25:44 +05:30
balyan.sid@gmail.com | 9d5cd2b593 | fix: improve verifiers environments consistency and correctness. verifiers_server.py: consistent dataset column selection for train/test, remove redundant comments, preserve float precision for scores. verifiers_eval.py: add env_config_cls, fix constructor signature to match BaseEnv (slurm bool), make stub methods raise NotImplementedError | 2026-01-09 16:21:12 +05:30