Commit graph

763 commits

Author SHA1 Message Date
Teknium
9047f03109
Merge pull request #297 from NousResearch/add_reasoning_handling_draft
Add support for reasoning models and their variety of providers/endpo…
2026-01-15 19:43:17 -08:00
crStiv
7e12fa015c
Update README.md 2026-01-15 16:09:46 +02:00
crStiv
b624cbd246
Update plot.py 2026-01-15 16:09:00 +02:00
crStiv
14b82ae6cc
Update configs.py 2026-01-15 16:07:00 +02:00
crStiv
941fadd73c
Update run.py 2026-01-15 16:06:43 +02:00
crStiv
20992ed5d5
Update hpo.py 2026-01-15 16:05:27 +02:00
crStiv
d2fbe43e7e
Update lcb_modal_endpoint.py 2026-01-15 16:00:03 +02:00
balyan.sid@gmail.com
c56af35eaa switch to evalbase for verifiers_eval.py 2026-01-15 11:34:40 +05:30
teknium
00a0f5397a Merge branch 'add_reasoning_handling_draft' of https://github.com/NousResearch/atropos into add_reasoning_handling_draft 2026-01-14 13:38:08 +00:00
teknium
3a854cc3af fix linter 2026-01-14 13:38:04 +00:00
balyan.sid@gmail.com
6a27e88023 use managed server 2026-01-14 17:09:01 +05:30
balyan.sid@gmail.com
32320512e8 update verifiers_server to use tokenizer_for_trainer 2026-01-13 15:00:54 +05:30
pre-commit-ci[bot]
79a55ff186 [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
2026-01-13 07:30:33 +00:00
teknium
2a7dd49328 Merge branch 'add_reasoning_handling_draft' of https://github.com/NousResearch/atropos into add_reasoning_handling_draft 2026-01-13 07:29:48 +00:00
teknium
b33cb7f943 A bit more updates for robustness 2026-01-13 07:29:43 +00:00
Teknium
837fc237ee
Merge branch 'main' into add_reasoning_handling_draft 2026-01-12 09:45:38 -08:00
balyan.sid@gmail.com
a1d1e7d7fe fix env_args, dataset/prompt loading 2026-01-12 10:39:43 +05:30
pre-commit-ci[bot]
7907ffd0ad [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
2026-01-12 05:05:11 +00:00
balyan.sid@gmail.com
9db6c0d1ed added better wandb logging 2026-01-12 10:34:05 +05:30
balyan.sid@gmail.com
dceb1d8fd8 parallelize verifiers_server: use generate() for SFT, parallel
ManagedServer contexts for RL
2026-01-12 10:34:05 +05:30
balyan.sid@gmail.com
24b4488c60 clean up eval, pin verifiers version 2026-01-12 10:34:05 +05:30
pre-commit-ci[bot]
d98bc6d9fc [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
2026-01-12 10:34:05 +05:30
balyan.sid@gmail.com
cf636595d2 rework server and eval for rl rollout. add in asyncmanagedserver for
verifiers
2026-01-12 10:34:05 +05:30
pre-commit-ci[bot]
3449a4c23d [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
2026-01-12 10:34:05 +05:30
balyan.sid@gmail.com
5b09ad86f4 update readme, add sft-datagen to verifiers_server 2026-01-09 19:20:41 +05:30
balyan.sid@gmail.com
636715bb08 add wandb to eval 2026-01-09 16:51:19 +05:30
balyan.sid@gmail.com
dda85430da fix docstrings 2026-01-09 16:25:44 +05:30
balyan.sid@gmail.com
9d5cd2b593 fix: improve verifiers environments consistency and correctness
- verifiers_server.py: consistent dataset column selection for train/test,
  remove redundant comments, preserve float precision for scores
- verifiers_eval.py: add env_config_cls, fix constructor signature to match
  BaseEnv (slurm bool), make stub methods raise NotImplementedError
2026-01-09 16:21:12 +05:30
balyan.sid@gmail.com
ed826de724 wip: verifiers integration 2026-01-09 14:21:03 +05:30
PLippmann
5a130a3a5b Quote fix 2026-01-06 15:14:26 +01:00
PLippmann
7d8123a526 Missing initialization 2026-01-06 15:14:26 +01:00
PLippmann
c927794248 Add SQL Query Generation Environment 2026-01-06 15:14:26 +01:00
Teknium
11ebecd93f
Merge branch 'main' into add-eval-runner 2026-01-05 15:46:39 -08:00
teknium
cb6bf37e68 update name of eval example 2026-01-05 23:46:27 +00:00
teknium
1fa5c3eee4 lint it 2026-01-05 23:37:37 +00:00
Teknium
3ef206e013
Merge branch 'main' into reverse-text-env 2026-01-05 15:33:43 -08:00
Teknium
46beb71e4b
Merge branch 'main' into gsm8k-fix 2026-01-03 03:58:51 -08:00
Teknium
ab75ce6e03
Merge branch 'main' into feat/tool_use_mtgrpo 2026-01-03 03:08:21 -08:00
teknium
00ab64a0f4 lint it 2026-01-02 14:17:56 +00:00
pre-commit-ci[bot]
f71873b0ea [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
2026-01-02 14:14:11 +00:00
teknium
0bb38e79ef big update for letter counting 2026-01-02 14:10:02 +00:00
pre-commit-ci[bot]
3cb09259df [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
2025-12-31 21:05:45 +00:00
teknium
b9dac318ee fix lintings 2025-12-31 21:04:40 +00:00
Teknium
c980157a5a
Merge branch 'main' into joe-public-branch 2025-12-31 12:41:24 -08:00
teknium
747fbc9285 fix linting 2025-12-30 11:56:21 +00:00
teknium
62fa51240c Add support for reasoning models and their variety of providers/endpoints 2025-12-30 00:23:00 +00:00
Teknium
1c306d3b17
Merge pull request #294 from NousResearch/port_many_evals
Port many benchmarks into atropos
2025-12-28 04:34:46 -08:00
pre-commit-ci[bot]
f7fe9d612b [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
2025-12-28 12:32:56 +00:00
teknium
b912983e5e Merge branch 'port_many_evals' of https://github.com/NousResearch/atropos into port_many_evals 2025-12-28 12:32:14 +00:00
teknium
c3f7c8dea6 final 2025-12-28 12:32:12 +00:00