Teknium
|
9047f03109
|
Merge pull request #297 from NousResearch/add_reasoning_handling_draft
Add support for reasoning models and their variety of providers/endpo…
|
2026-01-15 19:43:17 -08:00 |
|
crStiv
|
7e12fa015c
|
Update README.md
|
2026-01-15 16:09:46 +02:00 |
|
crStiv
|
b624cbd246
|
Update plot.py
|
2026-01-15 16:09:00 +02:00 |
|
crStiv
|
14b82ae6cc
|
Update configs.py
|
2026-01-15 16:07:00 +02:00 |
|
crStiv
|
941fadd73c
|
Update run.py
|
2026-01-15 16:06:43 +02:00 |
|
crStiv
|
20992ed5d5
|
Update hpo.py
|
2026-01-15 16:05:27 +02:00 |
|
crStiv
|
d2fbe43e7e
|
Update lcb_modal_endpoint.py
|
2026-01-15 16:00:03 +02:00 |
|
balyan.sid@gmail.com
|
c56af35eaa
|
switch to evalbase for verifiers_eval.py
|
2026-01-15 11:34:40 +05:30 |
|
teknium
|
00a0f5397a
|
Merge branch 'add_reasoning_handling_draft' of https://github.com/NousResearch/atropos into add_reasoning_handling_draft
|
2026-01-14 13:38:08 +00:00 |
|
teknium
|
3a854cc3af
|
fix linter
|
2026-01-14 13:38:04 +00:00 |
|
balyan.sid@gmail.com
|
6a27e88023
|
use managed server
|
2026-01-14 17:09:01 +05:30 |
|
balyan.sid@gmail.com
|
32320512e8
|
update verifiers_server to use tokenizer_for_trainer
|
2026-01-13 15:00:54 +05:30 |
|
pre-commit-ci[bot]
|
79a55ff186
|
[pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
|
2026-01-13 07:30:33 +00:00 |
|
teknium
|
2a7dd49328
|
Merge branch 'add_reasoning_handling_draft' of https://github.com/NousResearch/atropos into add_reasoning_handling_draft
|
2026-01-13 07:29:48 +00:00 |
|
teknium
|
b33cb7f943
|
A bit more updates for robustness
|
2026-01-13 07:29:43 +00:00 |
|
Teknium
|
837fc237ee
|
Merge branch 'main' into add_reasoning_handling_draft
|
2026-01-12 09:45:38 -08:00 |
|
balyan.sid@gmail.com
|
a1d1e7d7fe
|
fix env_args, dataset/prompt loading
|
2026-01-12 10:39:43 +05:30 |
|
pre-commit-ci[bot]
|
7907ffd0ad
|
[pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
|
2026-01-12 05:05:11 +00:00 |
|
balyan.sid@gmail.com
|
9db6c0d1ed
|
added better wandb logging
|
2026-01-12 10:34:05 +05:30 |
|
balyan.sid@gmail.com
|
dceb1d8fd8
|
parallelize verifiers_server: use generate() for SFT, parallel
ManagedServer contexts for RL
|
2026-01-12 10:34:05 +05:30 |
|
balyan.sid@gmail.com
|
24b4488c60
|
clean up eval, pin verifiers version
|
2026-01-12 10:34:05 +05:30 |
|
pre-commit-ci[bot]
|
d98bc6d9fc
|
[pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
|
2026-01-12 10:34:05 +05:30 |
|
balyan.sid@gmail.com
|
cf636595d2
|
rework server and eval for rl rollout. add in asyncmanagedserver for
verifiers
|
2026-01-12 10:34:05 +05:30 |
|
pre-commit-ci[bot]
|
3449a4c23d
|
[pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
|
2026-01-12 10:34:05 +05:30 |
|
balyan.sid@gmail.com
|
5b09ad86f4
|
update readme, add sft-datagen to verifiers_server
|
2026-01-09 19:20:41 +05:30 |
|
balyan.sid@gmail.com
|
636715bb08
|
add wandb to eval
|
2026-01-09 16:51:19 +05:30 |
|
balyan.sid@gmail.com
|
dda85430da
|
fix docstrings
|
2026-01-09 16:25:44 +05:30 |
|
balyan.sid@gmail.com
|
9d5cd2b593
|
fix: improve verifiers environments consistency and correctness
- verifiers_server.py: consistent dataset column selection for train/test,
remove redundant comments, preserve float precision for scores
- verifiers_eval.py: add env_config_cls, fix constructor signature to match
BaseEnv (slurm bool), make stub methods raise NotImplementedError
|
2026-01-09 16:21:12 +05:30 |
|
balyan.sid@gmail.com
|
ed826de724
|
wip: verifiers integration
|
2026-01-09 14:21:03 +05:30 |
|
PLippmann
|
5a130a3a5b
|
Quote fix
|
2026-01-06 15:14:26 +01:00 |
|
PLippmann
|
7d8123a526
|
Missing initialization
|
2026-01-06 15:14:26 +01:00 |
|
PLippmann
|
c927794248
|
Add SQL Query Generation Environment
|
2026-01-06 15:14:26 +01:00 |
|
Teknium
|
11ebecd93f
|
Merge branch 'main' into add-eval-runner
|
2026-01-05 15:46:39 -08:00 |
|
teknium
|
cb6bf37e68
|
update name of eval example
|
2026-01-05 23:46:27 +00:00 |
|
teknium
|
1fa5c3eee4
|
lint it
|
2026-01-05 23:37:37 +00:00 |
|
Teknium
|
3ef206e013
|
Merge branch 'main' into reverse-text-env
|
2026-01-05 15:33:43 -08:00 |
|
Teknium
|
46beb71e4b
|
Merge branch 'main' into gsm8k-fix
|
2026-01-03 03:58:51 -08:00 |
|
Teknium
|
ab75ce6e03
|
Merge branch 'main' into feat/tool_use_mtgrpo
|
2026-01-03 03:08:21 -08:00 |
|
teknium
|
00ab64a0f4
|
lint it
|
2026-01-02 14:17:56 +00:00 |
|
pre-commit-ci[bot]
|
f71873b0ea
|
[pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
|
2026-01-02 14:14:11 +00:00 |
|
teknium
|
0bb38e79ef
|
big update for letter counting
|
2026-01-02 14:10:02 +00:00 |
|
pre-commit-ci[bot]
|
3cb09259df
|
[pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
|
2025-12-31 21:05:45 +00:00 |
|
teknium
|
b9dac318ee
|
fix lintings
|
2025-12-31 21:04:40 +00:00 |
|
Teknium
|
c980157a5a
|
Merge branch 'main' into joe-public-branch
|
2025-12-31 12:41:24 -08:00 |
|
teknium
|
747fbc9285
|
fix linting
|
2025-12-30 11:56:21 +00:00 |
|
teknium
|
62fa51240c
|
Add support for reasoning models and their variety of providers/endpoints
|
2025-12-30 00:23:00 +00:00 |
|
Teknium
|
1c306d3b17
|
Merge pull request #294 from NousResearch/port_many_evals
Port many benchmarks into atropos
|
2025-12-28 04:34:46 -08:00 |
|
pre-commit-ci[bot]
|
f7fe9d612b
|
[pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
|
2025-12-28 12:32:56 +00:00 |
|
teknium
|
b912983e5e
|
Merge branch 'port_many_evals' of https://github.com/NousResearch/atropos into port_many_evals
|
2025-12-28 12:32:14 +00:00 |
|
teknium
|
c3f7c8dea6
|
final
|
2025-12-28 12:32:12 +00:00 |
|