Commit graph

688 commits

Author SHA1 Message Date
teknium
00a0f5397a Merge branch 'add_reasoning_handling_draft' of https://github.com/NousResearch/atropos into add_reasoning_handling_draft 2026-01-14 13:38:08 +00:00
teknium
3a854cc3af fix linter 2026-01-14 13:38:04 +00:00
pre-commit-ci[bot]
79a55ff186 [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
2026-01-13 07:30:33 +00:00
teknium
2a7dd49328 Merge branch 'add_reasoning_handling_draft' of https://github.com/NousResearch/atropos into add_reasoning_handling_draft 2026-01-13 07:29:48 +00:00
teknium
b33cb7f943 A bit more updates for robustness 2026-01-13 07:29:43 +00:00
Teknium
837fc237ee
Merge branch 'main' into add_reasoning_handling_draft 2026-01-12 09:45:38 -08:00
PLippmann
5a130a3a5b Quote fix 2026-01-06 15:14:26 +01:00
PLippmann
7d8123a526 Missing initialization 2026-01-06 15:14:26 +01:00
PLippmann
c927794248 Add SQL Query Generation Environment 2026-01-06 15:14:26 +01:00
Teknium
11ebecd93f
Merge branch 'main' into add-eval-runner 2026-01-05 15:46:39 -08:00
teknium
cb6bf37e68 update name of eval example 2026-01-05 23:46:27 +00:00
teknium
1fa5c3eee4 lint it 2026-01-05 23:37:37 +00:00
Teknium
3ef206e013
Merge branch 'main' into reverse-text-env 2026-01-05 15:33:43 -08:00
Teknium
46beb71e4b
Merge branch 'main' into gsm8k-fix 2026-01-03 03:58:51 -08:00
Teknium
ab75ce6e03
Merge branch 'main' into feat/tool_use_mtgrpo 2026-01-03 03:08:21 -08:00
teknium
00ab64a0f4 lint it 2026-01-02 14:17:56 +00:00
pre-commit-ci[bot]
f71873b0ea [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
2026-01-02 14:14:11 +00:00
teknium
0bb38e79ef big update for letter counting 2026-01-02 14:10:02 +00:00
pre-commit-ci[bot]
3cb09259df [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
2025-12-31 21:05:45 +00:00
teknium
b9dac318ee fix lintings 2025-12-31 21:04:40 +00:00
Teknium
c980157a5a
Merge branch 'main' into joe-public-branch 2025-12-31 12:41:24 -08:00
teknium
747fbc9285 fix linting 2025-12-30 11:56:21 +00:00
teknium
62fa51240c Add support for reasoning models and their variety of providers/endpoints 2025-12-30 00:23:00 +00:00
Teknium
1c306d3b17
Merge pull request #294 from NousResearch/port_many_evals
Port many benchmarks into atropos
2025-12-28 04:34:46 -08:00
pre-commit-ci[bot]
f7fe9d612b [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
2025-12-28 12:32:56 +00:00
teknium
b912983e5e Merge branch 'port_many_evals' of https://github.com/NousResearch/atropos into port_many_evals 2025-12-28 12:32:14 +00:00
teknium
c3f7c8dea6 final 2025-12-28 12:32:12 +00:00
pre-commit-ci[bot]
55e50f5782 [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
2025-12-28 12:29:37 +00:00
teknium
b975a315fe linters 2025-12-28 12:28:52 +00:00
pre-commit-ci[bot]
1d4275d441 [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
2025-12-28 04:12:17 +00:00
teknium
ea6db6fe92 Merge branch 'port_many_evals' of https://github.com/NousResearch/atropos into port_many_evals 2025-12-28 04:11:32 +00:00
teknium
bcfbd647e3 fix some bugs 2025-12-28 04:09:34 +00:00
pre-commit-ci[bot]
52110f3fb4 [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
2025-12-28 01:45:06 +00:00
teknium
830a129655 add phybench eval 2025-12-28 01:44:20 +00:00
pre-commit-ci[bot]
cd733d4285 [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
2025-12-26 16:50:26 +00:00
interstellarninja
6e2bdd2a39 refactoring mtgrpo turn level advantage server 2025-12-26 22:33:35 +05:45
pre-commit-ci[bot]
60188d07d3 [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
2025-12-26 17:06:20 +05:45
interstellarninja
fa26c32cde implmenting scenarios modes: single, multistep, multiturn 2025-12-26 17:04:12 +05:45
interstellarninja
eabb91d254 config for sequential tools & normalizing python literals 2025-12-26 17:04:12 +05:45
interstellarninja
a4cdf80e4a BaseEnvConfig subclass for experimental vars 2025-12-26 17:04:12 +05:45
interstellarninja
2aa950a5a8 Add MT-GRPO turn-level advantage environment
- Implement turn-level credit assignment following MT-GRPO paper
  - Custom reward computation: turn-level + outcome-level rewards
  - Per-token advantage assignment compatible with existing GRPO trainer
  - Configurable lambda parameter for turn/outcome advantage weighting
2025-12-26 17:04:12 +05:45
pre-commit-ci[bot]
d04f8c0ae7 [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
2025-12-25 09:54:56 +00:00
teknium
8435371d80 linty 2025-12-25 09:54:11 +00:00
pre-commit-ci[bot]
269fb71713 [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
2025-12-25 09:49:33 +00:00
teknium
9e9f1cd88e Merge branch 'port_many_evals' of https://github.com/NousResearch/atropos into port_many_evals 2025-12-25 09:48:50 +00:00
teknium
c871f6a56a fix eval ctx len 2025-12-25 09:48:47 +00:00
pre-commit-ci[bot]
6bb6a5976d [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
2025-12-24 23:37:21 +00:00
teknium
85296c519e hopefully final linter fixes lol 2025-12-24 23:36:36 +00:00
pre-commit-ci[bot]
d932d9c03b [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
2025-12-24 21:27:53 +00:00
JoeLi12345
3348e31a29 readme 2025-12-24 21:22:07 +00:00