1320 commits

Author SHA1 Message Date
teknium
c3f7c8dea6 final 2025-12-28 12:32:12 +00:00
pre-commit-ci[bot]
55e50f5782 [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
2025-12-28 12:29:37 +00:00
teknium
b975a315fe linters 2025-12-28 12:28:52 +00:00
pre-commit-ci[bot]
1d4275d441 [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
2025-12-28 04:12:17 +00:00
teknium
ea6db6fe92 Merge branch 'port_many_evals' of https://github.com/NousResearch/atropos into port_many_evals 2025-12-28 04:11:32 +00:00
teknium
bcfbd647e3 fix some bugs 2025-12-28 04:09:34 +00:00
pre-commit-ci[bot]
52110f3fb4 [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
2025-12-28 01:45:06 +00:00
teknium
830a129655 add phybench eval 2025-12-28 01:44:20 +00:00
pre-commit-ci[bot]
cd733d4285 [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
2025-12-26 16:50:26 +00:00
interstellarninja
6e2bdd2a39 refactoring mtgrpo turn level advantage server 2025-12-26 22:33:35 +05:45
pre-commit-ci[bot]
60188d07d3 [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
2025-12-26 17:06:20 +05:45
interstellarninja
fa26c32cde implementing scenario modes: single, multistep, multiturn 2025-12-26 17:04:12 +05:45
interstellarninja
eabb91d254 config for sequential tools & normalizing python literals 2025-12-26 17:04:12 +05:45
interstellarninja
a4cdf80e4a BaseEnvConfig subclass for experimental vars 2025-12-26 17:04:12 +05:45
interstellarninja
2aa950a5a8 Add MT-GRPO turn-level advantage environment
- Implement turn-level credit assignment following MT-GRPO paper
  - Custom reward computation: turn-level + outcome-level rewards
  - Per-token advantage assignment compatible with existing GRPO trainer
  - Configurable lambda parameter for turn/outcome advantage weighting
2025-12-26 17:04:12 +05:45
Teknium
d51eaba93c
Merge pull request #292 from tonnycro/fix-links
fix: fix broken links to files
2025-12-25 22:15:46 -08:00
Teknium
b8117a40fc
Merge pull request #291 from NousResearch/pre-commit-ci-update-config
[pre-commit.ci] pre-commit autoupdate
2025-12-25 22:15:04 -08:00
pre-commit-ci[bot]
d04f8c0ae7 [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
2025-12-25 09:54:56 +00:00
teknium
8435371d80 linty 2025-12-25 09:54:11 +00:00
pre-commit-ci[bot]
269fb71713 [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
2025-12-25 09:49:33 +00:00
teknium
9e9f1cd88e Merge branch 'port_many_evals' of https://github.com/NousResearch/atropos into port_many_evals 2025-12-25 09:48:50 +00:00
teknium
c871f6a56a fix eval ctx len 2025-12-25 09:48:47 +00:00
pre-commit-ci[bot]
6bb6a5976d [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
2025-12-24 23:37:21 +00:00
teknium
85296c519e hopefully final linter fixes lol 2025-12-24 23:36:36 +00:00
pre-commit-ci[bot]
d932d9c03b [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
2025-12-24 21:27:53 +00:00
JoeLi12345
3348e31a29 readme 2025-12-24 21:22:07 +00:00
JoeLi12345
2fd888cdb0 lcb coding rl environment 2025-12-24 20:57:49 +00:00
pre-commit-ci[bot]
67869c3a79 [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
2025-12-24 11:05:18 +00:00
teknium
148333a23b Merge branch 'port_many_evals' of https://github.com/NousResearch/atropos into port_many_evals 2025-12-24 11:04:35 +00:00
teknium
abdda3978a more linter nonsense 2025-12-24 11:04:33 +00:00
pre-commit-ci[bot]
fbf1a26559 [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
2025-12-24 10:54:58 +00:00
teknium
f18d46549d fix linter errors 2025-12-24 10:53:45 +00:00
pre-commit-ci[bot]
afab28dfa9 [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
2025-12-24 10:48:24 +00:00
teknium
ef9c0c3699 Port many benchmarks into atropos 2025-12-24 10:23:16 +00:00
kit
6eb2e49618 added missing logprob 2025-12-24 00:05:10 +00:00
Tonny
a25e299c83
Update README.md 2025-12-22 21:53:28 +03:00
Tonny
8da2b5ae29
Update README.md 2025-12-22 21:50:53 +03:00
Tonny
e0b870f28e
Update README.md 2025-12-22 21:50:39 +03:00
Tonny
1761f08211
Update README.md 2025-12-22 21:50:11 +03:00
Tonny
40f3c1f7e7
Update README.md 2025-12-22 21:49:55 +03:00
pre-commit-ci[bot]
d1bb1eed19
[pre-commit.ci] pre-commit autoupdate
updates:
- [github.com/astral-sh/ruff-pre-commit: v0.14.9 → v0.14.10](https://github.com/astral-sh/ruff-pre-commit/compare/v0.14.9...v0.14.10)
2025-12-22 16:40:20 +00:00
Dakota
8ec5066998 add eval runner 2025-12-19 19:56:59 -06:00
dmahan93
5e962e682e
Merge pull request #283 from juleennn/main
docs: fix dead links
2025-12-19 07:33:35 -08:00
dmahan93
cb5929a245
Merge pull request #288 from NousResearch/pre-commit-ci-update-config
[pre-commit.ci] pre-commit autoupdate
2025-12-19 07:32:29 -08:00
pre-commit-ci[bot]
d8c83dd5de [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
2025-12-15 17:52:31 +00:00
GHOryy5
4c4aba108c
Prevent hangs in kernel evaluation by bounding worker waits 2025-12-15 20:50:03 +03:00
pre-commit-ci[bot]
ce61dfadb3
[pre-commit.ci] pre-commit autoupdate
updates:
- [github.com/astral-sh/ruff-pre-commit: v0.14.8 → v0.14.9](https://github.com/astral-sh/ruff-pre-commit/compare/v0.14.8...v0.14.9)
2025-12-15 16:41:48 +00:00
dmahan93
405efa8302
Merge pull request #287 from NousResearch/pre-commit-ci-update-config
[pre-commit.ci] pre-commit autoupdate
2025-12-08 11:24:48 -08:00
pre-commit-ci[bot]
255ddf978c
[pre-commit.ci] pre-commit autoupdate
updates:
- [github.com/psf/black-pre-commit-mirror: 25.11.0 → 25.12.0](https://github.com/psf/black-pre-commit-mirror/compare/25.11.0...25.12.0)
- [github.com/astral-sh/ruff-pre-commit: v0.14.7 → v0.14.8](https://github.com/astral-sh/ruff-pre-commit/compare/v0.14.7...v0.14.8)
2025-12-08 16:41:53 +00:00
Teknium
063ec33373
Merge pull request #286 from NousResearch/README_updates_tinker
README updates for Tinker Integration
2025-12-03 16:42:58 -08:00