teknium
|
cb6bf37e68
|
update name of eval example
|
2026-01-05 23:46:27 +00:00 |
|
teknium
|
1fa5c3eee4
|
lint it
|
2026-01-05 23:37:37 +00:00 |
|
Teknium
|
3ef206e013
|
Merge branch 'main' into reverse-text-env
|
2026-01-05 15:33:43 -08:00 |
|
Teknium
|
46beb71e4b
|
Merge branch 'main' into gsm8k-fix
|
2026-01-03 03:58:51 -08:00 |
|
Teknium
|
ab75ce6e03
|
Merge branch 'main' into feat/tool_use_mtgrpo
|
2026-01-03 03:08:21 -08:00 |
|
teknium
|
00ab64a0f4
|
lint it
|
2026-01-02 14:17:56 +00:00 |
|
pre-commit-ci[bot]
|
f71873b0ea
|
[pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
|
2026-01-02 14:14:11 +00:00 |
|
teknium
|
0bb38e79ef
|
big update for letter counting
|
2026-01-02 14:10:02 +00:00 |
|
pre-commit-ci[bot]
|
3cb09259df
|
[pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
|
2025-12-31 21:05:45 +00:00 |
|
teknium
|
b9dac318ee
|
fix lintings
|
2025-12-31 21:04:40 +00:00 |
|
Teknium
|
c980157a5a
|
Merge branch 'main' into joe-public-branch
|
2025-12-31 12:41:24 -08:00 |
|
teknium
|
747fbc9285
|
fix linting
|
2025-12-30 11:56:21 +00:00 |
|
teknium
|
62fa51240c
|
Add support for reasoning models and their variety of providers/endpoints
|
2025-12-30 00:23:00 +00:00 |
|
Teknium
|
1c306d3b17
|
Merge pull request #294 from NousResearch/port_many_evals
Port many benchmarks into atropos
|
2025-12-28 04:34:46 -08:00 |
|
pre-commit-ci[bot]
|
f7fe9d612b
|
[pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
|
2025-12-28 12:32:56 +00:00 |
|
teknium
|
b912983e5e
|
Merge branch 'port_many_evals' of https://github.com/NousResearch/atropos into port_many_evals
|
2025-12-28 12:32:14 +00:00 |
|
teknium
|
c3f7c8dea6
|
final
|
2025-12-28 12:32:12 +00:00 |
|
pre-commit-ci[bot]
|
55e50f5782
|
[pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
|
2025-12-28 12:29:37 +00:00 |
|
teknium
|
b975a315fe
|
linters
|
2025-12-28 12:28:52 +00:00 |
|
pre-commit-ci[bot]
|
1d4275d441
|
[pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
|
2025-12-28 04:12:17 +00:00 |
|
teknium
|
ea6db6fe92
|
Merge branch 'port_many_evals' of https://github.com/NousResearch/atropos into port_many_evals
|
2025-12-28 04:11:32 +00:00 |
|
teknium
|
bcfbd647e3
|
fix some bugs
|
2025-12-28 04:09:34 +00:00 |
|
pre-commit-ci[bot]
|
52110f3fb4
|
[pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
|
2025-12-28 01:45:06 +00:00 |
|
teknium
|
830a129655
|
add phybench eval
|
2025-12-28 01:44:20 +00:00 |
|
pre-commit-ci[bot]
|
cd733d4285
|
[pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
|
2025-12-26 16:50:26 +00:00 |
|
interstellarninja
|
6e2bdd2a39
|
refactoring mtgrpo turn level advantage server
|
2025-12-26 22:33:35 +05:45 |
|
pre-commit-ci[bot]
|
60188d07d3
|
[pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
|
2025-12-26 17:06:20 +05:45 |
|
interstellarninja
|
fa26c32cde
|
implmenting scenarios modes: single, multistep, multiturn
|
2025-12-26 17:04:12 +05:45 |
|
interstellarninja
|
eabb91d254
|
config for sequential tools & normalizing python literals
|
2025-12-26 17:04:12 +05:45 |
|
interstellarninja
|
a4cdf80e4a
|
BaseEnvConfig subclass for experimental vars
|
2025-12-26 17:04:12 +05:45 |
|
interstellarninja
|
2aa950a5a8
|
Add MT-GRPO turn-level advantage environment
- Implement turn-level credit assignment following MT-GRPO paper
- Custom reward computation: turn-level + outcome-level rewards
- Per-token advantage assignment compatible with existing GRPO trainer
- Configurable lambda parameter for turn/outcome advantage weighting
|
2025-12-26 17:04:12 +05:45 |
|
pre-commit-ci[bot]
|
d04f8c0ae7
|
[pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
|
2025-12-25 09:54:56 +00:00 |
|
teknium
|
8435371d80
|
linty
|
2025-12-25 09:54:11 +00:00 |
|
pre-commit-ci[bot]
|
269fb71713
|
[pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
|
2025-12-25 09:49:33 +00:00 |
|
teknium
|
9e9f1cd88e
|
Merge branch 'port_many_evals' of https://github.com/NousResearch/atropos into port_many_evals
|
2025-12-25 09:48:50 +00:00 |
|
teknium
|
c871f6a56a
|
fix eval ctx len
|
2025-12-25 09:48:47 +00:00 |
|
pre-commit-ci[bot]
|
6bb6a5976d
|
[pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
|
2025-12-24 23:37:21 +00:00 |
|
teknium
|
85296c519e
|
hopefully final linter fixes lol
|
2025-12-24 23:36:36 +00:00 |
|
pre-commit-ci[bot]
|
d932d9c03b
|
[pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
|
2025-12-24 21:27:53 +00:00 |
|
JoeLi12345
|
3348e31a29
|
readme
|
2025-12-24 21:22:07 +00:00 |
|
JoeLi12345
|
2fd888cdb0
|
lcb coding rl environment
|
2025-12-24 20:57:49 +00:00 |
|
pre-commit-ci[bot]
|
67869c3a79
|
[pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
|
2025-12-24 11:05:18 +00:00 |
|
teknium
|
148333a23b
|
Merge branch 'port_many_evals' of https://github.com/NousResearch/atropos into port_many_evals
|
2025-12-24 11:04:35 +00:00 |
|
teknium
|
abdda3978a
|
more linter nonsense
|
2025-12-24 11:04:33 +00:00 |
|
pre-commit-ci[bot]
|
fbf1a26559
|
[pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
|
2025-12-24 10:54:58 +00:00 |
|
teknium
|
f18d46549d
|
fix linter errors
|
2025-12-24 10:53:45 +00:00 |
|
pre-commit-ci[bot]
|
afab28dfa9
|
[pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
|
2025-12-24 10:48:24 +00:00 |
|
teknium
|
ef9c0c3699
|
Port many benchmarks into atropos
|
2025-12-24 10:23:16 +00:00 |
|
kit
|
6eb2e49618
|
added missing logprob
|
2025-12-24 00:05:10 +00:00 |
|
Tonny
|
a25e299c83
|
Update README.md
|
2025-12-22 21:53:28 +03:00 |
|