Commit graph

746 commits

Author SHA1 Message Date
dmahan93
9dbef4e552
Merge pull request #210 from NousResearch/pre-commit-ci-update-config
[pre-commit.ci] pre-commit autoupdate
2025-07-14 17:59:05 -05:00
pre-commit-ci[bot]
110066a700
[pre-commit.ci] pre-commit autoupdate
updates:
- [github.com/astral-sh/ruff-pre-commit: v0.12.2 → v0.12.3](https://github.com/astral-sh/ruff-pre-commit/compare/v0.12.2...v0.12.3)
2025-07-14 16:36:32 +00:00
hjc-puro
75a4264f8d
Merge pull request #208 from NousResearch/2025-07-08-evals
Add `evaluate_log` method, gsm8k example
2025-07-12 06:45:05 +08:00
hjc-puro
6e9baaf9d8 table 2025-07-11 09:52:19 +00:00
hjc-puro
72210cf4ad rename fn 2025-07-11 04:04:55 +00:00
hjc-puro
352e1b8f88 comments 2025-07-11 03:55:16 +00:00
hjc-puro
d133ba3867 comment 2025-07-11 03:54:03 +00:00
hjc-puro
ccb8eaf230 move table to util 2025-07-11 03:52:24 +00:00
hjc-puro
5e61331360 simplify schema 2025-07-11 03:49:49 +00:00
hjc-puro
0d4ce37b73 add eval types 2025-07-11 03:36:55 +00:00
hjc-puro
290e087fc5 remove some imports 2025-07-11 03:25:10 +00:00
hjc-puro
68da3809e2 move table to display util 2025-07-11 02:06:56 +00:00
hjc-puro
3e08c6d788 simplify schema 2025-07-11 00:52:09 +00:00
hjc-puro
b06332623d move time import 2025-07-11 00:45:24 +00:00
Philip Lippmann
17faebae03
Enhanced Pydantic Schema Following Environment with Dynamic Error Introduction and Editing Task Support (#185)
* New JSON env and documentation

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* Removed separate JSON generation environment

* Updated pydantic environment with edit functionality

* Error helper function

* Updated README

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* Fixed pre-commit issues

---------

Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2025-07-11 10:44:16 +10:00
hjc-puro
6c64df0226 remove jsonlines dependency 2025-07-11 00:42:55 +00:00
hjc-puro
3e1eba6e92 rename method 2025-07-11 00:39:28 +00:00
hjc-puro
468b599ddb Merge branch '2025-07-08-evals' of github.com:NousResearch/atropos into 2025-07-08-evals 2025-07-11 00:34:35 +00:00
hjc-puro
da0d64ae89 linting errors 2025-07-11 00:29:57 +00:00
pre-commit-ci[bot]
5aacb1ea9b [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
2025-07-11 00:26:48 +00:00
hjc-puro
e601251893 gsm8k eval example 2025-07-11 00:22:36 +00:00
hjc-puro
afe4fc318f sample level logging 2025-07-10 02:03:07 +00:00
hjc-puro
eb926dc58b working evals 2025-07-10 01:45:21 +00:00
hjc-puro
f4de3ad6f5 add printing 2025-07-09 23:35:26 +00:00
hjc-puro
a11af27298 add eval saving cli args 2025-07-09 03:12:13 +00:00
Dakota
6386a5e185 fix precommit 2025-07-08 13:46:35 -05:00
dmahan93
734a24613d
Merge pull request #206 from NousResearch/pre-commit-ci-update-config
[pre-commit.ci] pre-commit autoupdate
2025-07-08 13:40:53 -05:00
dmahan93
0328680cbe
Merge pull request #205 from futreall/main
chore: spelling cleanup
2025-07-08 13:40:11 -05:00
hjc-puro
72e75c2b13
Merge pull request #207 from NousResearch/2025-07-07-evaluate
Add evaluate subcommand to cli
2025-07-08 06:53:37 +08:00
pre-commit-ci[bot]
9340791a6c [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
2025-07-07 21:53:41 +00:00
hjc-puro
e2bcf27398 evaluate 2025-07-07 17:46:35 -04:00
hjc-puro
e56257311e describe evaluate subcommand 2025-07-07 17:44:52 -04:00
hjc-puro
5519f190d2 add evaluate subcommand to cli 2025-07-07 17:39:33 -04:00
pre-commit-ci[bot]
8b75cf5667 [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
2025-07-07 16:39:20 +00:00
pre-commit-ci[bot]
0cdaa5cea3
[pre-commit.ci] pre-commit autoupdate
updates:
- [github.com/astral-sh/ruff-pre-commit: v0.12.1 → v0.12.2](https://github.com/astral-sh/ruff-pre-commit/compare/v0.12.1...v0.12.2)
2025-07-07 16:38:48 +00:00
futreall
1f52b347ae
chore: spelling cleanup 2025-07-07 19:17:29 +03:00
dmahan93
ecc5eebeca
Merge pull request #167 from paulsengh/feat/pay-to-play-env
feat: add pay-to-play environment with mixture of judges and micropay…
2025-07-07 09:02:46 -05:00
dmahan93
58446dbcb1
Merge pull request #204 from NousResearch/multienv-enforce-mins
Multienv with enforced minimum samples in a batch
2025-07-07 08:53:43 -05:00
Dakota
08e14cc745 feat: add minimum batch allocation support for environments
- Add min_batch_allocation parameter to ensure environments contribute minimum proportion to each batch
- Implement grab_batch_with_minimum_allocations function with proper scaling when allocations exceed 100%
- Add mixed-size group buffering to handle variable-sized data submissions
- Update server to use minimum allocation logic when any env has min_batch_allocation set
- Add comprehensive tests for minimum allocation scenarios
- Update documentation in API README and CONFIG.md
- Update example environments to demonstrate the feature

This feature allows critical environments to guarantee they contribute at least a specified proportion (0.0-1.0) to each training batch, ensuring important data sources are always represented during training.

🤖 Generated with [Claude Code](https://claude.ai/code)

Co-Authored-By: Claude <noreply@anthropic.com>
2025-07-07 08:50:28 -05:00
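
The scaling step that commit 08e14cc745 describes ("proper scaling when allocations exceed 100%") can be illustrated with a short sketch. This is not the repository's `grab_batch_with_minimum_allocations`; the function name `reserve_minimum_slots`, its signature, and the rounding-down behavior are assumptions made for illustration only. It shows one plausible way to turn per-environment minimum proportions (0.0-1.0) into reserved batch slots, shrinking them proportionally when they sum to more than 1.0.

```python
import math
from typing import Dict


def reserve_minimum_slots(min_allocs: Dict[str, float], batch_size: int) -> Dict[str, int]:
    """Hypothetical helper: reserve batch slots per environment.

    min_allocs maps an environment name to its minimum proportion of the
    batch (0.0-1.0). If the proportions sum to more than 1.0, every
    proportion is divided by that sum so the reservations fit in one batch.
    """
    for name, frac in min_allocs.items():
        if not 0.0 <= frac <= 1.0:
            raise ValueError(f"min allocation for {name!r} must be in [0.0, 1.0]")

    total = sum(min_allocs.values())
    # Only scale down; when the minimums already fit, use them unchanged.
    divisor = max(total, 1.0)

    # Round down so the reserved slots never exceed batch_size (assumption).
    return {
        name: math.floor(frac * batch_size / divisor)
        for name, frac in min_allocs.items()
    }


if __name__ == "__main__":
    # Two environments each asking for 75% of an 8-sample batch.
    print(reserve_minimum_slots({"env_a": 0.75, "env_b": 0.75}, batch_size=8))
```

Slots left over after the reservations would presumably be filled from whichever environments have data queued, but the commit message does not say how, so that part is omitted here.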
dmahan93
3b8d8a6f09
Merge pull request #202 from Myashka/main
Include run name in wandb initialization in BaseEnv
2025-07-07 08:05:47 -05:00
dmahan93
4dfc397845
Merge pull request #203 from Myashka/wait_for_sem-fix
Fix infinite loop in wait_for_sem by updating semaphore values inside loop
2025-07-07 07:56:24 -05:00
Alexey Gorbatovski
35c542328a Fix infinite loop in wait_for_sem by updating semaphore values inside loop 2025-07-06 00:27:45 +03:00
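
The class of bug named in commit 35c542328a (a wait loop whose condition depends on values read only once, before the loop) can be shown with a minimal sketch. This is not the repository's `wait_for_sem`; `SlotCounter`, `wait_for_free`, and the polling interval are hypothetical names and choices used only to illustrate why the value must be re-read inside the loop.

```python
import asyncio


class SlotCounter:
    """Hypothetical stand-in for a semaphore whose usage can be inspected."""

    def __init__(self, capacity: int) -> None:
        self.capacity = capacity
        self.in_use = 0

    def free(self) -> int:
        return self.capacity - self.in_use


async def wait_for_free(counter: SlotCounter, min_free: int, poll_interval: float = 0.01) -> int:
    """Wait until at least `min_free` slots are free, then return the free count."""
    free = counter.free()
    while free < min_free:
        await asyncio.sleep(poll_interval)
        # The fix: refresh the value inside the loop. Without this line,
        # `free` keeps its stale pre-loop value and the loop never exits.
        free = counter.free()
    return free


async def demo() -> int:
    counter = SlotCounter(capacity=4)
    counter.in_use = 4  # start fully occupied

    async def release_later() -> None:
        await asyncio.sleep(0.02)
        counter.in_use = 1  # three slots become free

    releaser = asyncio.create_task(release_later())
    # The timeout guards against the loop hanging if the fix is removed.
    free = await asyncio.wait_for(wait_for_free(counter, min_free=2), timeout=1.0)
    await releaser
    return free


if __name__ == "__main__":
    print(asyncio.run(demo()))
```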
pre-commit-ci[bot]
ee5257522a [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
2025-07-04 14:34:37 +00:00
Alexey Gorbatovski
14c70c0e68 Include run name in wandb initialization in BaseEnv 2025-07-04 17:13:34 +03:00
dmahan93
2c8340bece
Merge pull request #201 from eeemmmmmm/main
polishing docs: error fixes for clarity
2025-07-02 06:08:53 -05:00
emmmm
10f1b466b6
Update curriculum.py 2025-07-02 10:22:33 +02:00
dmahan93
38b963817b
Merge pull request #200 from NousResearch/allow-ignore-value-in-max-trainer-len
allow inf (<= 0 max_token_len) generations if trainer requests it
2025-07-01 09:54:19 -05:00
Dakota
683559afd2 allow inf (<= 0 max_token_len) generations if trainer requests it, but raise a warning so that users can check their logs and get info if their trainers are doing something weird 2025-07-01 09:52:10 -05:00
dmahan93
0ff966c276
Merge pull request #198 from NousResearch/pre-commit-ci-update-config
[pre-commit.ci] pre-commit autoupdate
2025-06-30 20:08:45 -05:00
pre-commit-ci[bot]
4d6fbf4fa7
[pre-commit.ci] pre-commit autoupdate
updates:
- [github.com/astral-sh/ruff-pre-commit: v0.12.0 → v0.12.1](https://github.com/astral-sh/ruff-pre-commit/compare/v0.12.0...v0.12.1)
2025-06-30 16:36:06 +00:00