Commit graph

763 commits

Author SHA1 Message Date
teknium
9607880f3d Lots of updates to the environment to clean up, add more metrics, and make it more robust - ties still have an issue though 2025-07-14 09:39:00 +00:00
pre-commit-ci[bot]
107809260d [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
2025-07-12 11:23:13 +00:00
teknium
e83d796c74 add pairwise judgement environment 2025-07-12 11:15:56 +00:00
hjc-puro
75a4264f8d
Merge pull request #208 from NousResearch/2025-07-08-evals
Add `evaluate_log` method, gsm8k example
2025-07-12 06:45:05 +08:00
hjc-puro
6e9baaf9d8 table 2025-07-11 09:52:19 +00:00
hjc-puro
352e1b8f88 comments 2025-07-11 03:55:16 +00:00
hjc-puro
b06332623d move time import 2025-07-11 00:45:24 +00:00
Philip Lippmann
17faebae03
Enhanced Pydantic Schema Following Environment with Dynamic Error Introduction and Editing Task Support (#185)
* New JSON env and documentation

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* Removed separate JSON generation environment

* Updated pydantic environment with edit functionality

* Error helper function

* Updated README

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* Fixed pre-commit issues

---------

Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2025-07-11 10:44:16 +10:00
hjc-puro
3e1eba6e92 rename method 2025-07-11 00:39:28 +00:00
pre-commit-ci[bot]
5aacb1ea9b [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
2025-07-11 00:26:48 +00:00
hjc-puro
e601251893 gsm8k eval example 2025-07-11 00:22:36 +00:00
hjc-puro
afe4fc318f sample level logging 2025-07-10 02:03:07 +00:00
hjc-puro
f4de3ad6f5 add printing 2025-07-09 23:35:26 +00:00
Dakota
6386a5e185 fix precommit 2025-07-08 13:46:35 -05:00
dmahan93
0328680cbe
Merge pull request #205 from futreall/main
chore: spelling cleanup
2025-07-08 13:40:11 -05:00
interstellarninja
2b852ee0b7 merging changes 2025-07-08 00:33:29 -04:00
interstellarninja
90c1b703e6 fixing pre-commit errors 2025-07-08 00:30:53 -04:00
pre-commit-ci[bot]
9340791a6c [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
2025-07-07 21:53:41 +00:00
futreall
1f52b347ae
chore: spelling cleanup 2025-07-07 19:17:29 +03:00
dmahan93
ecc5eebeca
Merge pull request #167 from paulsengh/feat/pay-to-play-env
feat: add pay-to-play environment with mixture of judges and micropay…
2025-07-07 09:02:46 -05:00
dmahan93
58446dbcb1
Merge pull request #204 from NousResearch/multienv-enforce-mins
Multienv with enforced minimum samples in a batch
2025-07-07 08:53:43 -05:00
Dakota
08e14cc745 feat: add minimum batch allocation support for environments
- Add min_batch_allocation parameter to ensure environments contribute minimum proportion to each batch
- Implement grab_batch_with_minimum_allocations function with proper scaling when allocations exceed 100%
- Add mixed-size group buffering to handle variable-sized data submissions
- Update server to use minimum allocation logic when any env has min_batch_allocation set
- Add comprehensive tests for minimum allocation scenarios
- Update documentation in API README and CONFIG.md
- Update example environments to demonstrate the feature

This feature allows critical environments to guarantee they contribute at least a specified proportion (0.0-1.0) to each training batch, ensuring important data sources are always represented during training.

🤖 Generated with [Claude Code](https://claude.ai/code)

Co-Authored-By: Claude <noreply@anthropic.com>
2025-07-07 08:50:28 -05:00
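
As a rough illustration of the minimum-allocation idea described in commit 08e14cc745 above, the following Python sketch converts per-environment minimum proportions into reserved slot counts and scales them down proportionally when they sum to more than 100%. It is not the repository's `grab_batch_with_minimum_allocations` implementation; the function name `reserve_min_slots`, its arguments, and the round-down choice are assumptions made for this example.

```python
from typing import Dict, Optional


def reserve_min_slots(
    min_allocations: Dict[str, Optional[float]],
    batch_size: int,
) -> Dict[str, int]:
    """Hypothetical helper: turn minimum proportions (0.0-1.0) into slot counts.

    Environments whose value is None have no minimum and get no reservation.
    """
    mins = {name: p for name, p in min_allocations.items() if p is not None}
    for name, p in mins.items():
        if not 0.0 <= p <= 1.0:
            raise ValueError(f"min allocation for {name!r} must be in [0.0, 1.0]")

    total = sum(mins.values())
    # If the minimums over-subscribe the batch (sum > 1.0), divide by the
    # total so the scaled proportions sum to exactly 100%.
    denom = max(total, 1.0)

    # Round down so the reservations can never exceed the batch size.
    return {name: int(p * batch_size / denom) for name, p in mins.items()}


# Minimums fit in the batch: reservations are taken as-is.
print(reserve_min_slots({"math": 0.25, "tools": 0.5, "chat": None}, 8))

# Minimums sum to 150%: each is scaled down by the same factor.
print(reserve_min_slots({"math": 0.75, "tools": 0.75}, 8))
```

In the first call, two of the eight slots remain unreserved and could be filled from any environment; how the actual server fills them is not stated in the commit message.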
emmmm
10f1b466b6
Update curriculum.py 2025-07-02 10:22:33 +02:00
pre-commit-ci[bot]
ab06a1ed52 [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
2025-07-01 03:33:35 +00:00
interstellarninja
256e0498da Merge branch 'feat/interleaved_tool_use' of github.com:interstellarninja/atropos into feat/interleaved_tool_use 2025-06-30 23:31:43 -04:00
interstellarninja
2827f55a04 overriding max_token_len from base 2025-06-30 23:29:58 -04:00
pre-commit-ci[bot]
ac5c341eee [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
2025-06-30 12:37:40 +00:00
interstellarninja
eb7a54de96 tool_use_interleaved_thinking.py 2025-06-30 08:35:10 -04:00
interstellarninja
72e91e5a1d fixing merge errors 2025-06-30 08:32:13 -04:00
pre-commit-ci[bot]
9e02d020fc [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
2025-06-30 12:19:46 +00:00
interstellarninja
a0d53e9bdc Merge branch 'feat/interleaved_tool_use' of github.com:interstellarninja/atropos into feat/interleaved_tool_use 2025-06-30 08:18:12 -04:00
interstellarninja
71ef50ffc7 implementing execution feedback mode 2025-06-30 08:15:30 -04:00
interstellarninja
9ea8ce26c6 Merge branch 'feat/multiturn_tool_use_env' of github.com:interstellarninja/atropos into feat/multiturn_tool_use_env 2025-06-27 01:41:55 -04:00
interstellarninja
b162813048 allowing only one think block 2025-06-26 23:20:30 -04:00
pre-commit-ci[bot]
34d45d2445 [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
2025-06-26 17:12:42 +00:00
interstellarninja
ee8755c388 using scenario for single, multistep and multiturn tool calls 2025-06-26 13:11:43 -04:00
Alex Pikme
ba992757e1
fix dead link README.md 2025-06-25 10:51:21 +02:00
pre-commit-ci[bot]
85f7a0b226 [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
2025-06-24 19:27:16 +00:00
interstellarninja
1570a8a106 resolving conflicts 2025-06-24 15:25:55 -04:00
interstellarninja
53138404b7 adding dynamic few-shot and controlling max gen per turn 2025-06-24 15:21:42 -04:00
pre-commit-ci[bot]
5ee01a7911 [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
2025-06-24 12:26:59 +00:00
interstellarninja
569a8303f3 creating environment for interleaved tool use 2025-06-24 08:21:37 -04:00
interstellarninja
45bc484931 option to generate all gpt turns 2025-06-24 08:14:14 -04:00
dmahan93
5b2b5e9947
Merge pull request #192 from rnkrtt/main
Fix typo in author name Gurning -> Gurung in community README
2025-06-23 10:15:58 -05:00
Merkel Tranjes
af1c98d7a8
Update README.md 2025-06-23 16:23:02 +02:00
Tomass
14917440db
fix duplicate plot.py 2025-06-23 15:32:54 +02:00
Teknium
8bf0312b8a
Merge pull request #190 from crStiv/a
fix: multiple typos of different importance
2025-06-22 13:15:39 -07:00
Jeremy Melvin
3bed7c64b9
Ethereum Virtual Machine Text to Transaction Environment (#187)
* EVM-text_to_transaction

* update structure

* Update README

---------

Co-authored-by: Jeremy Melvin <jeremy@openblocklabs.com>
2025-06-20 09:16:00 +10:00
crStiv
b65b614132
Update hpo.py 2025-06-19 22:59:42 +02:00
crStiv
e934094173
Update helpers.py 2025-06-19 22:52:43 +02:00