Commit graph

756 commits

Author SHA1 Message Date
FT
ad7f89d5c2
Update accessibility_env.py 2025-06-14 20:24:01 +02:00
FT
db15736775
Update README.md 2025-06-14 20:22:59 +02:00
pre-commit-ci[bot]
7fa9980b5c [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
2025-06-14 14:47:51 +00:00
teknium1
ad1bdf7f80 Add cycling curriculum, difficulty threshold, update datadumps 2025-06-14 07:44:47 -07:00
fuder.eth
6ec3054591
Update README.md 2025-06-13 14:52:30 +02:00
fuder.eth
9c2a495e75
Update plot.py 2025-06-13 14:51:25 +02:00
Teknium
e75ce6ccce
Merge pull request #176 from emmanuel-ferdman/main
Display cat behaviors file path on error
2025-06-13 04:42:48 -07:00
Teknium
eeeb0f1cd2
Merge pull request #172 from NousResearch/improve-data-dumping-in-sweRL
add additional data dumping features
2025-06-13 04:40:11 -07:00
pre-commit-ci[bot]
dcb926b73f [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
2025-06-13 11:39:36 +00:00
Teknium
32b739a757
Merge branch 'main' into add-format-following-environment 2025-06-13 04:39:06 -07:00
teknium1
ec6b9bb626 Merge branch 'letter-counting-environment' of https://github.com/NousResearch/atropos into letter-counting-environment 2025-06-13 04:27:32 -07:00
pre-commit-ci[bot]
2f9132ae63 [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
2025-06-12 15:20:13 +00:00
Dakota
d3e6ddddbc fixed pre-commit :) 2025-06-12 10:12:49 -05:00
teknium1
81cb80982c update some base config options 2025-06-12 00:41:55 -07:00
teknium1
7a89524345 add readme section for the environment 2025-06-12 00:36:03 -07:00
teknium1
4a7e5b2b7c Many updates 2025-06-12 00:32:50 -07:00
Emmanuel Ferdman
7dd9bf9c5c
Display cat behaviors file path on error
Signed-off-by: Emmanuel Ferdman <emmanuelferdman@gmail.com>
2025-06-11 16:09:20 -07:00
teknium1
199ae15d0b initital letter counting environment 2025-06-11 15:27:21 -07:00
teknium1
54268a76ce add additional data dumping features 2025-06-10 01:59:25 -07:00
teknium1
71b1e7023b Make default configs better 2025-06-10 01:30:40 -07:00
teknium1
7b91614d46 add more info on rejection sampling in readme 2025-06-10 01:25:39 -07:00
teknium1
8e1d160eef add answer format environment for rejection sampling 2025-06-10 01:20:49 -07:00
dmahan93
a26794afd2
Merge pull request #168 from maximevtush/main
Minor Fixes: Typo Correction in README and Message Clarification in Tasks
2025-06-09 14:24:07 -05:00
Dakota
e13526d308 Fix API to accept messages without reward field + comprehensive tests
- Made reward field truly optional in messages (no auto-addition)
- Accept custom roles (dog, cat, etc.) beyond standard ones
- Added 24 new tests for edge cases (tuples, unicode, large content)
- Reorganized test structure: moved from testing/ to atroposlib/tests/
- Fixed legacy API tests and removed tests requiring missing data files

All 43 tests pass! Fixes message handling for SFT use cases.

🤖 Generated with [Claude Code](https://claude.ai/code)

Co-Authored-By: Claude <noreply@anthropic.com>
2025-06-09 14:03:08 -05:00
Maxim Evtush
16bd33284f
Update tasks.py 2025-06-09 15:39:58 +02:00
Maxim Evtush
d0913d187b
Update README.md 2025-06-09 15:39:04 +02:00
paulsengh
b38f014b9f feat: add pay-to-play environment with mixture of judges and micropayments 2025-06-08 23:36:50 -07:00
teknium1
f999f90627 add support for composite task 2025-06-08 04:39:50 -07:00
teknium1
398e3ddeaa add randomization for complexity as well as curriculum support 2025-06-08 03:07:07 -07:00
teknium1
a4b22c38d7 make eval vars config options 2025-06-06 15:24:00 -07:00
teknium1
be94857084 add seed to default configs for clarity 2025-06-06 14:56:55 -07:00
interstellarninja
60be1bbbe8 BaseConfigEnv subclass for experimental variables 2025-06-06 04:46:53 -04:00
teknium1
79188d8d6a Add reasoning gym env 2025-06-05 17:30:25 -07:00
interstellarninja
c5b161764c Fix tool calling turn filtering in multiturn environment
- Change filtering from >= to == MAX_TOOL_CALL_TURNS to ensure exact match
- Add VALIDATE_THINK_BLOCKS flag for optional <think> block validation
- Refactor data structure from flat expected_calls to turn-based expected_calls_by_turn
- Extract helper methods from collect_trajectories for better code organization
- Fix Turn 3 issue where prompts ended with tool responses instead of generating tool calls

🤖 Generated with [Claude Code](https://claude.ai/code)

Co-Authored-By: Claude <noreply@anthropic.com>
2025-06-05 10:49:29 -04:00
interstellarninja
fdfe23ea39 creating multi-turn tool-use rl environment 2025-06-04 14:13:01 -04:00
Dakota
61fdc37f61 Replace isort with ruff for import sorting
- Update pre-commit config to use ruff with --select=I for imports only
- Apply ruff import sorting to fix pre-commit issues
- Ruff and black work together without conflicts
2025-06-04 11:28:30 -05:00
Dakota
55cdb83cbf Update pre-commit hooks to latest versions and fix issues
- Update pre-commit hooks: v5.0.0, black 25.1.0, isort 6.0.1, flake8 7.2.0
- Fix isort import ordering in lean_proof_env.py
- Fix flake8 F824 false positive in spatial_env.py with noqa comment
2025-06-04 10:58:37 -05:00
Dakota
f3bbc6a42d Fix import ordering with isort
- Move typing_extensions import to proper location
- Satisfy pre-commit isort requirements
2025-06-04 10:40:41 -05:00
Teknium
eaf1e8e667
Merge pull request #157 from NousResearch/add-struct-output-pydantic-validator-environment
Add Pydantic Schema to Structured Output Environment
2025-06-04 02:10:44 -07:00
teknium1
49adab3f4b Cleanup 2025-06-03 16:10:59 -07:00
teknium1
aa3d01e5ea Add YAML and TOML support, experimental XML support, add multi-format randomizer 2025-06-03 15:06:21 -07:00
teknium1
e359a72b06 add pydantic schema environment 2025-06-02 16:58:12 -07:00
kilavvy
39261643dc
Update open_robot_env.py 2025-06-02 13:56:48 +02:00
kilavvy
33b8e39599
Update README.md 2025-06-02 13:55:02 +02:00
Teknium
134a9713ce
Merge pull request #147 from zeevick10/main
Fix Typos in MCP Tool Calling Environment Documentation
2025-06-01 14:24:54 -07:00
Emmanuel Ferdman
2bc9111139
Fix contribution guide source
Signed-off-by: Emmanuel Ferdman <emmanuelferdman@gmail.com>
2025-05-31 13:18:17 -07:00
shannonsands
283877dd88
Intern bootcamp env (#146)
* Created registry and started off the env

* Local testing works

* process working but error in gen

* removed old code

* adding debug, it's still not progressing to collect trajectories

* linting

* removed redundant settings
2025-05-31 11:22:59 +10:00
FT
20a90764ac
Update README.md 2025-05-28 14:33:48 +02:00
shannonsands
ea304892ee
Integrate chinguun101 goofy math (#145)
* Add GoofyMath environment for fun, engaging math learning

* linting, moved to community folder

* linting

---------

Co-authored-by: chinguun101 <chinguun@uni.minerva.edu>
2025-05-28 12:11:02 +10:00
shannonsands
1a79132809
Integrate michaelwaves options iv (#144)
* options iv agent

* bug fix

* outputs

* linted and moved to community folder

* linting

---------

Co-authored-by: michaelwaves <michaelyu713705@gmail.com>
2025-05-28 10:57:24 +10:00