Commit graph

371 commits

Author SHA1 Message Date
pre-commit-ci[bot]
321478dd5f [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
2025-08-12 06:45:36 +00:00
interstellarninja
2f6025e65b fixing precommit formatting errors 2025-08-12 02:42:47 -04:00
pre-commit-ci[bot]
0d05750841 [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
2025-07-24 10:58:41 +00:00
interstellarninja
12d77a1e44 fixing precommit errors 2025-07-24 06:53:41 -04:00
interstellarninja
9ea8ce26c6 Merge branch 'feat/multiturn_tool_use_env' of github.com:interstellarninja/atropos into feat/multiturn_tool_use_env 2025-06-27 01:41:55 -04:00
interstellarninja
b162813048 allowing only one think block 2025-06-26 23:20:30 -04:00
pre-commit-ci[bot]
34d45d2445 [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
2025-06-26 17:12:42 +00:00
interstellarninja
ee8755c388 using scenario for single, multistep and multiturn tool calls 2025-06-26 13:11:43 -04:00
interstellarninja
45bc484931 option to generate all gpt turns 2025-06-24 08:14:14 -04:00
interstellarninja
60be1bbbe8 BaseConfigEnv subclass for experimental variables 2025-06-06 04:46:53 -04:00
interstellarninja
c5b161764c Fix tool calling turn filtering in multiturn environment
- Change filtering from >= to == MAX_TOOL_CALL_TURNS to ensure exact match
- Add VALIDATE_THINK_BLOCKS flag for optional <think> block validation
- Refactor data structure from flat expected_calls to turn-based expected_calls_by_turn
- Extract helper methods from collect_trajectories for better code organization
- Fix Turn 3 issue where prompts ended with tool responses instead of generating tool calls

🤖 Generated with [Claude Code](https://claude.ai/code)

Co-Authored-By: Claude <noreply@anthropic.com>
2025-06-05 10:49:29 -04:00
interstellarninja
fdfe23ea39 creating multi-turn tool-use rl environment 2025-06-04 14:13:01 -04:00
kilavvy
39261643dc
Update open_robot_env.py 2025-06-02 13:56:48 +02:00
kilavvy
33b8e39599
Update README.md 2025-06-02 13:55:02 +02:00
Teknium
134a9713ce
Merge pull request #147 from zeevick10/main
Fix Typos in MCP Tool Calling Environment Documentation
2025-06-01 14:24:54 -07:00
Emmanuel Ferdman
2bc9111139
Fix contribution guide source
Signed-off-by: Emmanuel Ferdman <emmanuelferdman@gmail.com>
2025-05-31 13:18:17 -07:00
shannonsands
283877dd88
Intern bootcamp env (#146)
* Created registry and started off the env

* Local testing works

* process working but error in gen

* removed old code

* adding debug, it's still not progressing to collect trajectories

* linting

* removed redundant settings
2025-05-31 11:22:59 +10:00
FT
20a90764ac
Update README.md 2025-05-28 14:33:48 +02:00
shannonsands
ea304892ee
Integrate chinguun101 goofy math (#145)
* Add GoofyMath environment for fun, engaging math learning

* linting, moved to community folder

* linting

---------

Co-authored-by: chinguun101 <chinguun@uni.minerva.edu>
2025-05-28 12:11:02 +10:00
shannonsands
1a79132809
Integrate michaelwaves options iv (#144)
* options iv agent

* bug fix

* outputs

* linted and moved to community folder

* linting

---------

Co-authored-by: michaelwaves <michaelyu713705@gmail.com>
2025-05-28 10:57:24 +10:00
fuder.eth
1862b193ee
Update README.md (#118) 2025-05-28 10:24:12 +10:00
shannonsands
f21154ff49
Integrate aniemerg wikipedia (#143)
* initial commit

* initial draft of wikipedia article creation environment

* add openai for rollouts, update requirements, create script to run, etc.

* add configuration, add debugging, fix tool calls, prevent wikipedia access

* now creates html file

* fix output for html page

* check in Claude plan

* fixed formatting and other issues

* add zip file

* update README

* linting, moved to community folder

* linting

* linting

* linting

* linting

---------

Co-authored-by: Allan Niemerg <niemerg@gmail.com>
2025-05-28 10:22:11 +10:00
shannonsands
b774e97215
Integrate subrahmanyam cybersecurity (#142)
* cybersecurity env for offline RL trajectories

* output file addition

* jsonl outputs

* code cleanup

* pulled out outputs and fixing .gitignore

* removed zip file

* gitignore typo fix

* Integrate cybersecurity Sigma rule generation environment

---------

Co-authored-by: Subrahmanyam Arunachalam <subrahmanyam.arunachalam@FVFGK0VTQ05P.local>
2025-05-28 08:41:51 +10:00
Shannon Sands
d2fb30c4d0 linting 2025-05-27 16:36:14 +10:00
Shannon Sands
6acb4cc91a linting 2025-05-27 16:34:52 +10:00
Shannon Sands
e37e02a94c linting, moved env to community folder 2025-05-27 16:30:11 +10:00
Shannon Sands
a403b16ec4 commiting 2025-05-27 16:18:26 +10:00
Shannon Sands
a6ac7a3e42 linting and moved to community 2025-05-27 16:04:50 +10:00
Shannon Sands
8df34efc56 Resolve merge conflict in .gitignore 2025-05-27 15:56:22 +10:00
Shannon Sands
8f08a398c0 linting 2025-05-27 15:46:10 +10:00
Shannon Sands
a353bb6eb1 linting 2025-05-27 15:45:07 +10:00
Shannon Sands
e7e747a396 linting 2025-05-27 15:43:12 +10:00
Shannon Sands
2efb690a24 linting, moved to community 2025-05-27 15:36:24 +10:00
Shannon Sands
33d5e4a25e linting 2025-05-27 15:12:14 +10:00
Shannon Sands
eba7aac72b linting, moving files into community subdirectory 2025-05-27 15:08:30 +10:00
Shannon Sands
d3803f62f2 Fix trailing whitespace in community README 2025-05-27 13:58:40 +10:00
Shannon Sands
f8912ae41d linting, moved to community folder 2025-05-27 13:50:43 +10:00
Shannon Sands
c6a0439ec6 Integrate Sanskrit Poetry Environment from KhoomeiK - Add ChandasMeterReward to reward function registry - Move sanskrit_poetry_env.py to environments/community/sanskrit_poetry/ - Add comprehensive documentation as entry #25 in community README - Environment supports traditional Sanskrit meter validation using chandas classifier - Includes IAST to SLP1 transliteration for accurate meter analysis - Fixed code formatting with pre-commit hooks 2025-05-27 13:29:45 +10:00
Shannon Sands
f789e01347 more linting 2025-05-27 13:15:26 +10:00
Shannon Sands
b82e23f11d more linting 2025-05-27 13:14:01 +10:00
Shannon Sands
54e574d350 more linting 2025-05-27 13:11:36 +10:00
Shannon Sands
89b38a233b more linting 2025-05-27 13:09:07 +10:00
Shannon Sands
bfdf862829 more linting 2025-05-27 13:06:34 +10:00
Shannon Sands
46892c7bdc linting & moved to community 2025-05-27 12:52:37 +10:00
Shannon Sands
7b194642b3 Remove uv.lock file - blocked in gitignore 2025-05-27 12:46:33 +10:00
Shannon Sands
ec2b6f093d linting 2025-05-27 12:29:10 +10:00
Shannon Sands
54967ecae9 linting 2025-05-27 12:15:15 +10:00
Shannon Sands
13a70e09ab Merge remote-tracking branch 'hallerite/protein_env' into merge-hallerite-contributions 2025-05-27 09:05:15 +10:00
Shannon Sands
3d15f0482c linting 2025-05-27 08:59:03 +10:00
Shannon Sands
8b09ace467 Linting, move env to community 2025-05-27 08:53:06 +10:00