Commit graph

82 commits

Author SHA1 Message Date
pre-commit-ci[bot]
9340791a6c [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
2025-07-07 21:53:41 +00:00
dmahan93
ecc5eebeca
Merge pull request #167 from paulsengh/feat/pay-to-play-env
feat: add pay-to-play environment with mixture of judges and micropay…
2025-07-07 09:02:46 -05:00
Alex Pikme
ba992757e1
fix dead link README.md 2025-06-25 10:51:21 +02:00
dmahan93
5b2b5e9947
Merge pull request #192 from rnkrtt/main
Fix typo in author name Gurning -> Gurung in community README
2025-06-23 10:15:58 -05:00
Merkel Tranjes
af1c98d7a8
Update README.md 2025-06-23 16:23:02 +02:00
Tomass
14917440db
fix duplicate plot.py 2025-06-23 15:32:54 +02:00
Teknium
8bf0312b8a
Merge pull request #190 from crStiv/a
fix: multiple typos of different importance
2025-06-22 13:15:39 -07:00
Jeremy Melvin
3bed7c64b9
Ethereum Virtual Machine Text to Transaction Environment (#187)
* EVM-text_to_transaction

* update structure

* Update README

---------

Co-authored-by: Jeremy Melvin <jeremy@openblocklabs.com>
2025-06-20 09:16:00 +10:00
crStiv
b65b614132
Update hpo.py 2025-06-19 22:59:42 +02:00
crStiv
e934094173
Update helpers.py 2025-06-19 22:52:43 +02:00
leopardracer
0d6297ad35
Update default.yaml 2025-06-18 22:23:15 +03:00
leopardracer
117783f5d5
Update README.md 2025-06-18 22:22:38 +03:00
pre-commit-ci[bot]
baed9b331e [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
2025-06-14 18:25:45 +00:00
FT
ad7f89d5c2
Update accessibility_env.py 2025-06-14 20:24:01 +02:00
FT
db15736775
Update README.md 2025-06-14 20:22:59 +02:00
fuder.eth
6ec3054591
Update README.md 2025-06-13 14:52:30 +02:00
fuder.eth
9c2a495e75
Update plot.py 2025-06-13 14:51:25 +02:00
Emmanuel Ferdman
7dd9bf9c5c
Display cat behaviors file path on error
Signed-off-by: Emmanuel Ferdman <emmanuelferdman@gmail.com>
2025-06-11 16:09:20 -07:00
dmahan93
a26794afd2
Merge pull request #168 from maximevtush/main
Minor Fixes: Typo Correction in README and Message Clarification in Tasks
2025-06-09 14:24:07 -05:00
Dakota
e13526d308 Fix API to accept messages without reward field + comprehensive tests
- Made reward field truly optional in messages (no auto-addition)
- Accept custom roles (dog, cat, etc.) beyond standard ones
- Added 24 new tests for edge cases (tuples, unicode, large content)
- Reorganized test structure: moved from testing/ to atroposlib/tests/
- Fixed legacy API tests and removed tests requiring missing data files

All 43 tests pass\! Fixes message handling for SFT use cases.

🤖 Generated with [Claude Code](https://claude.ai/code)

Co-Authored-By: Claude <noreply@anthropic.com>
2025-06-09 14:03:08 -05:00
Maxim Evtush
16bd33284f
Update tasks.py 2025-06-09 15:39:58 +02:00
Maxim Evtush
d0913d187b
Update README.md 2025-06-09 15:39:04 +02:00
paulsengh
b38f014b9f feat: add pay-to-play environment with mixture of judges and micropayments 2025-06-08 23:36:50 -07:00
Dakota
61fdc37f61 Replace isort with ruff for import sorting
- Update pre-commit config to use ruff with --select=I for imports only
- Apply ruff import sorting to fix pre-commit issues
- Ruff and black work together without conflicts
2025-06-04 11:28:30 -05:00
Dakota
55cdb83cbf Update pre-commit hooks to latest versions and fix issues
- Update pre-commit hooks: v5.0.0, black 25.1.0, isort 6.0.1, flake8 7.2.0
- Fix isort import ordering in lean_proof_env.py
- Fix flake8 F824 false positive in spatial_env.py with noqa comment
2025-06-04 10:58:37 -05:00
Dakota
f3bbc6a42d Fix import ordering with isort
- Move typing_extensions import to proper location
- Satisfy pre-commit isort requirements
2025-06-04 10:40:41 -05:00
kilavvy
39261643dc
Update open_robot_env.py 2025-06-02 13:56:48 +02:00
kilavvy
33b8e39599
Update README.md 2025-06-02 13:55:02 +02:00
Teknium
134a9713ce
Merge pull request #147 from zeevick10/main
Fix Typos in MCP Tool Calling Environment Documentation
2025-06-01 14:24:54 -07:00
Emmanuel Ferdman
2bc9111139
Fix contribution guide source
Signed-off-by: Emmanuel Ferdman <emmanuelferdman@gmail.com>
2025-05-31 13:18:17 -07:00
FT
20a90764ac
Update README.md 2025-05-28 14:33:48 +02:00
shannonsands
ea304892ee
Integrate chinguun101 goofy math (#145)
* Add GoofyMath environment for fun, engaging math learning

* linting, moved to community folder

* linting

---------

Co-authored-by: chinguun101 <chinguun@uni.minerva.edu>
2025-05-28 12:11:02 +10:00
shannonsands
1a79132809
Integrate michaelwaves options iv (#144)
* options iv agent

* bug fix

* outputs

* linted and moved to community folder

* linting

---------

Co-authored-by: michaelwaves <michaelyu713705@gmail.com>
2025-05-28 10:57:24 +10:00
shannonsands
f21154ff49
Integrate aniemerg wikipedia (#143)
* initial commit

* initial draft of wikipedia article creation environment

* add openai for rollouts, update requirements, create script to run, etc.

* add configuration, add debugging, fix tool calls, prevent wikipedia access

* now creates html file

* fix output for html page

* check in Claude plan

* fixed formatting and other issues

* add zip file

* update README

* linting, moved to community folder

* linting

* linting

* linting

* linting

---------

Co-authored-by: Allan Niemerg <niemerg@gmail.com>
2025-05-28 10:22:11 +10:00
shannonsands
b774e97215
Integrate subrahmanyam cybersecurity (#142)
* cybersecurity env for offline RL trajectories

* output file addition

* jsonl outputs

* code cleanup

* pulled out outputs and fixing .gitignore

* removed zip file

* gitignore typo fix

* Integrate cybersecurity Sigma rule generation environment

---------

Co-authored-by: Subrahmanyam Arunachalam <subrahmanyam.arunachalam@FVFGK0VTQ05P.local>
2025-05-28 08:41:51 +10:00
Shannon Sands
d2fb30c4d0 linting 2025-05-27 16:36:14 +10:00
Shannon Sands
6acb4cc91a linting 2025-05-27 16:34:52 +10:00
Shannon Sands
e37e02a94c linting, moved env to community folder 2025-05-27 16:30:11 +10:00
Shannon Sands
a6ac7a3e42 linting and moved to community 2025-05-27 16:04:50 +10:00
Shannon Sands
8f08a398c0 linting 2025-05-27 15:46:10 +10:00
Shannon Sands
a353bb6eb1 linting 2025-05-27 15:45:07 +10:00
Shannon Sands
e7e747a396 linting 2025-05-27 15:43:12 +10:00
Shannon Sands
2efb690a24 linting, moved to community 2025-05-27 15:36:24 +10:00
Shannon Sands
33d5e4a25e linting 2025-05-27 15:12:14 +10:00
Shannon Sands
eba7aac72b linting, moving files into community subdirectory 2025-05-27 15:08:30 +10:00
Shannon Sands
d3803f62f2 Fix trailing whitespace in community README 2025-05-27 13:58:40 +10:00
Shannon Sands
f8912ae41d linting, moved to community folder 2025-05-27 13:50:43 +10:00
Shannon Sands
c6a0439ec6 Integrate Sanskrit Poetry Environment from KhoomeiK - Add ChandasMeterReward to reward function registry - Move sanskrit_poetry_env.py to environments/community/sanskrit_poetry/ - Add comprehensive documentation as entry #25 in community README - Environment supports traditional Sanskrit meter validation using chandas classifier - Includes IAST to SLP1 transliteration for accurate meter analysis - Fixed code formatting with pre-commit hooks 2025-05-27 13:29:45 +10:00
Shannon Sands
89b38a233b more linting 2025-05-27 13:09:07 +10:00
Shannon Sands
bfdf862829 more linting 2025-05-27 13:06:34 +10:00