Commit graph

108 commits

Author SHA1 Message Date
PLippmann
5a130a3a5b Quote fix 2026-01-06 15:14:26 +01:00
PLippmann
7d8123a526 Missing initialization 2026-01-06 15:14:26 +01:00
PLippmann
c927794248 Add SQL Query Generation Environment 2026-01-06 15:14:26 +01:00
Tonny
a25e299c83
Update README.md 2025-12-22 21:53:28 +03:00
Tonny
8da2b5ae29
Update README.md 2025-12-22 21:50:53 +03:00
Tonny
e0b870f28e
Update README.md 2025-12-22 21:50:39 +03:00
Tonny
1761f08211
Update README.md 2025-12-22 21:50:11 +03:00
Tonny
40f3c1f7e7
Update README.md 2025-12-22 21:49:55 +03:00
Juli
b8f0ba2271
Update README.md 2025-11-20 10:15:08 +01:00
Juli
fc594360ff
Update README.md 2025-11-20 10:14:49 +01:00
Juli
6cbc704d40
Update README.md 2025-11-20 10:14:30 +01:00
Juli
98dc606a87
Update README.md 2025-11-20 10:14:01 +01:00
Juli
b255f0b3ae
Update README.md 2025-11-20 10:13:36 +01:00
Ksenchi
5acb61a811
Update README.md 2025-11-12 07:42:18 +01:00
Ksenchi
cb9560a9af
Update README.md 2025-11-12 07:40:41 +01:00
Ksenchi
b6e5c81662
Update README.md 2025-11-12 07:35:07 +01:00
andrewshab
7e918dfd18
Update lean_env.py 2025-10-14 12:28:13 +02:00
andrewshab
7318c70e41
Rename readme.md to README.md 2025-10-14 11:58:28 +02:00
pre-commit-ci[bot]
34cabbb30f [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
2025-09-15 16:41:26 +00:00
pre-commit-ci[bot]
9d7c2772af [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
2025-09-08 19:45:00 +00:00
Allan Niemerg
0f6c06bb56 Move BLEUBERI environment to community folder
- Moved environments/bleuberi to environments/community/bleuberi
      - Updated .gitmodules to reflect new submodule path
      - Fixed pre-commit formatting issues
      - Cleaned up test output files
2025-09-08 14:38:43 -05:00
Alvarez
bad4fb84df
Update plot.py 2025-08-30 19:22:57 +02:00
Abhaykhanna3
9d7bcc523f Fix(PR): Address reviewer feedback
- Remove redundant requirements.txt
- Fix leading newline in prompt templates
2025-07-28 11:48:02 -05:00
Abhaykhanna3
b5234d4214 Add Word Hunt environment for training models on 4x4 letter grids
- Trie-based solver, official scoring, normalized rewards
- Configurable token limit and detailed README with dictionary download link
- Removes large Dictionary.txt from tracking and adds ignore rules
- All tests pass and pre-commit hooks are clean
2025-07-28 00:37:36 -05:00
Skylar Ray
e889324171
fix: correct quantum environment repository URL 2025-07-16 11:00:45 +03:00
Dakota
6386a5e185 fix precommit 2025-07-08 13:46:35 -05:00
pre-commit-ci[bot]
9340791a6c [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
2025-07-07 21:53:41 +00:00
dmahan93
ecc5eebeca
Merge pull request #167 from paulsengh/feat/pay-to-play-env
feat: add pay-to-play environment with mixture of judges and micropay…
2025-07-07 09:02:46 -05:00
Alex Pikme
ba992757e1
fix dead link README.md 2025-06-25 10:51:21 +02:00
dmahan93
5b2b5e9947
Merge pull request #192 from rnkrtt/main
Fix typo in author name Gurning -> Gurung in community README
2025-06-23 10:15:58 -05:00
Merkel Tranjes
af1c98d7a8
Update README.md 2025-06-23 16:23:02 +02:00
Tomass
14917440db
fix duplicate plot.py 2025-06-23 15:32:54 +02:00
Teknium
8bf0312b8a
Merge pull request #190 from crStiv/a
fix: multiple typos of different importance
2025-06-22 13:15:39 -07:00
Jeremy Melvin
3bed7c64b9
Ethereum Virtual Machine Text to Transaction Environment (#187)
* EVM-text_to_transaction

* update structure

* Update README

---------

Co-authored-by: Jeremy Melvin <jeremy@openblocklabs.com>
2025-06-20 09:16:00 +10:00
crStiv
b65b614132
Update hpo.py 2025-06-19 22:59:42 +02:00
crStiv
e934094173
Update helpers.py 2025-06-19 22:52:43 +02:00
leopardracer
0d6297ad35
Update default.yaml 2025-06-18 22:23:15 +03:00
leopardracer
117783f5d5
Update README.md 2025-06-18 22:22:38 +03:00
pre-commit-ci[bot]
baed9b331e [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
2025-06-14 18:25:45 +00:00
FT
ad7f89d5c2
Update accessibility_env.py 2025-06-14 20:24:01 +02:00
FT
db15736775
Update README.md 2025-06-14 20:22:59 +02:00
fuder.eth
6ec3054591
Update README.md 2025-06-13 14:52:30 +02:00
fuder.eth
9c2a495e75
Update plot.py 2025-06-13 14:51:25 +02:00
Emmanuel Ferdman
7dd9bf9c5c
Display cat behaviors file path on error
Signed-off-by: Emmanuel Ferdman <emmanuelferdman@gmail.com>
2025-06-11 16:09:20 -07:00
dmahan93
a26794afd2
Merge pull request #168 from maximevtush/main
Minor Fixes: Typo Correction in README and Message Clarification in Tasks
2025-06-09 14:24:07 -05:00
Dakota
e13526d308 Fix API to accept messages without reward field + comprehensive tests
- Made reward field truly optional in messages (no auto-addition)
- Accept custom roles (dog, cat, etc.) beyond standard ones
- Added 24 new tests for edge cases (tuples, unicode, large content)
- Reorganized test structure: moved from testing/ to atroposlib/tests/
- Fixed legacy API tests and removed tests requiring missing data files

All 43 tests pass\! Fixes message handling for SFT use cases.

🤖 Generated with [Claude Code](https://claude.ai/code)

Co-Authored-By: Claude <noreply@anthropic.com>
2025-06-09 14:03:08 -05:00
Maxim Evtush
16bd33284f
Update tasks.py 2025-06-09 15:39:58 +02:00
Maxim Evtush
d0913d187b
Update README.md 2025-06-09 15:39:04 +02:00
paulsengh
b38f014b9f feat: add pay-to-play environment with mixture of judges and micropayments 2025-06-08 23:36:50 -07:00
Dakota
61fdc37f61 Replace isort with ruff for import sorting
- Update pre-commit config to use ruff with --select=I for imports only
- Apply ruff import sorting to fix pre-commit issues
- Ruff and black work together without conflicts
2025-06-04 11:28:30 -05:00