- Made reward field truly optional in messages (no auto-addition)
- Accept custom roles (dog, cat, etc.) beyond standard ones
- Added 24 new tests for edge cases (tuples, unicode, large content)
- Reorganized test structure: moved from testing/ to atroposlib/tests/
- Fixed legacy API tests and removed tests requiring missing data files
All 43 tests pass\! Fixes message handling for SFT use cases.
🤖 Generated with [Claude Code](https://claude.ai/code)
Co-Authored-By: Claude <noreply@anthropic.com>
- Change filtering from >= to == MAX_TOOL_CALL_TURNS to ensure exact match
- Add VALIDATE_THINK_BLOCKS flag for optional <think> block validation
- Refactor data structure from flat expected_calls to turn-based expected_calls_by_turn
- Extract helper methods from collect_trajectories for better code organization
- Fix Turn 3 issue where prompts ended with tool responses instead of generating tool calls
🤖 Generated with [Claude Code](https://claude.ai/code)
Co-Authored-By: Claude <noreply@anthropic.com>
- Update pre-commit config to use ruff with --select=I for imports only
- Apply ruff import sorting to fix pre-commit issues
- Ruff and black work together without conflicts
* Created registry and started off the env
* Local testing works
* process working but error in gen
* removed old code
* adding debug, it's still not progressing to collect trajectories
* linting
* removed redundant settings
* Add GoofyMath environment for fun, engaging math learning
* linting, moved to community folder
* linting
---------
Co-authored-by: chinguun101 <chinguun@uni.minerva.edu>
* options iv agent
* bug fix
* outputs
* linted and moved to community folder
* linting
---------
Co-authored-by: michaelwaves <michaelyu713705@gmail.com>
* initial commit
* initial draft of wikipedia article creation environment
* add openai for rollouts, update requirements, create script to run, etc.
* add configuration, add debugging, fix tool calls, prevent wikipedia access
* now creates html file
* fix output for html page
* check in Claude plan
* fixed formatting and other issues
* add zip file
* update README
* linting, moved to community folder
* linting
* linting
* linting
* linting
---------
Co-authored-by: Allan Niemerg <niemerg@gmail.com>