atropos

mirror of https://github.com/NousResearch/atropos.git synced 2026-04-24 17:04:55 +00:00

Author	SHA1	Message	Date
leopardracer	0d6297ad35	Update default.yaml	2025-06-18 22:23:15 +03:00
leopardracer	117783f5d5	Update README.md	2025-06-18 22:22:38 +03:00
Teknium	202ecff996	Merge pull request #170 from NousResearch/add-format-following-environment Add format following environment	2025-06-16 06:50:17 -07:00
Teknium	9b93e56dbe	Merge pull request #181 from NousResearch/updates-to-instructfollowing-env Add cycling curriculum, difficulty threshold, update datadumps	2025-06-16 06:49:59 -07:00
teknium1	81631b9c59	Merge branch 'updates-to-instructfollowing-env' of https://github.com/NousResearch/atropos into updates-to-instructfollowing-env	2025-06-14 12:32:31 -07:00
teknium1	bf78ad44e3	Add optional solve flagging strategy	2025-06-14 12:32:27 -07:00
pre-commit-ci[bot]	baed9b331e	[pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci	2025-06-14 18:25:45 +00:00
FT	ad7f89d5c2	Update accessibility_env.py	2025-06-14 20:24:01 +02:00
FT	db15736775	Update README.md	2025-06-14 20:22:59 +02:00
pre-commit-ci[bot]	7fa9980b5c	[pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci	2025-06-14 14:47:51 +00:00
teknium1	ad1bdf7f80	Add cycling curriculum, difficulty threshold, update datadumps	2025-06-14 07:44:47 -07:00
fuder.eth	6ec3054591	Update README.md	2025-06-13 14:52:30 +02:00
fuder.eth	9c2a495e75	Update plot.py	2025-06-13 14:51:25 +02:00
Teknium	e75ce6ccce	Merge pull request #176 from emmanuel-ferdman/main Display cat behaviors file path on error	2025-06-13 04:42:48 -07:00
Teknium	eeeb0f1cd2	Merge pull request #172 from NousResearch/improve-data-dumping-in-sweRL add additional data dumping features	2025-06-13 04:40:11 -07:00
pre-commit-ci[bot]	dcb926b73f	[pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci	2025-06-13 11:39:36 +00:00
Teknium	32b739a757	Merge branch 'main' into add-format-following-environment	2025-06-13 04:39:06 -07:00
teknium1	ec6b9bb626	Merge branch 'letter-counting-environment' of https://github.com/NousResearch/atropos into letter-counting-environment	2025-06-13 04:27:32 -07:00
pre-commit-ci[bot]	2f9132ae63	[pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci	2025-06-12 15:20:13 +00:00
Dakota	d3e6ddddbc	fixed pre-commit :)	2025-06-12 10:12:49 -05:00
teknium1	81cb80982c	update some base config options	2025-06-12 00:41:55 -07:00
teknium1	7a89524345	add readme section for the environment	2025-06-12 00:36:03 -07:00
teknium1	4a7e5b2b7c	Many updates	2025-06-12 00:32:50 -07:00
Emmanuel Ferdman	7dd9bf9c5c	Display cat behaviors file path on error Signed-off-by: Emmanuel Ferdman <emmanuelferdman@gmail.com>	2025-06-11 16:09:20 -07:00
teknium1	199ae15d0b	initital letter counting environment	2025-06-11 15:27:21 -07:00
teknium1	54268a76ce	add additional data dumping features	2025-06-10 01:59:25 -07:00
teknium1	71b1e7023b	Make default configs better	2025-06-10 01:30:40 -07:00
teknium1	7b91614d46	add more info on rejection sampling in readme	2025-06-10 01:25:39 -07:00
teknium1	8e1d160eef	add answer format environment for rejection sampling	2025-06-10 01:20:49 -07:00
dmahan93	a26794afd2	Merge pull request #168 from maximevtush/main Minor Fixes: Typo Correction in README and Message Clarification in Tasks	2025-06-09 14:24:07 -05:00
Dakota	e13526d308	Fix API to accept messages without reward field + comprehensive tests - Made reward field truly optional in messages (no auto-addition) - Accept custom roles (dog, cat, etc.) beyond standard ones - Added 24 new tests for edge cases (tuples, unicode, large content) - Reorganized test structure: moved from testing/ to atroposlib/tests/ - Fixed legacy API tests and removed tests requiring missing data files All 43 tests pass\! Fixes message handling for SFT use cases. 🤖 Generated with [Claude Code](https://claude.ai/code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-06-09 14:03:08 -05:00
Maxim Evtush	16bd33284f	Update tasks.py	2025-06-09 15:39:58 +02:00
Maxim Evtush	d0913d187b	Update README.md	2025-06-09 15:39:04 +02:00
paulsengh	b38f014b9f	feat: add pay-to-play environment with mixture of judges and micropayments	2025-06-08 23:36:50 -07:00
teknium1	f999f90627	add support for composite task	2025-06-08 04:39:50 -07:00
teknium1	398e3ddeaa	add randomization for complexity as well as curriculum support	2025-06-08 03:07:07 -07:00
teknium1	a4b22c38d7	make eval vars config options	2025-06-06 15:24:00 -07:00
teknium1	be94857084	add seed to default configs for clarity	2025-06-06 14:56:55 -07:00
interstellarninja	60be1bbbe8	BaseConfigEnv subclass for experimental variables	2025-06-06 04:46:53 -04:00
teknium1	79188d8d6a	Add reasoning gym env	2025-06-05 17:30:25 -07:00
interstellarninja	c5b161764c	Fix tool calling turn filtering in multiturn environment - Change filtering from >= to == MAX_TOOL_CALL_TURNS to ensure exact match - Add VALIDATE_THINK_BLOCKS flag for optional <think> block validation - Refactor data structure from flat expected_calls to turn-based expected_calls_by_turn - Extract helper methods from collect_trajectories for better code organization - Fix Turn 3 issue where prompts ended with tool responses instead of generating tool calls 🤖 Generated with [Claude Code](https://claude.ai/code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-06-05 10:49:29 -04:00
interstellarninja	fdfe23ea39	creating multi-turn tool-use rl environment	2025-06-04 14:13:01 -04:00
Dakota	61fdc37f61	Replace isort with ruff for import sorting - Update pre-commit config to use ruff with --select=I for imports only - Apply ruff import sorting to fix pre-commit issues - Ruff and black work together without conflicts	2025-06-04 11:28:30 -05:00
Dakota	55cdb83cbf	Update pre-commit hooks to latest versions and fix issues - Update pre-commit hooks: v5.0.0, black 25.1.0, isort 6.0.1, flake8 7.2.0 - Fix isort import ordering in lean_proof_env.py - Fix flake8 F824 false positive in spatial_env.py with noqa comment	2025-06-04 10:58:37 -05:00
Dakota	f3bbc6a42d	Fix import ordering with isort - Move typing_extensions import to proper location - Satisfy pre-commit isort requirements	2025-06-04 10:40:41 -05:00
Teknium	eaf1e8e667	Merge pull request #157 from NousResearch/add-struct-output-pydantic-validator-environment Add Pydantic Schema to Structured Output Environment	2025-06-04 02:10:44 -07:00
teknium1	49adab3f4b	Cleanup	2025-06-03 16:10:59 -07:00
teknium1	aa3d01e5ea	Add YAML and TOML support, experimental XML support, add multi-format randomizer	2025-06-03 15:06:21 -07:00
teknium1	e359a72b06	add pydantic schema environment	2025-06-02 16:58:12 -07:00
kilavvy	39261643dc	Update open_robot_env.py	2025-06-02 13:56:48 +02:00

... 6 7 8 9 10 ...

763 commits