atropos

mirror of https://github.com/NousResearch/atropos.git synced 2026-04-25 17:10:42 +00:00

Author	SHA1	Message	Date
Teknium	62cee8ac66	Merge pull request #209 from NousResearch/add-pairwise-judge-environment Add LLM as a judge environment for eval and train based on RewardBench	2025-07-16 13:37:09 -07:00
pre-commit-ci[bot]	6455c305e6	[pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci	2025-07-16 17:51:15 +00:00
teknium	542185bbcc	Merge branch 'add-pairwise-judge-environment' of https://github.com/NousResearch/atropos into add-pairwise-judge-environment	2025-07-16 17:48:44 +00:00
teknium	a43520e619	one last linter...	2025-07-16 17:48:43 +00:00
pre-commit-ci[bot]	eab2c938ea	[pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci	2025-07-16 16:58:42 +00:00
teknium	18f228615d	linter stuff	2025-07-16 16:57:51 +00:00
pre-commit-ci[bot]	ffc210e470	[pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci	2025-07-16 16:51:19 +00:00
teknium	2f37714e84	Merge branch 'add-pairwise-judge-environment' of https://github.com/NousResearch/atropos into add-pairwise-judge-environment	2025-07-16 16:50:04 +00:00
teknium	0113dc906b	add a bunch of extra debugging traces - configurable	2025-07-16 16:49:42 +00:00
Skylar Ray	e889324171	fix: correct quantum environment repository URL	2025-07-16 11:00:45 +03:00
pre-commit-ci[bot]	1af508b27f	[pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci	2025-07-16 07:46:52 +00:00
teknium	10bb22f557	adding debugging	2025-07-16 07:46:17 +00:00
pre-commit-ci[bot]	7d980372d3	[pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci	2025-07-15 18:40:26 +00:00
teknium	02ad3e8661	Merge branch 'add-pairwise-judge-environment' of https://github.com/NousResearch/atropos into add-pairwise-judge-environment	2025-07-15 18:39:52 +00:00
teknium	8aa540275b	add to the envs readme	2025-07-15 18:39:50 +00:00
pre-commit-ci[bot]	9f3e2ee460	[pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci	2025-07-15 18:24:13 +00:00
teknium	856a8455b1	please the precommit gods	2025-07-15 18:20:44 +00:00
pre-commit-ci[bot]	c053a9f134	[pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci	2025-07-15 11:40:22 +00:00
teknium	ce1f72059c	Merge branch 'add-pairwise-judge-environment' of https://github.com/NousResearch/atropos into add-pairwise-judge-environment	2025-07-15 11:39:46 +00:00
teknium	47c396c43f	switch to chat completions endpoint to eval closed lab stuff	2025-07-15 11:39:29 +00:00
pre-commit-ci[bot]	818ec9d7c1	[pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci	2025-07-15 11:18:03 +00:00
teknium	982645ce73	Implement proper ties category scoring	2025-07-15 11:16:15 +00:00
pre-commit-ci[bot]	41c847ddf4	[pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci	2025-07-14 09:43:20 +00:00
teknium	ef04098718	glitch	2025-07-14 09:42:44 +00:00
teknium	51d4d52765	Merge branch 'add-pairwise-judge-environment' of https://github.com/NousResearch/atropos into add-pairwise-judge-environment	2025-07-14 09:42:21 +00:00
teknium	9607880f3d	Lots of updates to the environment to cleanup, add more metrics, make more robust - ties has an issue though	2025-07-14 09:39:00 +00:00
pre-commit-ci[bot]	107809260d	[pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci	2025-07-12 11:23:13 +00:00
teknium	e83d796c74	add pairwise judgement environment	2025-07-12 11:15:56 +00:00
hjc-puro	75a4264f8d	Merge pull request #208 from NousResearch/2025-07-08-evals Add `evaluate_log` method, gsm8k example	2025-07-12 06:45:05 +08:00
hjc-puro	6e9baaf9d8	table	2025-07-11 09:52:19 +00:00
hjc-puro	352e1b8f88	comments	2025-07-11 03:55:16 +00:00
hjc-puro	b06332623d	move time import	2025-07-11 00:45:24 +00:00
Philip Lippmann	17faebae03	Enhanced Pydantic Schema Following Environment with Dynamic Error Introduction and Editing Task Support (#185) * New JSON env and documentation * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Removed seperate JSON generation environment * Updated pydantic environment with edit functionality * Error helper function * Updated README * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Fixed pre-commit issues --------- Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>	2025-07-11 10:44:16 +10:00
hjc-puro	3e1eba6e92	rename method	2025-07-11 00:39:28 +00:00
pre-commit-ci[bot]	5aacb1ea9b	[pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci	2025-07-11 00:26:48 +00:00
hjc-puro	e601251893	gsm8k eval example	2025-07-11 00:22:36 +00:00
hjc-puro	afe4fc318f	sample level logging	2025-07-10 02:03:07 +00:00
hjc-puro	f4de3ad6f5	add printing	2025-07-09 23:35:26 +00:00
Dakota	6386a5e185	fix precommit	2025-07-08 13:46:35 -05:00
dmahan93	0328680cbe	Merge pull request #205 from futreall/main chore: spelling cleanup	2025-07-08 13:40:11 -05:00
pre-commit-ci[bot]	9340791a6c	[pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci	2025-07-07 21:53:41 +00:00
futreall	1f52b347ae	chore: spelling cleanup	2025-07-07 19:17:29 +03:00
dmahan93	ecc5eebeca	Merge pull request #167 from paulsengh/feat/pay-to-play-env feat: add pay-to-play environment with mixture of judges and micropay…	2025-07-07 09:02:46 -05:00
dmahan93	58446dbcb1	Merge pull request #204 from NousResearch/multienv-enforce-mins Multienv with enforced minimum samples in a batch	2025-07-07 08:53:43 -05:00
Dakota	08e14cc745	feat: add minimum batch allocation support for environments - Add min_batch_allocation parameter to ensure environments contribute minimum proportion to each batch - Implement grab_batch_with_minimum_allocations function with proper scaling when allocations exceed 100% - Add mixed-size group buffering to handle variable-sized data submissions - Update server to use minimum allocation logic when any env has min_batch_allocation set - Add comprehensive tests for minimum allocation scenarios - Update documentation in API README and CONFIG.md - Update example environments to demonstrate the feature This feature allows critical environments to guarantee they contribute at least a specified proportion (0.0-1.0) to each training batch, ensuring important data sources are always represented during training. 🤖 Generated with [Claude Code](https://claude.ai/code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-07-07 08:50:28 -05:00
emmmm	10f1b466b6	Update curriculum.py	2025-07-02 10:22:33 +02:00
Alex Pikme	ba992757e1	fix dead link README.md	2025-06-25 10:51:21 +02:00
dmahan93	5b2b5e9947	Merge pull request #192 from rnkrtt/main Fix typo in author name Gurning -> Gurung in community README	2025-06-23 10:15:58 -05:00
Merkel Tranjes	af1c98d7a8	Update README.md	2025-06-23 16:23:02 +02:00
Tomass	14917440db	fix duplicate plot.py	2025-06-23 15:32:54 +02:00


459 commits
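
Note on commit 08e14cc745: its message says each environment can set min_batch_allocation (a proportion from 0.0 to 1.0) and that the allocations are scaled when they add up to more than 100%. The snippet below is only a rough sketch of that arithmetic, not the code from the repository. The function names scale_min_allocations and min_samples_per_env are hypothetical, and both the proportional rescaling and the rounding down are assumptions; the actual grab_batch_with_minimum_allocations function may well work differently.

```python
from math import floor


def scale_min_allocations(min_allocs: dict[str, float]) -> dict[str, float]:
    """Hypothetical helper: shrink minimum proportions so they sum to at most 1.0.

    Assumption: when the requested minimums exceed 100% of the batch, each one
    is divided by the total, which preserves their relative sizes.
    """
    for name, proportion in min_allocs.items():
        if not 0.0 <= proportion <= 1.0:
            raise ValueError(f"{name}: proportion must be in [0.0, 1.0]")
    total = sum(min_allocs.values())
    if total <= 1.0:
        # Nothing to scale; return a copy so the caller's dict is untouched.
        return dict(min_allocs)
    return {name: proportion / total for name, proportion in min_allocs.items()}


def min_samples_per_env(batch_size: int, min_allocs: dict[str, float]) -> dict[str, int]:
    """Hypothetical helper: turn scaled proportions into minimum sample counts.

    Assumption: counts are rounded down, so the minimums never exceed batch_size.
    """
    scaled = scale_min_allocations(min_allocs)
    return {name: floor(batch_size * p) for name, p in scaled.items()}


if __name__ == "__main__":
    # Two environments each ask for 75% of a 100-sample batch (150% in total).
    print(min_samples_per_env(100, {"env_a": 0.75, "env_b": 0.75}))
```

In this sketch the two requests of 0.75 are scaled to 0.5 each, so each environment is guaranteed 50 of the 100 samples. Whatever is left after the minimums are met would be filled from any environment, which is a step the sketch leaves out.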