reasoning-gym

mirror of https://github.com/open-thought/reasoning-gym.git synced 2026-04-19 12:58:07 +00:00

Author	SHA1	Message	Date
Andreas Köpf	052d76d2ca	Merge pull request #55 from Miserlou/rich/fflogic Adds Zebra/Murdle/Einstein/Grid Style Puzzles	2025-02-03 17:52:22 +01:00
Rich Jones	4a19aa8f14	readme	2025-02-03 16:49:18 +01:00
Rich Jones	4d950e562a	cleanup	2025-02-03 16:47:29 +01:00
Andreas Köpf	c6fbff7d8f	Merge pull request #52 from cavit99/main Improve Word Ladder and add complete example suite	2025-02-03 15:16:29 +01:00
Rich Jones	7274f79c50	precommit hook linting	2025-02-03 14:40:58 +01:00
Rich Jones	0c9094e9f4	adds zebrapuzzles	2025-02-03 14:34:57 +01:00
Andreas Koepf	5d84b6bec5	update GALLERY.md	2025-02-03 12:42:11 +01:00
Andreas Köpf	57c605002a	Merge pull request #50 from joenorton/toh-score-answer feat: toh scoring	2025-02-03 12:39:45 +01:00
Cavit Erginsoy	aff0fecef4	lint	2025-02-03 11:35:30 +00:00
Andreas Köpf	00961bc72a	Merge pull request #39 from joenorton/palindrome_generation feat: add palindrome_generation	2025-02-03 10:10:29 +01:00
Cavit Erginsoy	9b1068ea39	Merge remote-tracking branch 'upstream/main'	2025-02-03 07:44:32 +00:00
Cavit Erginsoy	15f5c8158b	Add word ladder dataset to GALLERY.md - Documented word ladder dataset configuration and generation details - Included three example tasks demonstrating word transformation scenarios - Updated table of contents with new dataset entry	2025-02-03 07:25:11 +00:00
Cavit Erginsoy	4355d5a5fe	Comprehensive test suite for word ladder dataset generation - Added extensive test coverage for word ladder configuration validation - Implemented tests for neighbor computation, path finding, and graph caching - Included performance and edge case tests - Verified solution optimality and path generation logic - Added tests for word set loading and pair generation	2025-02-03 07:22:03 +00:00
Cavit Erginsoy	d5065955a8	Refactor word ladder generation with improved validation and graph-based path finding - Enhanced configuration validation with size and length constraints - Implemented graph-based neighbor computation and caching - Simplified path finding algorithm with more robust length checking - Added more flexible word set loading with configurable length ranges - Improved error handling for dataset generation	2025-02-03 07:21:43 +00:00
Cavit Erginsoy	7b61fc5043	Completed: full example suite	2025-02-03 07:21:12 +00:00
Cavit Erginsoy	08f300911f	add .DS_Store	2025-02-03 07:20:17 +00:00
Cavit Erginsoy	ade33e1a22	filtered out lesser known words to aid model learning ease	2025-02-03 07:19:30 +00:00
Cavit Erginsoy	c0a8a9e46f	update test to match	2025-02-03 03:27:49 +00:00
Joe Norton	731d36f43f	add palindrome score_answer & test	2025-02-02 18:04:47 -08:00
Joe Norton	9841f64ccd	add dependency	2025-02-02 16:46:07 -08:00
Joe Norton	8222823c28	add toh score_answer	2025-02-02 16:37:20 -08:00
Andreas Koepf	3aeec71523	add attribution for arc-1d and unit tests	2025-02-02 23:45:25 +01:00
Andreas Koepf (aider)	a9549057e9	test: Add scoring tests for Arc1D dataset answer evaluation	2025-02-02 23:31:20 +01:00
Andreas Koepf	b7532f66ca	test: Remove test_arc_1d.py file from tests directory	2025-02-02 23:30:15 +01:00
Andreas Koepf (aider)	978a0879f7	feat: Add mirrored and inverse task variations to ARC_1D_TASKS	2025-02-02 23:21:46 +01:00
Andreas Koepf	9a1270dd95	add arc_1d dataset	2025-02-02 23:03:56 +01:00
Andreas Koepf (aider)	a060348a9c	fix: Resolve undefined task function references in arc_1d.py	2025-02-02 22:49:28 +01:00
Andreas Koepf (aider)	b599d6e1a2	feat: Add Arc1D dataset with comprehensive task generation and configuration	2025-02-02 22:49:00 +01:00
Andreas Koepf (aider)	905ef7b89d	feat: Add missing task transformation imports to test_arc_1d.py	2025-02-02 22:42:43 +01:00
Andreas Koepf (aider)	84e4f1c5bc	feat: Add task augmentation functions mirror, inverse, and identity to arc_1d.py	2025-02-02 22:42:21 +01:00
Andreas Koepf	01cc239746	add quantum lock answer format hint	2025-02-02 22:35:43 +01:00
Andreas Koepf	82196bd2df	bump version to 0.1.3, uploaded to pypi	2025-02-02 22:26:24 +01:00
Andreas Koepf	057b9f2034	auto-load simple/intermediate integration tasks, stable order for n_queens (set was not stable)	2025-02-02 22:18:54 +01:00
Andreas Koepf (aider)	751773828f	test: Add unit test for score_answer method in N-Queens dataset	2025-02-02 22:15:49 +01:00
Andreas Koepf	b026774708	refactor: Update test cases to use 'solutions' instead of 'solution' in metadata	2025-02-02 22:15:47 +01:00
Andreas Köpf	abd41814e2	Merge pull request #46 from joesharratt1229/feat/integration_dataset Simple and intermediate integration problems dataset generators	2025-02-02 21:58:17 +01:00
Andreas Koepf	ccff85f81c	run scripts/generate_gallery.py	2025-02-02 21:56:17 +01:00
Andreas Köpf	3dd5a4df2e	Merge pull request #47 from zafstojano/feat/n-queens feat(env): N Queens	2025-02-02 21:54:02 +01:00
Cavit Erginsoy	372e778c26	improved word quality, removed extremely rare words	2025-02-02 19:24:53 +00:00
joesharratt1229	b0d21cf664	added score_answer implementation and tests	2025-02-02 17:18:56 +00:00
Andreas Köpf	c4c0897fe0	Merge pull request #48 from rishabhranawat/aiw README updates with some recently added datasets.	2025-02-02 17:28:41 +01:00
Andreas Koepf	1df952001e	update gallery SyllogismDataset	2025-02-02 17:28:01 +01:00
rishabhranawat	b69cb27f75	Merge branch 'aiw' of https://github.com/rishabhranawat/reasoning-gym into aiw	2025-02-02 08:26:30 -08:00
rishabhranawat	519999ff89	Update dataset list w/ some missing logic datasets	2025-02-02 08:26:05 -08:00
Andreas Koepf	5dd4c0e831	change parameter order for basic arc tasks	2025-02-02 17:25:37 +01:00
Andreas Koepf (aider)	56ded2c299	feat: Improve syllogism sentence formatting for natural language	2025-02-02 17:23:02 +01:00
Zafir Stojanovski	1912c571f9	cap N at 12	2025-02-02 16:52:36 +01:00
Zafir Stojanovski	c74b600085	n queens	2025-02-02 16:47:21 +01:00
Andreas Koepf (aider)	28c30c69d1	fix: Correct argument passing in ARC 1D task test lambda functions	2025-02-02 16:43:25 +01:00
Andreas Koepf (aider)	2d3012d5ae	fix: Update test_arc_1d.py to handle task function argument order	2025-02-02 16:42:46 +01:00

449 commits