Commit graph

449 commits

Author SHA1 Message Date
Andreas Köpf
052d76d2ca
Merge pull request #55 from Miserlou/rich/fflogic
Adds Zebra/Murdle/Einstein/Grid Style Puzzles
2025-02-03 17:52:22 +01:00
Rich Jones
4a19aa8f14 readme 2025-02-03 16:49:18 +01:00
Rich Jones
4d950e562a cleanup 2025-02-03 16:47:29 +01:00
Andreas Köpf
c6fbff7d8f
Merge pull request #52 from cavit99/main
Improve Word Ladder and add complete example suite
2025-02-03 15:16:29 +01:00
Rich Jones
7274f79c50 precommit hook linting 2025-02-03 14:40:58 +01:00
Rich Jones
0c9094e9f4 adds zebrapuzzles 2025-02-03 14:34:57 +01:00
Andreas Koepf
5d84b6bec5 update GALLERY.md 2025-02-03 12:42:11 +01:00
Andreas Köpf
57c605002a
Merge pull request #50 from joenorton/toh-score-answer
feat: toh scoring
2025-02-03 12:39:45 +01:00
Cavit Erginsoy
aff0fecef4 lint 2025-02-03 11:35:30 +00:00
Andreas Köpf
00961bc72a
Merge pull request #39 from joenorton/palindrome_generation
feat: add palindrome_generation
2025-02-03 10:10:29 +01:00
Cavit Erginsoy
9b1068ea39 Merge remote-tracking branch 'upstream/main' 2025-02-03 07:44:32 +00:00
Cavit Erginsoy
15f5c8158b Add word ladder dataset to GALLERY.md
- Documented word ladder dataset configuration and generation details
- Included three example tasks demonstrating word transformation scenarios
- Updated table of contents with new dataset entry
2025-02-03 07:25:11 +00:00
Cavit Erginsoy
4355d5a5fe Comprehensive test suite for word ladder dataset generation
- Added extensive test coverage for word ladder configuration validation
- Implemented tests for neighbor computation, path finding, and graph caching
- Included performance and edge case tests
- Verified solution optimality and path generation logic
- Added tests for word set loading and pair generation
2025-02-03 07:22:03 +00:00
Cavit Erginsoy
d5065955a8 Refactor word ladder generation with improved validation and graph-based path finding
- Enhanced configuration validation with size and length constraints
- Implemented graph-based neighbor computation and caching
- Simplified path finding algorithm with more robust length checking
- Added more flexible word set loading with configurable length ranges
- Improved error handling for dataset generation
2025-02-03 07:21:43 +00:00
Cavit Erginsoy
7b61fc5043 Completed: full example suite 2025-02-03 07:21:12 +00:00
Cavit Erginsoy
08f300911f add .DS_Store 2025-02-03 07:20:17 +00:00
Cavit Erginsoy
ade33e1a22 filtered out lesser known words to aid model learning ease 2025-02-03 07:19:30 +00:00
Cavit Erginsoy
c0a8a9e46f update test to match 2025-02-03 03:27:49 +00:00
Joe Norton
731d36f43f add palindrome score_answer
add palindrome score_answer & test
2025-02-02 18:04:47 -08:00
Joe Norton
9841f64ccd add dependency 2025-02-02 16:46:07 -08:00
Joe Norton
8222823c28 add toh score_answer 2025-02-02 16:37:20 -08:00
Andreas Koepf
3aeec71523 add attribution for arc-1d and unit tests 2025-02-02 23:45:25 +01:00
Andreas Koepf (aider)
a9549057e9 test: Add scoring tests for Arc1D dataset answer evaluation 2025-02-02 23:31:20 +01:00
Andreas Koepf
b7532f66ca test: Remove test_arc_1d.py file from tests directory 2025-02-02 23:30:15 +01:00
Andreas Koepf (aider)
978a0879f7 feat: Add mirrored and inverse task variations to ARC_1D_TASKS 2025-02-02 23:21:46 +01:00
Andreas Koepf
9a1270dd95 add arc_1d dataset 2025-02-02 23:03:56 +01:00
Andreas Koepf (aider)
a060348a9c fix: Resolve undefined task function references in arc_1d.py 2025-02-02 22:49:28 +01:00
Andreas Koepf (aider)
b599d6e1a2 feat: Add Arc1D dataset with comprehensive task generation and configuration 2025-02-02 22:49:00 +01:00
Andreas Koepf (aider)
905ef7b89d feat: Add missing task transformation imports to test_arc_1d.py 2025-02-02 22:42:43 +01:00
Andreas Koepf (aider)
84e4f1c5bc feat: Add task augmentation functions mirror, inverse, and identity to arc_1d.py 2025-02-02 22:42:21 +01:00
Andreas Koepf
01cc239746 add quantum lock answer format hint 2025-02-02 22:35:43 +01:00
Andreas Koepf
82196bd2df bump version to 0.1.3, uploaded to pypi 2025-02-02 22:26:24 +01:00
Andreas Koepf
057b9f2034 auto-load simple/intermediate integration tasks, stable order for n_queens (set was not stable) 2025-02-02 22:18:54 +01:00
Andreas Koepf (aider)
751773828f test: Add unit test for score_answer method in N-Queens dataset 2025-02-02 22:15:49 +01:00
Andreas Koepf
b026774708 refactor: Update test cases to use 'solutions' instead of 'solution' in metadata 2025-02-02 22:15:47 +01:00
Andreas Köpf
abd41814e2
Merge pull request #46 from joesharratt1229/feat/integration_dataset
Simple and intermediate integration problems dataset generators
2025-02-02 21:58:17 +01:00
Andreas Koepf
ccff85f81c run scripts/generate_gallery.py 2025-02-02 21:56:17 +01:00
Andreas Köpf
3dd5a4df2e
Merge pull request #47 from zafstojano/feat/n-queens
feat(env): N Queens
2025-02-02 21:54:02 +01:00
Cavit Erginsoy
372e778c26 improved word quality, removed extremly rares 2025-02-02 19:24:53 +00:00
joesharratt1229
b0d21cf664 added score_answer implementation and tests 2025-02-02 17:18:56 +00:00
Andreas Köpf
c4c0897fe0
Merge pull request #48 from rishabhranawat/aiw
README updates with some recently added datasets.
2025-02-02 17:28:41 +01:00
Andreas Koepf
1df952001e update gallery SyllogismDataset 2025-02-02 17:28:01 +01:00
rishabhranawat
b69cb27f75 Merge branch 'aiw' of https://github.com/rishabhranawat/reasoning-gym into aiw 2025-02-02 08:26:30 -08:00
rishabhranawat
519999ff89 Update dataset list w/ some missing logic datasets 2025-02-02 08:26:05 -08:00
Andreas Koepf
5dd4c0e831 change parameter order for basic arc tasks 2025-02-02 17:25:37 +01:00
Andreas Koepf (aider)
56ded2c299 feat: Improve syllogism sentence formatting for natural language 2025-02-02 17:23:02 +01:00
Zafir Stojanovski
1912c571f9 cap N at 12 2025-02-02 16:52:36 +01:00
Zafir Stojanovski
c74b600085 n queens 2025-02-02 16:47:21 +01:00
Andreas Koepf (aider)
28c30c69d1 fix: Correct argument passing in ARC 1D task test lambda functions 2025-02-02 16:43:25 +01:00
Andreas Koepf (aider)
2d3012d5ae fix: Update test_arc_1d.py to handle task function argument order 2025-02-02 16:42:46 +01:00