Andreas Köpf
052d76d2ca
Merge pull request #55 from Miserlou/rich/fflogic
...
Adds Zebra/Murdle/Einstein/Grid Style Puzzles
2025-02-03 17:52:22 +01:00
Rich Jones
4a19aa8f14
readme
2025-02-03 16:49:18 +01:00
Rich Jones
4d950e562a
cleanup
2025-02-03 16:47:29 +01:00
Andreas Köpf
c6fbff7d8f
Merge pull request #52 from cavit99/main
...
Improve Word Ladder and add complete example suite
2025-02-03 15:16:29 +01:00
Rich Jones
7274f79c50
precommit hook linting
2025-02-03 14:40:58 +01:00
Rich Jones
0c9094e9f4
adds zebrapuzzles
2025-02-03 14:34:57 +01:00
Andreas Koepf
5d84b6bec5
update GALLERY.md
2025-02-03 12:42:11 +01:00
Andreas Köpf
57c605002a
Merge pull request #50 from joenorton/toh-score-answer
...
feat: toh scoring
2025-02-03 12:39:45 +01:00
Cavit Erginsoy
aff0fecef4
lint
2025-02-03 11:35:30 +00:00
Andreas Köpf
00961bc72a
Merge pull request #39 from joenorton/palindrome_generation
...
feat: add palindrome_generation
2025-02-03 10:10:29 +01:00
Cavit Erginsoy
9b1068ea39
Merge remote-tracking branch 'upstream/main'
2025-02-03 07:44:32 +00:00
Cavit Erginsoy
15f5c8158b
Add word ladder dataset to GALLERY.md
...
- Documented word ladder dataset configuration and generation details
- Included three example tasks demonstrating word transformation scenarios
- Updated table of contents with new dataset entry
2025-02-03 07:25:11 +00:00
Cavit Erginsoy
4355d5a5fe
Comprehensive test suite for word ladder dataset generation
...
- Added extensive test coverage for word ladder configuration validation
- Implemented tests for neighbor computation, path finding, and graph caching
- Included performance and edge case tests
- Verified solution optimality and path generation logic
- Added tests for word set loading and pair generation
2025-02-03 07:22:03 +00:00
Cavit Erginsoy
d5065955a8
Refactor word ladder generation with improved validation and graph-based path finding
...
- Enhanced configuration validation with size and length constraints
- Implemented graph-based neighbor computation and caching
- Simplified path finding algorithm with more robust length checking
- Added more flexible word set loading with configurable length ranges
- Improved error handling for dataset generation
2025-02-03 07:21:43 +00:00
Cavit Erginsoy
7b61fc5043
Completed: full example suite
2025-02-03 07:21:12 +00:00
Cavit Erginsoy
08f300911f
add .DS_Store
2025-02-03 07:20:17 +00:00
Cavit Erginsoy
ade33e1a22
filtered out lesser known words to aid model learning ease
2025-02-03 07:19:30 +00:00
Cavit Erginsoy
c0a8a9e46f
update test to match
2025-02-03 03:27:49 +00:00
Joe Norton
731d36f43f
add palindrome score_answer
...
add palindrome score_answer & test
2025-02-02 18:04:47 -08:00
Joe Norton
9841f64ccd
add dependency
2025-02-02 16:46:07 -08:00
Joe Norton
8222823c28
add toh score_answer
2025-02-02 16:37:20 -08:00
Andreas Koepf
3aeec71523
add attribution for arc-1d and unit tests
2025-02-02 23:45:25 +01:00
Andreas Koepf (aider)
a9549057e9
test: Add scoring tests for Arc1D dataset answer evaluation
2025-02-02 23:31:20 +01:00
Andreas Koepf
b7532f66ca
test: Remove test_arc_1d.py file from tests directory
2025-02-02 23:30:15 +01:00
Andreas Koepf (aider)
978a0879f7
feat: Add mirrored and inverse task variations to ARC_1D_TASKS
2025-02-02 23:21:46 +01:00
Andreas Koepf
9a1270dd95
add arc_1d dataset
2025-02-02 23:03:56 +01:00
Andreas Koepf (aider)
a060348a9c
fix: Resolve undefined task function references in arc_1d.py
2025-02-02 22:49:28 +01:00
Andreas Koepf (aider)
b599d6e1a2
feat: Add Arc1D dataset with comprehensive task generation and configuration
2025-02-02 22:49:00 +01:00
Andreas Koepf (aider)
905ef7b89d
feat: Add missing task transformation imports to test_arc_1d.py
2025-02-02 22:42:43 +01:00
Andreas Koepf (aider)
84e4f1c5bc
feat: Add task augmentation functions mirror, inverse, and identity to arc_1d.py
2025-02-02 22:42:21 +01:00
Andreas Koepf
01cc239746
add quantum lock answer format hint
2025-02-02 22:35:43 +01:00
Andreas Koepf
82196bd2df
bump version to 0.1.3, uploaded to pypi
2025-02-02 22:26:24 +01:00
Andreas Koepf
057b9f2034
auto-load simple/intermediate integration tasks, stable order for n_queens (set was not stable)
2025-02-02 22:18:54 +01:00
Andreas Koepf (aider)
751773828f
test: Add unit test for score_answer method in N-Queens dataset
2025-02-02 22:15:49 +01:00
Andreas Koepf
b026774708
refactor: Update test cases to use 'solutions' instead of 'solution' in metadata
2025-02-02 22:15:47 +01:00
Andreas Köpf
abd41814e2
Merge pull request #46 from joesharratt1229/feat/integration_dataset
...
Simple and intermediate integration problems dataset generators
2025-02-02 21:58:17 +01:00
Andreas Koepf
ccff85f81c
run scripts/generate_gallery.py
2025-02-02 21:56:17 +01:00
Andreas Köpf
3dd5a4df2e
Merge pull request #47 from zafstojano/feat/n-queens
...
feat(env): N Queens
2025-02-02 21:54:02 +01:00
Cavit Erginsoy
372e778c26
improved word quality, removed extremly rares
2025-02-02 19:24:53 +00:00
joesharratt1229
b0d21cf664
added score_answer implementation and tests
2025-02-02 17:18:56 +00:00
Andreas Köpf
c4c0897fe0
Merge pull request #48 from rishabhranawat/aiw
...
README updates with some recently added datasets.
2025-02-02 17:28:41 +01:00
Andreas Koepf
1df952001e
update gallery SyllogismDataset
2025-02-02 17:28:01 +01:00
rishabhranawat
b69cb27f75
Merge branch 'aiw' of https://github.com/rishabhranawat/reasoning-gym into aiw
2025-02-02 08:26:30 -08:00
rishabhranawat
519999ff89
Update dataset list w/ some missing logic datasets
2025-02-02 08:26:05 -08:00
Andreas Koepf
5dd4c0e831
change parameter order for basic arc tasks
2025-02-02 17:25:37 +01:00
Andreas Koepf (aider)
56ded2c299
feat: Improve syllogism sentence formatting for natural language
2025-02-02 17:23:02 +01:00
Zafir Stojanovski
1912c571f9
cap N at 12
2025-02-02 16:52:36 +01:00
Zafir Stojanovski
c74b600085
n queens
2025-02-02 16:47:21 +01:00
Andreas Koepf (aider)
28c30c69d1
fix: Correct argument passing in ARC 1D task test lambda functions
2025-02-02 16:43:25 +01:00
Andreas Koepf (aider)
2d3012d5ae
fix: Update test_arc_1d.py to handle task function argument order
2025-02-02 16:42:46 +01:00