Commit graph

434 commits

Author SHA1 Message Date
Rich Jones
4d950e562a cleanup 2025-02-03 16:47:29 +01:00
Rich Jones
7274f79c50 precommit hook linting 2025-02-03 14:40:58 +01:00
Rich Jones
0c9094e9f4 adds zebrapuzzles 2025-02-03 14:34:57 +01:00
Andreas Koepf
5d84b6bec5 update GALLERY.md 2025-02-03 12:42:11 +01:00
Andreas Köpf
57c605002a
Merge pull request #50 from joenorton/toh-score-answer
feat: toh scoring
2025-02-03 12:39:45 +01:00
Andreas Köpf
00961bc72a
Merge pull request #39 from joenorton/palindrome_generation
feat: add palindrome_generation
2025-02-03 10:10:29 +01:00
Joe Norton
731d36f43f add palindrome score_answer
add palindrome score_answer & test
2025-02-02 18:04:47 -08:00
Joe Norton
9841f64ccd add dependency 2025-02-02 16:46:07 -08:00
Joe Norton
8222823c28 add toh score_answer 2025-02-02 16:37:20 -08:00
Andreas Koepf
3aeec71523 add attribution for arc-1d and unit tests 2025-02-02 23:45:25 +01:00
Andreas Koepf (aider)
a9549057e9 test: Add scoring tests for Arc1D dataset answer evaluation 2025-02-02 23:31:20 +01:00
Andreas Koepf
b7532f66ca test: Remove test_arc_1d.py file from tests directory 2025-02-02 23:30:15 +01:00
Andreas Koepf (aider)
978a0879f7 feat: Add mirrored and inverse task variations to ARC_1D_TASKS 2025-02-02 23:21:46 +01:00
Andreas Koepf
9a1270dd95 add arc_1d dataset 2025-02-02 23:03:56 +01:00
Andreas Koepf (aider)
a060348a9c fix: Resolve undefined task function references in arc_1d.py 2025-02-02 22:49:28 +01:00
Andreas Koepf (aider)
b599d6e1a2 feat: Add Arc1D dataset with comprehensive task generation and configuration 2025-02-02 22:49:00 +01:00
Andreas Koepf (aider)
905ef7b89d feat: Add missing task transformation imports to test_arc_1d.py 2025-02-02 22:42:43 +01:00
Andreas Koepf (aider)
84e4f1c5bc feat: Add task augmentation functions mirror, inverse, and identity to arc_1d.py 2025-02-02 22:42:21 +01:00
Andreas Koepf
01cc239746 add quantum lock answer format hint 2025-02-02 22:35:43 +01:00
Andreas Koepf
82196bd2df bump version to 0.1.3, uploaded to pypi 2025-02-02 22:26:24 +01:00
Andreas Koepf
057b9f2034 auto-load simple/intermediate integration tasks, stable order for n_queens (set was not stable) 2025-02-02 22:18:54 +01:00
Andreas Koepf (aider)
751773828f test: Add unit test for score_answer method in N-Queens dataset 2025-02-02 22:15:49 +01:00
Andreas Koepf
b026774708 refactor: Update test cases to use 'solutions' instead of 'solution' in metadata 2025-02-02 22:15:47 +01:00
Andreas Köpf
abd41814e2
Merge pull request #46 from joesharratt1229/feat/integration_dataset
Simple and intermediate integration problems dataset generators
2025-02-02 21:58:17 +01:00
Andreas Koepf
ccff85f81c run scripts/generate_gallery.py 2025-02-02 21:56:17 +01:00
Andreas Köpf
3dd5a4df2e
Merge pull request #47 from zafstojano/feat/n-queens
feat(env): N Queens
2025-02-02 21:54:02 +01:00
joesharratt1229
b0d21cf664 added score_answer implementation and tests 2025-02-02 17:18:56 +00:00
Andreas Köpf
c4c0897fe0
Merge pull request #48 from rishabhranawat/aiw
README updates with some recently added datasets.
2025-02-02 17:28:41 +01:00
Andreas Koepf
1df952001e update gallery SyllogismDataset 2025-02-02 17:28:01 +01:00
rishabhranawat
b69cb27f75 Merge branch 'aiw' of https://github.com/rishabhranawat/reasoning-gym into aiw 2025-02-02 08:26:30 -08:00
rishabhranawat
519999ff89 Update dataset list w/ some missing logic datasets 2025-02-02 08:26:05 -08:00
Andreas Koepf
5dd4c0e831 change parameter order for basic arc tasks 2025-02-02 17:25:37 +01:00
Andreas Koepf (aider)
56ded2c299 feat: Improve syllogism sentence formatting for natural language 2025-02-02 17:23:02 +01:00
Zafir Stojanovski
1912c571f9 cap N at 12 2025-02-02 16:52:36 +01:00
Zafir Stojanovski
c74b600085 n queens 2025-02-02 16:47:21 +01:00
Andreas Koepf (aider)
28c30c69d1 fix: Correct argument passing in ARC 1D task test lambda functions 2025-02-02 16:43:25 +01:00
Andreas Koepf (aider)
2d3012d5ae fix: Update test_arc_1d.py to handle task function argument order 2025-02-02 16:42:46 +01:00
Andreas Koepf (aider)
d56e8c3a03 fix: Remove redundant parameters in ARC 1D task test suite 2025-02-02 16:42:21 +01:00
Andreas Koepf (aider)
f0ab9ec0d4 test: Add comprehensive unittest for arc_1d task functions 2025-02-02 16:40:39 +01:00
Andreas Koepf (aider)
da16467ca7 feat: Add five new 1D ARC task generation functions 2025-02-02 16:38:14 +01:00
Andreas Koepf (aider)
3714e6c5ff feat: Add five new 1D ARC task generation functions 2025-02-02 16:37:14 +01:00
Andreas Koepf (aider)
dc11f88c0b feat: Add new 1D ARC task generation functions for block manipulation 2025-02-02 16:36:19 +01:00
Andreas Koepf (aider)
9dac01fda7 feat: Add new 1D ARC task generation functions 2025-02-02 16:34:52 +01:00
Andreas Koepf (aider)
4c22fca7ed feat: Add new 1D task generation functions to arc_1d.py 2025-02-02 16:33:02 +01:00
Andreas Koepf
166e3d5f0d feat: Add arc_1d.py module for one-dimensional abstract reasoning challenges 2025-02-02 16:33:01 +01:00
joesharratt1229
f5838da534 Merge remote-tracking branch 'origin/main' into feat/integration_dataset 2025-02-02 15:31:07 +00:00
joesharratt1229
40e53b8bca added implementation of simple integration dataset 2025-02-02 15:30:22 +00:00
joesharratt1229
76faad9dcf created test script for intermediate integration dataset generator 2025-02-02 15:30:01 +00:00
joesharratt1229
420a44bd79 added impl of simple integration dataset generator 2025-02-02 15:29:24 +00:00
joesharratt1229
0eb0247ebd added register dataset to script 2025-02-02 15:28:52 +00:00