Commit graph

990 commits

Author SHA1 Message Date
Cavit Erginsoy
c0a16d7f2b add .DS_Store 2025-02-03 07:20:17 +00:00
Cavit Erginsoy
bb8313db6a filtered out lesser known words to aid model learning ease 2025-02-03 07:19:30 +00:00
Cavit Erginsoy
8b8c514f9c update test to match 2025-02-03 03:27:49 +00:00
Joe Norton
ff8f627f8d add palindrome score_answer (add palindrome score_answer & test) 2025-02-02 18:04:47 -08:00
Joe Norton
0d217ead76 add dependency 2025-02-02 16:46:07 -08:00
Joe Norton
b5b1ea1e31 add toh score_answer 2025-02-02 16:37:20 -08:00
Andreas Koepf
8dc496bc35 add attribution for arc-1d and unit tests 2025-02-02 23:45:25 +01:00
Andreas Koepf (aider)
5cf57500d6 test: Add scoring tests for Arc1D dataset answer evaluation 2025-02-02 23:31:20 +01:00
Andreas Koepf
47fa699745 test: Remove test_arc_1d.py file from tests directory 2025-02-02 23:30:15 +01:00
Andreas Koepf (aider)
2b978de850 feat: Add mirrored and inverse task variations to ARC_1D_TASKS 2025-02-02 23:21:46 +01:00
Andreas Koepf
f8c7807892 add arc_1d dataset 2025-02-02 23:03:56 +01:00
Andreas Koepf (aider)
52c86ed327 fix: Resolve undefined task function references in arc_1d.py 2025-02-02 22:49:28 +01:00
Andreas Koepf (aider)
67027b828f feat: Add Arc1D dataset with comprehensive task generation and configuration 2025-02-02 22:49:00 +01:00
Andreas Koepf (aider)
2120e4ed1b feat: Add missing task transformation imports to test_arc_1d.py 2025-02-02 22:42:43 +01:00
Andreas Koepf (aider)
017148d78d feat: Add task augmentation functions mirror, inverse, and identity to arc_1d.py 2025-02-02 22:42:21 +01:00
Andreas Koepf
6ecd25c283 add quantum lock answer format hint 2025-02-02 22:35:43 +01:00
Andreas Koepf
63a8c94d85 bump version to 0.1.3, uploaded to pypi 2025-02-02 22:26:24 +01:00
Andreas Koepf
db65e8451b auto-load simple/intermediate integration tasks, stable order for n_queens (set was not stable) 2025-02-02 22:18:54 +01:00
Andreas Koepf (aider)
31818d3e0b test: Add unit test for score_answer method in N-Queens dataset 2025-02-02 22:15:49 +01:00
Andreas Koepf
5282d9db31 refactor: Update test cases to use 'solutions' instead of 'solution' in metadata 2025-02-02 22:15:47 +01:00
Andreas Köpf
5889cac5ab Merge pull request #46 from joesharratt1229/feat/integration_dataset: Simple and intermediate integration problems dataset generators 2025-02-02 21:58:17 +01:00
Andreas Koepf
d1c198344a run scripts/generate_gallery.py 2025-02-02 21:56:17 +01:00
Andreas Köpf
4c577f4051 Merge pull request #47 from zafstojano/feat/n-queens: feat(env): N Queens 2025-02-02 21:54:02 +01:00
Cavit Erginsoy
68847000d0 improved word quality, removed extremly rares 2025-02-02 19:24:53 +00:00
joesharratt1229
7c02901ee3 added score_answer implementation and tests 2025-02-02 17:18:56 +00:00
Andreas Köpf
92c4738967 Merge pull request #48 from rishabhranawat/aiw: README updates with some recently added datasets. 2025-02-02 17:28:41 +01:00
Andreas Koepf
bf9f170b69 update gallery SyllogismDataset 2025-02-02 17:28:01 +01:00
rishabhranawat
1fc0a7d0ef Merge branch 'aiw' of https://github.com/rishabhranawat/reasoning-gym into aiw 2025-02-02 08:26:30 -08:00
rishabhranawat
7145d303ae Update dataset list w/ some missing logic datasets 2025-02-02 08:26:05 -08:00
Andreas Koepf
604db012c3 change parameter order for basic arc tasks 2025-02-02 17:25:37 +01:00
Andreas Koepf (aider)
2409b3cda2 feat: Improve syllogism sentence formatting for natural language 2025-02-02 17:23:02 +01:00
Zafir Stojanovski
26f72f481c cap N at 12 2025-02-02 16:52:36 +01:00
Zafir Stojanovski
1ca472dbd7 n queens 2025-02-02 16:47:21 +01:00
Andreas Koepf (aider)
dad72bc6d0 fix: Correct argument passing in ARC 1D task test lambda functions 2025-02-02 16:43:25 +01:00
Andreas Koepf (aider)
1da869862a fix: Update test_arc_1d.py to handle task function argument order 2025-02-02 16:42:46 +01:00
Andreas Koepf (aider)
6e9f0879ac fix: Remove redundant parameters in ARC 1D task test suite 2025-02-02 16:42:21 +01:00
Andreas Koepf (aider)
6cef2589fe test: Add comprehensive unittest for arc_1d task functions 2025-02-02 16:40:39 +01:00
Andreas Koepf (aider)
845a80711f feat: Add five new 1D ARC task generation functions 2025-02-02 16:38:14 +01:00
Andreas Koepf (aider)
e55266c5e6 feat: Add five new 1D ARC task generation functions 2025-02-02 16:37:14 +01:00
Andreas Koepf (aider)
5b4998049a feat: Add new 1D ARC task generation functions for block manipulation 2025-02-02 16:36:19 +01:00
Andreas Koepf (aider)
c3a527aed3 feat: Add new 1D ARC task generation functions 2025-02-02 16:34:52 +01:00
Andreas Koepf (aider)
185c5d7504 feat: Add new 1D task generation functions to arc_1d.py 2025-02-02 16:33:02 +01:00
Andreas Koepf
8a153d9857 feat: Add arc_1d.py module for one-dimensional abstract reasoning challenges 2025-02-02 16:33:01 +01:00
joesharratt1229
50c94ed244 Merge remote-tracking branch 'origin/main' into feat/integration_dataset 2025-02-02 15:31:07 +00:00
joesharratt1229
3f04139834 added implementation of simple integration dataset 2025-02-02 15:30:22 +00:00
joesharratt1229
2656b98a1d created test script for intermediate integration dataset generator 2025-02-02 15:30:01 +00:00
joesharratt1229
b1fa387e5d added impl of simple integration dataset generator 2025-02-02 15:29:24 +00:00
joesharratt1229
a9ea2df7a0 added register dataset to script 2025-02-02 15:28:52 +00:00
joesharratt1229
ed1492ba05 added intermediate integration dataset generator 2025-02-02 15:27:08 +00:00
Andreas Koepf
94eeff3255 post merge formatting 2025-02-02 15:24:39 +01:00