Commit graph

533 commits

Author SHA1 Message Date
Andreas Koepf (aider)
a9549057e9 test: Add scoring tests for Arc1D dataset answer evaluation 2025-02-02 23:31:20 +01:00
Andreas Koepf
b7532f66ca test: Remove test_arc_1d.py file from tests directory 2025-02-02 23:30:15 +01:00
Andreas Koepf (aider)
978a0879f7 feat: Add mirrored and inverse task variations to ARC_1D_TASKS 2025-02-02 23:21:46 +01:00
Andreas Koepf
9a1270dd95 add arc_1d dataset 2025-02-02 23:03:56 +01:00
Andreas Koepf (aider)
a060348a9c fix: Resolve undefined task function references in arc_1d.py 2025-02-02 22:49:28 +01:00
Andreas Koepf (aider)
b599d6e1a2 feat: Add Arc1D dataset with comprehensive task generation and configuration 2025-02-02 22:49:00 +01:00
Andreas Koepf (aider)
905ef7b89d feat: Add missing task transformation imports to test_arc_1d.py 2025-02-02 22:42:43 +01:00
Andreas Koepf (aider)
84e4f1c5bc feat: Add task augmentation functions mirror, inverse, and identity to arc_1d.py 2025-02-02 22:42:21 +01:00
Andreas Koepf
01cc239746 add quantum lock answer format hint 2025-02-02 22:35:43 +01:00
Andreas Koepf
82196bd2df bump version to 0.1.3, uploaded to pypi 2025-02-02 22:26:24 +01:00
Andreas Koepf
057b9f2034 auto-load simple/intermediate integration tasks, stable order for n_queens (set was not stable) 2025-02-02 22:18:54 +01:00
Andreas Koepf (aider)
751773828f test: Add unit test for score_answer method in N-Queens dataset 2025-02-02 22:15:49 +01:00
Andreas Koepf
b026774708 refactor: Update test cases to use 'solutions' instead of 'solution' in metadata 2025-02-02 22:15:47 +01:00
Andreas Köpf
abd41814e2 Merge pull request #46 from joesharratt1229/feat/integration_dataset: Simple and intermediate integration problems dataset generators 2025-02-02 21:58:17 +01:00
Andreas Koepf
ccff85f81c run scripts/generate_gallery.py 2025-02-02 21:56:17 +01:00
Andreas Köpf
3dd5a4df2e Merge pull request #47 from zafstojano/feat/n-queens: feat(env): N Queens 2025-02-02 21:54:02 +01:00
Cavit Erginsoy
372e778c26 improved word quality, removed extremely rares 2025-02-02 19:24:53 +00:00
joesharratt1229
b0d21cf664 added score_answer implementation and tests 2025-02-02 17:18:56 +00:00
Andreas Köpf
c4c0897fe0 Merge pull request #48 from rishabhranawat/aiw: README updates with some recently added datasets. 2025-02-02 17:28:41 +01:00
Andreas Koepf
1df952001e update gallery SyllogismDataset 2025-02-02 17:28:01 +01:00
rishabhranawat
b69cb27f75 Merge branch 'aiw' of https://github.com/rishabhranawat/reasoning-gym into aiw 2025-02-02 08:26:30 -08:00
rishabhranawat
519999ff89 Update dataset list w/ some missing logic datasets 2025-02-02 08:26:05 -08:00
Andreas Koepf
5dd4c0e831 change parameter order for basic arc tasks 2025-02-02 17:25:37 +01:00
Andreas Koepf (aider)
56ded2c299 feat: Improve syllogism sentence formatting for natural language 2025-02-02 17:23:02 +01:00
Zafir Stojanovski
1912c571f9 cap N at 12 2025-02-02 16:52:36 +01:00
Zafir Stojanovski
c74b600085 n queens 2025-02-02 16:47:21 +01:00
Andreas Koepf (aider)
28c30c69d1 fix: Correct argument passing in ARC 1D task test lambda functions 2025-02-02 16:43:25 +01:00
Andreas Koepf (aider)
2d3012d5ae fix: Update test_arc_1d.py to handle task function argument order 2025-02-02 16:42:46 +01:00
Andreas Koepf (aider)
d56e8c3a03 fix: Remove redundant parameters in ARC 1D task test suite 2025-02-02 16:42:21 +01:00
Andreas Koepf (aider)
f0ab9ec0d4 test: Add comprehensive unittest for arc_1d task functions 2025-02-02 16:40:39 +01:00
Andreas Koepf (aider)
da16467ca7 feat: Add five new 1D ARC task generation functions 2025-02-02 16:38:14 +01:00
Andreas Koepf (aider)
3714e6c5ff feat: Add five new 1D ARC task generation functions 2025-02-02 16:37:14 +01:00
Andreas Koepf (aider)
dc11f88c0b feat: Add new 1D ARC task generation functions for block manipulation 2025-02-02 16:36:19 +01:00
Andreas Koepf (aider)
9dac01fda7 feat: Add new 1D ARC task generation functions 2025-02-02 16:34:52 +01:00
Andreas Koepf (aider)
4c22fca7ed feat: Add new 1D task generation functions to arc_1d.py 2025-02-02 16:33:02 +01:00
Andreas Koepf
166e3d5f0d feat: Add arc_1d.py module for one-dimensional abstract reasoning challenges 2025-02-02 16:33:01 +01:00
joesharratt1229
f5838da534 Merge remote-tracking branch 'origin/main' into feat/integration_dataset 2025-02-02 15:31:07 +00:00
joesharratt1229
40e53b8bca added implementation of simple integration dataset 2025-02-02 15:30:22 +00:00
joesharratt1229
76faad9dcf created test script for intermediate integration dataset generator 2025-02-02 15:30:01 +00:00
joesharratt1229
420a44bd79 added impl of simple integration dataset generator 2025-02-02 15:29:24 +00:00
joesharratt1229
0eb0247ebd added register dataset to script 2025-02-02 15:28:52 +00:00
joesharratt1229
8528e39764 added intermediate integration dataset generator 2025-02-02 15:27:08 +00:00
Andreas Koepf
8b0f634f4c post merge formatting 2025-02-02 15:24:39 +01:00
Andreas Köpf
aa172a193b Merge pull request #43 from open-thought/calendar-arithmetic: added calendar-arithmetic tasks 2025-02-02 15:22:50 +01:00
benjamrio
943651c15b added calendar-arithmetic tasks 2025-02-02 14:54:32 +01:00
Andreas Koepf
f396d3df60 post merge lint 2025-02-02 10:04:18 +01:00
Andreas Köpf
02cfa9556a Merge pull request #41 from rishabhranawat/aiw: Add Alice In Wonderland Problem Procedural Dataset 2025-02-02 10:00:04 +01:00
Andreas Koepf (aider)
4e9fc4baad refactor: Use field default_factory TimeIntervalsConfig, AdvancedGeometryConfig 2025-02-02 09:55:51 +01:00
abdulhakeem
5d0ad82034 Add EOL to test_generator_files 2025-02-01 20:41:31 -06:00
abdulhakeem
715102c277 Remove .DS_Store 2025-02-01 20:39:37 -06:00