Commit graph

150 commits

Author SHA1 Message Date
Cavit Erginsoy
6c564b3dd9 lint 2025-02-03 11:35:30 +00:00
Cavit Erginsoy
1e27021e11 Merge remote-tracking branch 'upstream/main' 2025-02-03 07:44:32 +00:00
Cavit Erginsoy
35515310f0 Comprehensive test suite for word ladder dataset generation
- Added extensive test coverage for word ladder configuration validation
- Implemented tests for neighbor computation, path finding, and graph caching
- Included performance and edge case tests
- Verified solution optimality and path generation logic
- Added tests for word set loading and pair generation
2025-02-03 07:22:03 +00:00
Cavit Erginsoy
8b8c514f9c update test to match 2025-02-03 03:27:49 +00:00
Andreas Koepf
8dc496bc35 add attribution for arc-1d and unit tests 2025-02-02 23:45:25 +01:00
Andreas Koepf (aider)
5cf57500d6 test: Add scoring tests for Arc1D dataset answer evaluation 2025-02-02 23:31:20 +01:00
Andreas Koepf
47fa699745 test: Remove test_arc_1d.py file from tests directory 2025-02-02 23:30:15 +01:00
Andreas Koepf
f8c7807892 add arc_1d dataset 2025-02-02 23:03:56 +01:00
Andreas Koepf (aider)
2120e4ed1b feat: Add missing task transformation imports to test_arc_1d.py 2025-02-02 22:42:43 +01:00
Andreas Koepf (aider)
017148d78d feat: Add task augmentation functions mirror, inverse, and identity to arc_1d.py 2025-02-02 22:42:21 +01:00
Andreas Koepf
db65e8451b auto-load simple/intermediate integration tasks, stable order for n_queens (set was not stable) 2025-02-02 22:18:54 +01:00
Andreas Koepf (aider)
31818d3e0b test: Add unit test for score_answer method in N-Queens dataset 2025-02-02 22:15:49 +01:00
Andreas Koepf
5282d9db31 refactor: Update test cases to use 'solutions' instead of 'solution' in metadata 2025-02-02 22:15:47 +01:00
Andreas Köpf
5889cac5ab Merge pull request #46 from joesharratt1229/feat/integration_dataset
Simple and intermediate integration problems dataset generators
2025-02-02 21:58:17 +01:00
Andreas Köpf
4c577f4051 Merge pull request #47 from zafstojano/feat/n-queens
feat(env): N Queens
2025-02-02 21:54:02 +01:00
joesharratt1229
7c02901ee3 added score_answer implementation and tests 2025-02-02 17:18:56 +00:00
Andreas Koepf
604db012c3 change parameter order for basic arc tasks 2025-02-02 17:25:37 +01:00
Zafir Stojanovski
1ca472dbd7 n queens 2025-02-02 16:47:21 +01:00
Andreas Koepf (aider)
dad72bc6d0 fix: Correct argument passing in ARC 1D task test lambda functions 2025-02-02 16:43:25 +01:00
Andreas Koepf (aider)
1da869862a fix: Update test_arc_1d.py to handle task function argument order 2025-02-02 16:42:46 +01:00
Andreas Koepf (aider)
6e9f0879ac fix: Remove redundant parameters in ARC 1D task test suite 2025-02-02 16:42:21 +01:00
Andreas Koepf (aider)
6cef2589fe test: Add comprehensive unittest for arc_1d task functions 2025-02-02 16:40:39 +01:00
joesharratt1229
50c94ed244 Merge remote-tracking branch 'origin/main' into feat/integration_dataset 2025-02-02 15:31:07 +00:00
joesharratt1229
3f04139834 added implementation of simple integration dataset 2025-02-02 15:30:22 +00:00
joesharratt1229
2656b98a1d created test script for intermediate integration dataset generator 2025-02-02 15:30:01 +00:00
Andreas Koepf
94eeff3255 post merge formatting 2025-02-02 15:24:39 +01:00
benjamrio
7acd4cb1e5 added calendar-arithmetic tasks 2025-02-02 14:54:32 +01:00
Andreas Koepf
ccf282cc90 post merge lint 2025-02-02 10:04:18 +01:00
rishabhranawat
dd4772cd09 [aiw] remove output format enum 2025-02-01 16:31:45 -08:00
rishabhranawat
ad73861fac [aiw] remove output_formats style and change return type to a standard format 2025-02-01 16:30:05 -08:00
rishabhranawat
356756a92e Merge branch 'main' of https://github.com/rishabhranawat/reasoning-gym into aiw 2025-02-01 11:40:18 -08:00
rishabhranawat
60a5df0b2f [aiw] basic version of alice-in-wonderland procedural dataset 2025-02-01 11:37:50 -08:00
Andreas Koepf
770a79848d lint 2025-02-01 17:01:11 +01:00
Andreas Köpf
9a32b80fb9 Merge pull request #38 from Schmeitzke/main
Add Simple and Advanced Geometry Dataset Generators
2025-02-01 17:00:24 +01:00
Andreas Koepf
44f32e3862 Add time interval dataset class 2025-02-01 02:10:48 +01:00
Camiel Schmeitz
fcc8bba1df Merge branch 'open-thought:main' into main 2025-01-31 14:37:28 +01:00
Andreas Koepf
d4706c7128 lint 2025-01-31 12:16:08 +01:00
Andreas Koepf (aider)
bdde47eae1 fix: Correct base conversion test logic for non-standard bases 2025-01-31 12:10:09 +01:00
Andreas Koepf (aider)
aa39c6441a fix: Improve base conversion logic for non-standard bases 2025-01-31 12:09:32 +01:00
Andreas Köpf
e40581a955 Merge pull request #33 from joenorton/tower_of_hanoi
adds Tower of Hanoi
2025-01-31 11:22:06 +01:00
Camiel Schmeitz
ae02c3ab82 Merge branch 'open-thought:main' into main 2025-01-31 11:06:33 +01:00
Joe Norton
31fd2c20d3 linter 2025-01-31 00:05:33 -08:00
joesharratt1229
ca3a841bf6 added linting checks 2025-01-31 07:19:55 +00:00
Joe Norton
b3cd02f853 adds Tower of Hanoi
creates game file & test file, modifies games init to add toh
2025-01-30 23:16:06 -08:00
joesharratt1229
d7ebe409d8 added testing of score answer method 2025-01-31 06:46:18 +00:00
Andreas Koepf
bf62f631dd lint 2025-01-30 23:14:32 +01:00
Andreas Köpf
6117162bad Merge pull request #31 from cavit99/main
feat: Add Word Ladder dataset generator
2025-01-30 23:11:58 +01:00
Andreas Koepf
25540b6634 lint 2025-01-30 22:55:04 +01:00
Andreas Köpf
d9d1f1b2c9 Merge pull request #30 from Miserlou/miserlou/gol
Add Conway's Game of Life Simulations
2025-01-30 22:47:29 +01:00
Andreas Köpf
fb8e0f21af Merge branch 'main' into miserlou/bfi 2025-01-30 22:45:01 +01:00