Andreas Koepf (aider)
|
a9549057e9
|
test: Add scoring tests for Arc1D dataset answer evaluation
|
2025-02-02 23:31:20 +01:00 |
|
Andreas Koepf
|
b7532f66ca
|
test: Remove test_arc_1d.py file from tests directory
|
2025-02-02 23:30:15 +01:00 |
|
Andreas Koepf (aider)
|
978a0879f7
|
feat: Add mirrored and inverse task variations to ARC_1D_TASKS
|
2025-02-02 23:21:46 +01:00 |
|
Andreas Koepf
|
9a1270dd95
|
add arc_1d dataset
|
2025-02-02 23:03:56 +01:00 |
|
Andreas Koepf (aider)
|
a060348a9c
|
fix: Resolve undefined task function references in arc_1d.py
|
2025-02-02 22:49:28 +01:00 |
|
Andreas Koepf (aider)
|
b599d6e1a2
|
feat: Add Arc1D dataset with comprehensive task generation and configuration
|
2025-02-02 22:49:00 +01:00 |
|
Andreas Koepf (aider)
|
905ef7b89d
|
feat: Add missing task transformation imports to test_arc_1d.py
|
2025-02-02 22:42:43 +01:00 |
|
Andreas Koepf (aider)
|
84e4f1c5bc
|
feat: Add task augmentation functions mirror, inverse, and identity to arc_1d.py
|
2025-02-02 22:42:21 +01:00 |
|
Andreas Koepf
|
01cc239746
|
add quantum lock answer format hint
|
2025-02-02 22:35:43 +01:00 |
|
Andreas Koepf
|
82196bd2df
|
bump version to 0.1.3, uploaded to pypi
|
2025-02-02 22:26:24 +01:00 |
|
Andreas Koepf
|
057b9f2034
|
auto-load simple/intermediate integration tasks, stable order for n_queens (set was not stable)
|
2025-02-02 22:18:54 +01:00 |
|
Andreas Koepf (aider)
|
751773828f
|
test: Add unit test for score_answer method in N-Queens dataset
|
2025-02-02 22:15:49 +01:00 |
|
Andreas Koepf
|
b026774708
|
refactor: Update test cases to use 'solutions' instead of 'solution' in metadata
|
2025-02-02 22:15:47 +01:00 |
|
Andreas Köpf
|
abd41814e2
|
Merge pull request #46 from joesharratt1229/feat/integration_dataset
Simple and intermediate integration problems dataset generators
|
2025-02-02 21:58:17 +01:00 |
|
Andreas Koepf
|
ccff85f81c
|
run scripts/generate_gallery.py
|
2025-02-02 21:56:17 +01:00 |
|
Andreas Köpf
|
3dd5a4df2e
|
Merge pull request #47 from zafstojano/feat/n-queens
feat(env): N Queens
|
2025-02-02 21:54:02 +01:00 |
|
Cavit Erginsoy
|
372e778c26
|
improved word quality, removed extremely rare words
|
2025-02-02 19:24:53 +00:00 |
|
joesharratt1229
|
b0d21cf664
|
added score_answer implementation and tests
|
2025-02-02 17:18:56 +00:00 |
|
Andreas Köpf
|
c4c0897fe0
|
Merge pull request #48 from rishabhranawat/aiw
README updates with some recently added datasets.
|
2025-02-02 17:28:41 +01:00 |
|
Andreas Koepf
|
1df952001e
|
update gallery SyllogismDataset
|
2025-02-02 17:28:01 +01:00 |
|
rishabhranawat
|
b69cb27f75
|
Merge branch 'aiw' of https://github.com/rishabhranawat/reasoning-gym into aiw
|
2025-02-02 08:26:30 -08:00 |
|
rishabhranawat
|
519999ff89
|
Update dataset list w/ some missing logic datasets
|
2025-02-02 08:26:05 -08:00 |
|
Andreas Koepf
|
5dd4c0e831
|
change parameter order for basic arc tasks
|
2025-02-02 17:25:37 +01:00 |
|
Andreas Koepf (aider)
|
56ded2c299
|
feat: Improve syllogism sentence formatting for natural language
|
2025-02-02 17:23:02 +01:00 |
|
Zafir Stojanovski
|
1912c571f9
|
cap N at 12
|
2025-02-02 16:52:36 +01:00 |
|
Zafir Stojanovski
|
c74b600085
|
n queens
|
2025-02-02 16:47:21 +01:00 |
|
Andreas Koepf (aider)
|
28c30c69d1
|
fix: Correct argument passing in ARC 1D task test lambda functions
|
2025-02-02 16:43:25 +01:00 |
|
Andreas Koepf (aider)
|
2d3012d5ae
|
fix: Update test_arc_1d.py to handle task function argument order
|
2025-02-02 16:42:46 +01:00 |
|
Andreas Koepf (aider)
|
d56e8c3a03
|
fix: Remove redundant parameters in ARC 1D task test suite
|
2025-02-02 16:42:21 +01:00 |
|
Andreas Koepf (aider)
|
f0ab9ec0d4
|
test: Add comprehensive unittest for arc_1d task functions
|
2025-02-02 16:40:39 +01:00 |
|
Andreas Koepf (aider)
|
da16467ca7
|
feat: Add five new 1D ARC task generation functions
|
2025-02-02 16:38:14 +01:00 |
|
Andreas Koepf (aider)
|
3714e6c5ff
|
feat: Add five new 1D ARC task generation functions
|
2025-02-02 16:37:14 +01:00 |
|
Andreas Koepf (aider)
|
dc11f88c0b
|
feat: Add new 1D ARC task generation functions for block manipulation
|
2025-02-02 16:36:19 +01:00 |
|
Andreas Koepf (aider)
|
9dac01fda7
|
feat: Add new 1D ARC task generation functions
|
2025-02-02 16:34:52 +01:00 |
|
Andreas Koepf (aider)
|
4c22fca7ed
|
feat: Add new 1D task generation functions to arc_1d.py
|
2025-02-02 16:33:02 +01:00 |
|
Andreas Koepf
|
166e3d5f0d
|
feat: Add arc_1d.py module for one-dimensional abstract reasoning challenges
|
2025-02-02 16:33:01 +01:00 |
|
joesharratt1229
|
f5838da534
|
Merge remote-tracking branch 'origin/main' into feat/integration_dataset
|
2025-02-02 15:31:07 +00:00 |
|
joesharratt1229
|
40e53b8bca
|
added implementation of simple integration dataset
|
2025-02-02 15:30:22 +00:00 |
|
joesharratt1229
|
76faad9dcf
|
created test script for intermediate integration dataset generator
|
2025-02-02 15:30:01 +00:00 |
|
joesharratt1229
|
420a44bd79
|
added impl of simple integration dataset generator
|
2025-02-02 15:29:24 +00:00 |
|
joesharratt1229
|
0eb0247ebd
|
added register dataset to script
|
2025-02-02 15:28:52 +00:00 |
|
joesharratt1229
|
8528e39764
|
added intermediate integration dataset generator
|
2025-02-02 15:27:08 +00:00 |
|
Andreas Koepf
|
8b0f634f4c
|
post merge formatting
|
2025-02-02 15:24:39 +01:00 |
|
Andreas Köpf
|
aa172a193b
|
Merge pull request #43 from open-thought/calendar-arithmetic
added calendar-arithmetic tasks
|
2025-02-02 15:22:50 +01:00 |
|
benjamrio
|
943651c15b
|
added calendar-arithmetic tasks
|
2025-02-02 14:54:32 +01:00 |
|
Andreas Koepf
|
f396d3df60
|
post merge lint
|
2025-02-02 10:04:18 +01:00 |
|
Andreas Köpf
|
02cfa9556a
|
Merge pull request #41 from rishabhranawat/aiw
Add Alice In Wonderland Problem Procedural Dataset
|
2025-02-02 10:00:04 +01:00 |
|
Andreas Koepf (aider)
|
4e9fc4baad
|
refactor: Use field default_factory TimeIntervalsConfig, AdvancedGeometryConfig
|
2025-02-02 09:55:51 +01:00 |
|
abdulhakeem
|
5d0ad82034
|
Add EOL to test_generator_files
|
2025-02-01 20:41:31 -06:00 |
|
abdulhakeem
|
715102c277
|
Remove .DS_Store
|
2025-02-01 20:39:37 -06:00 |
|