Commit graph

172 commits

Author SHA1 Message Date
Andreas Köpf
529f83f522
Merge pull request #67 from idigitopia/add-complex-number-arithmetic
feat: Add Complex Arithmetic Dataset and Tests
2025-02-06 07:59:11 +01:00
Aayam
9280d22b83 Apply pre-commit fixes 2025-02-05 22:53:36 -08:00
Andreas Köpf
aa024ce5b9
Merge pull request #63 from open-thought/gsm_symbolic_tests
Gsm symbolic fixes
2025-02-05 21:15:35 +01:00
Andreas Koepf
afb95508ef gsm_symbolic generator changes 2025-02-05 20:58:01 +01:00
Andreas Köpf
c4c0b3b2d8
Merge pull request #61 from open-thought/composite_dataset
Add composite dataset
2025-02-05 19:05:31 +01:00
Aayam
5be79bcb1b feat: Add Complex Arithmetic Dataset and Tests
This commit introduces a new dataset for complex number arithmetic operations:

- Implements ComplexArithmeticDataset for generating complex number problems
- Supports addition, subtraction, multiplication, and division operations

Part of the algebra tasks collection in reasoning-gym.
2025-02-05 08:53:06 -08:00
Zafir Stojanovski
eee0b36983 course schedule 2025-02-04 23:50:24 +01:00
Andreas Koepf
48999261dd register composite dataset 2025-02-04 19:17:34 +01:00
Andreas Koepf (aider)
c2e77f92aa Based on the implementation and requirements, here's a concise commit message:
feat: Add CompositeDataset for weighted multi-dataset sampling
2025-02-04 19:06:13 +01:00
Andreas Köpf
0cbd376dc1
Merge pull request #57 from zafstojano/env/largest-island
Find Largest Island (BFS)
2025-02-04 00:20:06 +01:00
Andreas Koepf (aider)
d0760926d0 test: Add deterministic test for ZebraDataset generation 2025-02-03 22:59:23 +01:00
Zafir Stojanovski
083c8436db added largest island code 2025-02-03 22:46:06 +01:00
Andreas Köpf
052d76d2ca
Merge pull request #55 from Miserlou/rich/fflogic
Adds Zebra/Murdle/Einstein/Grid Style Puzzles
2025-02-03 17:52:22 +01:00
Rich Jones
4d950e562a cleanup 2025-02-03 16:47:29 +01:00
Andreas Köpf
c6fbff7d8f
Merge pull request #52 from cavit99/main
Improve Word Ladder and add complete example suite
2025-02-03 15:16:29 +01:00
Rich Jones
0c9094e9f4 adds zebrapuzzles 2025-02-03 14:34:57 +01:00
Andreas Köpf
57c605002a
Merge pull request #50 from joenorton/toh-score-answer
feat: toh scoring
2025-02-03 12:39:45 +01:00
Cavit Erginsoy
aff0fecef4 lint 2025-02-03 11:35:30 +00:00
Andreas Köpf
00961bc72a
Merge pull request #39 from joenorton/palindrome_generation
feat: add palindrome_generation
2025-02-03 10:10:29 +01:00
Cavit Erginsoy
9b1068ea39 Merge remote-tracking branch 'upstream/main' 2025-02-03 07:44:32 +00:00
Cavit Erginsoy
4355d5a5fe Comprehensive test suite for word ladder dataset generation
- Added extensive test coverage for word ladder configuration validation
- Implemented tests for neighbor computation, path finding, and graph caching
- Included performance and edge case tests
- Verified solution optimality and path generation logic
- Added tests for word set loading and pair generation
2025-02-03 07:22:03 +00:00
Cavit Erginsoy
c0a8a9e46f update test to match 2025-02-03 03:27:49 +00:00
Joe Norton
731d36f43f add palindrome score_answer
add palindrome score_answer & test
2025-02-02 18:04:47 -08:00
Joe Norton
8222823c28 add toh score_answer 2025-02-02 16:37:20 -08:00
Andreas Koepf
3aeec71523 add attribution for arc-1d and unit tests 2025-02-02 23:45:25 +01:00
Andreas Koepf (aider)
a9549057e9 test: Add scoring tests for Arc1D dataset answer evaluation 2025-02-02 23:31:20 +01:00
Andreas Koepf
b7532f66ca test: Remove test_arc_1d.py file from tests directory 2025-02-02 23:30:15 +01:00
Andreas Koepf
9a1270dd95 add arc_1d dataset 2025-02-02 23:03:56 +01:00
Andreas Koepf (aider)
905ef7b89d feat: Add missing task transformation imports to test_arc_1d.py 2025-02-02 22:42:43 +01:00
Andreas Koepf (aider)
84e4f1c5bc feat: Add task augmentation functions mirror, inverse, and identity to arc_1d.py 2025-02-02 22:42:21 +01:00
Andreas Koepf
057b9f2034 auto-load simple/intermediate integration tasks, stable order for n_queens (set was not stable) 2025-02-02 22:18:54 +01:00
Andreas Koepf (aider)
751773828f test: Add unit test for score_answer method in N-Queens dataset 2025-02-02 22:15:49 +01:00
Andreas Koepf
b026774708 refactor: Update test cases to use 'solutions' instead of 'solution' in metadata 2025-02-02 22:15:47 +01:00
Andreas Köpf
abd41814e2
Merge pull request #46 from joesharratt1229/feat/integration_dataset
Simple and intermediate integration problems dataset generators
2025-02-02 21:58:17 +01:00
Andreas Köpf
3dd5a4df2e
Merge pull request #47 from zafstojano/feat/n-queens
feat(env): N Queens
2025-02-02 21:54:02 +01:00
joesharratt1229
b0d21cf664 added score_answer implementation and tests 2025-02-02 17:18:56 +00:00
Andreas Koepf
5dd4c0e831 change parameter order for basic arc tasks 2025-02-02 17:25:37 +01:00
Zafir Stojanovski
c74b600085 n queens 2025-02-02 16:47:21 +01:00
Andreas Koepf (aider)
28c30c69d1 fix: Correct argument passing in ARC 1D task test lambda functions 2025-02-02 16:43:25 +01:00
Andreas Koepf (aider)
2d3012d5ae fix: Update test_arc_1d.py to handle task function argument order 2025-02-02 16:42:46 +01:00
Andreas Koepf (aider)
d56e8c3a03 fix: Remove redundant parameters in ARC 1D task test suite 2025-02-02 16:42:21 +01:00
Andreas Koepf (aider)
f0ab9ec0d4 test: Add comprehensive unittest for arc_1d task functions 2025-02-02 16:40:39 +01:00
joesharratt1229
f5838da534 Merge remote-tracking branch 'origin/main' into feat/integration_dataset 2025-02-02 15:31:07 +00:00
joesharratt1229
40e53b8bca added implementation of simple integration dataset 2025-02-02 15:30:22 +00:00
joesharratt1229
76faad9dcf created test script for intermediate integration dataset generator 2025-02-02 15:30:01 +00:00
Andreas Koepf
8b0f634f4c post merge formatting 2025-02-02 15:24:39 +01:00
benjamrio
943651c15b added calendar-arithmetic tasks 2025-02-02 14:54:32 +01:00
Andreas Koepf
f396d3df60 post merge lint 2025-02-02 10:04:18 +01:00
rishabhranawat
f8696d6d22 [aiw] remove output format enum 2025-02-01 16:31:45 -08:00
rishabhranawat
3d42e84807 [aiw] remove output_formats style and change return type to a standard format 2025-02-01 16:30:05 -08:00