18 commits

Author SHA1 Message Date
Zafir Stojanovski
dced3bfc45
fix(curriculum): Make boundaries in curriculum more sensible (#407)
* init

* fix tests

* unify codeio

* filtered for libraries not present in reasoning-gym

* fix more bounds

* puzzle24

* knight swap curriculum

* fix number sorting

* fix attributes

* add validation of config in creation of dataset

* dry run for instantiating and validating the datasets

* remove unused imports

* fix curriculum tests to reference newly updated attribute names
2025-04-04 20:24:14 +02:00
Zafir Stojanovski
50846c3534
fix(env): ARC 1D curriculum (#402)
* Add arc_1d curriculum

* Add difficulty to metadata

* use range attribute instead of scalar

---------

Co-authored-by: abdulhakeem <abdulhakeemadefioye@gmail.com>
Co-authored-by: Oliver Stanley <olivergestanley@gmail.com>
2025-04-01 13:01:15 +02:00
Andreas Köpf
5d7fbac0ad
Minor question template & score_answer improvements (#261)
* math prompt improvements
* ignore brackets in complex_arithmetic results
* improve additional instruction in prompt of polynomial_equations
* more strict tests for score_answer in polynomial_equations
* simplify special reward handling
* fix test_intermediate_integration
* fix sokoban dataset
* add common dataset score_answer consistency test
2025-03-04 21:55:09 +01:00
Andreas Koepf
469934d9b7 minor arc_1d tweaks 2025-02-23 16:37:40 +01:00
Andreas Koepf
ba56aa0092 add arc_1d size range test 2025-02-23 12:58:51 +01:00
Andreas Koepf
7a45b14a49 fix index out of range of arc_1d dataset (#190) 2025-02-23 12:51:41 +01:00
Andreas Koepf
044da9687f adapt unit tests to partial match changes 2025-02-14 21:30:50 +01:00
Andreas Koepf
60effc6e7a move arc_1d from cognition into arc folder 2025-02-08 19:37:26 +01:00
Andreas Koepf
3aeec71523 add attribution for arc-1d and unit tests 2025-02-02 23:45:25 +01:00
Andreas Koepf (aider)
a9549057e9 test: Add scoring tests for Arc1D dataset answer evaluation 2025-02-02 23:31:20 +01:00
Andreas Koepf
b7532f66ca test: Remove test_arc_1d.py file from tests directory 2025-02-02 23:30:15 +01:00
Andreas Koepf (aider)
905ef7b89d feat: Add missing task transformation imports to test_arc_1d.py 2025-02-02 22:42:43 +01:00
Andreas Koepf (aider)
84e4f1c5bc feat: Add task augmentation functions mirror, inverse, and identity to arc_1d.py 2025-02-02 22:42:21 +01:00
Andreas Koepf
5dd4c0e831 change parameter order for basic arc tasks 2025-02-02 17:25:37 +01:00
Andreas Koepf (aider)
28c30c69d1 fix: Correct argument passing in ARC 1D task test lambda functions 2025-02-02 16:43:25 +01:00
Andreas Koepf (aider)
2d3012d5ae fix: Update test_arc_1d.py to handle task function argument order 2025-02-02 16:42:46 +01:00
Andreas Koepf (aider)
d56e8c3a03 fix: Remove redundant parameters in ARC 1D task test suite 2025-02-02 16:42:21 +01:00
Andreas Koepf (aider)
f0ab9ec0d4 test: Add comprehensive unittest for arc_1d task functions 2025-02-02 16:40:39 +01:00