Zafir Stojanovski
50846c3534
fix(env): ARC 1D curriculum ( #402 )
...
* Add arc_1d curriculum
* Add difficulty to metadata
* use range attribute instead of scalar
---------
Co-authored-by: abdulhakeem <abdulhakeemadefioye@gmail.com>
Co-authored-by: Oliver Stanley <olivergestanley@gmail.com>
2025-04-01 13:01:15 +02:00
Andreas Köpf
5d7fbac0ad
Minor question template & score_answer improvements ( #261 )
...
* math prompt improvements
* ignore brackets in complex_arithmetic results
* improve additional instruction in prompt of polynomial_equations
* more strict tests for score_answer in polynomial_equations
* simplify special reward handling
* fix test_intermediate_integration
* fix sokoban dataset
* add common dataset score_answer consistency test
2025-03-04 21:55:09 +01:00
Andreas Koepf
469934d9b7
minor arc_1d tweaks
2025-02-23 16:37:40 +01:00
Andreas Koepf
ba56aa0092
add arc_1d size range test
2025-02-23 12:58:51 +01:00
Andreas Koepf
7a45b14a49
fix index out of range of arc_1d dataset ( #190 )
2025-02-23 12:51:41 +01:00
Andreas Koepf
044da9687f
adapt unit tests to partial match changes
2025-02-14 21:30:50 +01:00
Andreas Koepf
60effc6e7a
move arc_1d into from cognition into arc folder
2025-02-08 19:37:26 +01:00
Andreas Koepf
3aeec71523
add attribution for arc-1d and unit tests
2025-02-02 23:45:25 +01:00
Andreas Koepf (aider)
a9549057e9
test: Add scoring tests for Arc1D dataset answer evaluation
2025-02-02 23:31:20 +01:00
Andreas Koepf
b7532f66ca
test: Remove test_arc_1d.py file from tests directory
2025-02-02 23:30:15 +01:00
Andreas Koepf (aider)
905ef7b89d
feat: Add missing task transformation imports to test_arc_1d.py
2025-02-02 22:42:43 +01:00
Andreas Koepf (aider)
84e4f1c5bc
feat: Add task augmentation functions mirror, inverse, and identity to arc_1d.py
2025-02-02 22:42:21 +01:00
Andreas Koepf
5dd4c0e831
change parameter order for basic arc tasks
2025-02-02 17:25:37 +01:00
Andreas Koepf (aider)
28c30c69d1
fix: Correct argument passing in ARC 1D task test lambda functions
2025-02-02 16:43:25 +01:00
Andreas Koepf (aider)
2d3012d5ae
fix: Update test_arc_1d.py to handle task function argument order
2025-02-02 16:42:46 +01:00
Andreas Koepf (aider)
d56e8c3a03
fix: Remove redundant parameters in ARC 1D task test suite
2025-02-02 16:42:21 +01:00
Andreas Koepf (aider)
f0ab9ec0d4
test: Add comprehensive unittest for arc_1d task functions
2025-02-02 16:40:39 +01:00