* init
* fix tests
* unify codeio
* filtered for libraries not present in reasoning-gym
* fix more bounds
* puzzle24
* knight swap curriculum
* fix number sorting
* fix attributes
* add validation of config in creation of dataset
* dry run for instantiating and validating the datasets
* remove unused imports
* fix curriculum tests to reference newly updated attribute names
* math prompt improvements
* ignore brackets in complex_arithmetic results
* improve additional instruction in prompt of polynomial_equations
* more strict tests for score_answer in polynomial_equations
* simplify special reward handling
* fix test_intermediate_integration
* fix sokoban dataset
* add common dataset score_answer consistency test