Commit graph

6 commits

Author SHA1 Message Date
Zafir Stojanovski
dced3bfc45
fix(curriculum): Make boundaries in curriculum more sensible (#407)
* init

* fix tests

* unify codeio

* filtered for libraries not present in reasoning-gym

* fix more bounds

* puzzle24

* knight swap curriculum

* fix number sorting

* fix attributes

* add validation of config in creation of dataset

* dry run for instantiating and validating the datasets

* remove unused imports

* fix curriculum tests to reference newly updated attribute names
2025-04-04 20:24:14 +02:00
Zafir Stojanovski
8d4e9030c0
manipulate matrix curriculum (#293) 2025-03-08 01:57:37 +01:00
Andreas Köpf
5d7fbac0ad
Minor question template & score_answer improvements (#261)
* math prompt improvements
* ignore brackets in complex_arithmetic results
* improve additional instruction in prompt of polynomial_equations
* more strict tests for score_answer in polynomial_equations
* simplify special reward handling
* fix test_intermediate_integration
* fix sokoban dataset
* add common dataset score_answer consistency test
2025-03-04 21:55:09 +01:00
Zafir Stojanovski
f549909c3d
fix manipulate matrix (#247) 2025-03-01 23:00:29 +01:00
Zafir Stojanovski
a0a5de3658 add more config params 2025-02-10 22:30:36 +01:00
Zafir Stojanovski
3d66cc6a7f matrix manipulation 2025-02-10 13:51:39 +01:00