Commit graph

4 commits

Author SHA1 Message Date
Zafir Stojanovski
98e976642d
gemini flash and o3 mini configs (#425) 2025-04-20 21:24:52 +02:00
Adefioye
169d8c3aec
Evals: Sonnet-3.7-eval partial results (#423)
* Add results for sonnet and config

* Make some cleanup
2025-04-18 10:31:40 +02:00
Zafir Stojanovski
290bfc4fdd
(evals): Medium configs (#415)
* updated medium configs

* fix problematic curriculum values / small issues causing exceptions to be raised

* optimus alpha config

* all configs so far

* fix tests
2025-04-14 08:25:31 +02:00
Zafir Stojanovski
dced3bfc45
fix(curriculum): Make boundaries in curriculum more sensible (#407)
* init

* fix tests

* unify codeio

* filtered for libraries not present in reasoning-gym

* fix more bounds

* puzzle24

* knight swap curriculum

* fix number sorting

* fix attributes

* add validation of config in creation of dataset

* dry run for instantiating and validating the datasets

* remove unused imports

* fix curriculum tests to reference newly updated attribute names
2025-04-04 20:24:14 +02:00