Commit graph

17 commits

Author SHA1 Message Date
Zafir Stojanovski
290bfc4fdd
(evals): Medium configs (#415)
* updated medium configs

* fix problematic curriculum values / small issues causing exceptions to be raised

* optimus alpha config

* all configs so far

* fix tests
2025-04-14 08:25:31 +02:00
Zafir Stojanovski
dced3bfc45
fix(curriculum): Make boundaries in curriculum more sensible (#407)
* init

* fix tests

* unify codeio

* filtered for libraries not present in reasoning-gym

* fix more bounds

* puzzle24

* knight swap curriculum

* fix number sorting

* fix attributes

* add validation of config in creation of dataset

* dry run for instantiating and validating the datasets

* remove unused imports

* fix curriculum tests to reference newly updated attribute names
2025-04-04 20:24:14 +02:00
Zafir Stojanovski
ce0a6c4878
fix(envs): Add source dataset and index to metadata (#388)
* add source dataset and index to metadata

* fix typo

* fix coach class and its test
2025-03-20 11:12:14 +00:00
joesharratt1229
e30be066ec
Fixed countdown score_answer (#265)
* fixed countdown score ans
* checked solution uses all numbers
2025-03-05 22:30:12 +01:00
Zafir Stojanovski
01e1c8f9af
fix: Unify Prompts (#254)
* remove cot
* fix prompt template
* fix pool matrix
* spiral matrix fixed
2025-03-03 21:55:53 +01:00
Andreas Koepf
3e7ff3b084 use native types List->list, Dict->dict, Set->set, Tuple->tuple 2025-02-21 15:15:38 +01:00
Andreas Koepf
2cbaab2918 fix question templates 2025-02-16 23:04:24 +01:00
joesharratt1229
262dcd08f8 updated countdown question template 2025-02-16 17:53:54 +00:00
Andreas Koepf
127f505798 add ArcAgiDataset class, fix score_entry() metadata params 2025-02-08 23:18:18 +01:00
joesharratt1229
37375f08a9 added linting checks 2025-01-31 07:19:55 +00:00
EC2 Default User
131e0d8f19 added countdown score answer impl 2025-01-31 06:42:25 +00:00
Andreas Koepf
31a5b5cb76 formatting 2025-01-25 18:51:28 +01:00
Andreas Koepf (aider)
3fc0c672a0 refactor: Extract candidate expression generation into separate method 2025-01-25 01:54:32 +01:00
Andreas Koepf (aider)
63d42e1af8 refactor: Add retry loop with max attempts for countdown expression generation 2025-01-25 01:52:43 +01:00
Andreas Koepf (aider)
e0a2f18a19 fix: Improve division handling in countdown game generation 2025-01-25 01:39:55 +01:00
Andreas Koepf (aider)
9a9c2b0460 fix: Improve Countdown game expression generation and validation 2025-01-25 01:36:49 +01:00
Andreas Koepf
1116f981c4 rename countdown_game to just countdown 2025-01-25 01:33:22 +01:00
Renamed from reasoning_gym/games/countdown_game.py (Browse further)