Zafir Stojanovski
290bfc4fdd
(evals): Medium configs ( #415 )
...
* updated medium configs
* fix problematic curriculum values / small issues causing exceptions to be raised
* optimus alpha config
* all configs so far
* fix tests
2025-04-14 08:25:31 +02:00
Zafir Stojanovski
dced3bfc45
fix(curriculum): Make boundaries in curriculum more sensible ( #407 )
...
* init
* fix tests
* unify codeio
* filtered for libraries not present in reasoning-gym
* fix more bounds
* puzzle24
* knight swap curriculum
* fix number sorting
* fix attributes
* add validation of config in creation of dataset
* dry run for instantiating and validating the datasets
* remove unused imports
* fix curriculum tests to reference newly updated attribute names
2025-04-04 20:24:14 +02:00
Zafir Stojanovski
ce0a6c4878
fix(envs): Add source dataset and index to metadata ( #388 )
...
* add source dataset and index to metadata
* fix typo
* fix coach class and its test
2025-03-20 11:12:14 +00:00
joesharratt1229
e30be066ec
Fixed countdown score_answer ( #265 )
...
* fixed countdown score ans
* checked solution uses all numbers
2025-03-05 22:30:12 +01:00
Zafir Stojanovski
01e1c8f9af
fix: Unify Prompts ( #254 )
...
* remove cot
* fix prompt template
* fix pool matrix
* spiral matrix fixed
2025-03-03 21:55:53 +01:00
Andreas Koepf
3e7ff3b084
use native types List->list, Dict->dict, Set->set, Tuple->tuple
2025-02-21 15:15:38 +01:00
Andreas Koepf
2cbaab2918
fix question templates
2025-02-16 23:04:24 +01:00
joesharratt1229
262dcd08f8
updated countdown question template
2025-02-16 17:53:54 +00:00
Andreas Koepf
127f505798
add ArcAgiDataset class, fix score_entry() metadata params
2025-02-08 23:18:18 +01:00
joesharratt1229
37375f08a9
added linting checks
2025-01-31 07:19:55 +00:00
EC2 Default User
131e0d8f19
added countdown score answer impl
2025-01-31 06:42:25 +00:00
Andreas Koepf
31a5b5cb76
formatting
2025-01-25 18:51:28 +01:00
Andreas Koepf (aider)
3fc0c672a0
refactor: Extract candidate expression generation into separate method
2025-01-25 01:54:32 +01:00
Andreas Koepf (aider)
63d42e1af8
refactor: Add retry loop with max attempts for countdown expression generation
2025-01-25 01:52:43 +01:00
Andreas Koepf (aider)
e0a2f18a19
fix: Improve division handling in countdown game generation
2025-01-25 01:39:55 +01:00
Andreas Koepf (aider)
9a9c2b0460
fix: Improve Countdown game expression generation and validation
2025-01-25 01:36:49 +01:00
Andreas Koepf
1116f981c4
rename countdown_game to just countdown
2025-01-25 01:33:22 +01:00