11 commits

Author SHA1 Message Date
SII-Whereby
7d68a6cc70
Fix(reasoning_gym/games/countdown): Resolve SymPy parsing conflict for 10+ input numbers (#514)
* Refactor expression generation and substitution logic

Updated symbol naming and added safe replacement for expressions.

* Add expr_str to return values in countdown.py

Modified return statement to include the modified expression string.

* Implement test for min_numbers exceeding 10

Add test for CountdownDataset with more than 10 numbers

* Remove trailing-whitespace

* Improve readability of CountdownDataset initialization

Refactor CountdownDataset initialization for readability.
2025-12-15 11:05:38 +00:00
joesharratt1229
876e0aa440
corrected countdown issue (#479) 2025-06-25 13:37:04 -07:00
Zafir Stojanovski
290bfc4fdd
(evals): Medium configs (#415)
* updated medium configs

* fix problematic curriculum values / small issues causing exceptions to be raised

* optimus alpha config

* all configs so far

* fix tests
2025-04-14 08:25:31 +02:00
Zafir Stojanovski
dced3bfc45
fix(curriculum): Make boundaries in curriculum more sensible (#407)
* init

* fix tests

* unify codeio

* filtered for libraries not present in reasoning-gym

* fix more bounds

* puzzle24

* knight swap curriculum

* fix number sorting

* fix attributes

* add validation of config in creation of dataset

* dry run for instantiating and validating the datasets

* remove unused imports

* fix curriculum tests to reference newly updated attribute names
2025-04-04 20:24:14 +02:00
joesharratt1229
e30be066ec
Fixed countdown score_answer (#265)
* fixed countdown score ans
* checked solution uses all numbers
2025-03-05 22:30:12 +01:00
Andreas Koepf
127f505798 add ArcAgiDataset class, fix score_entry() metadata params 2025-02-08 23:18:18 +01:00
joesharratt1229
37375f08a9 added linting checks 2025-01-31 07:19:55 +00:00
joesharratt1229
4fea3c3378 added testing of score answer method 2025-01-31 06:46:18 +00:00
Andreas Koepf
31a5b5cb76 formatting 2025-01-25 18:51:28 +01:00
Andreas Koepf (aider)
9a9c2b0460 fix: Improve Countdown game expression generation and validation 2025-01-25 01:36:49 +01:00
Andreas Koepf
1116f981c4 rename countdown_game to just countdown 2025-01-25 01:33:22 +01:00
Renamed from tests/test_countdown_game.py