Commit graph

13 commits

Author SHA1 Message Date
Zafir Stojanovski
290bfc4fdd
(evals): Medium configs (#415)
* updated medium configs

* fix problematic curriculum values / small issues causing exceptions to be raised

* optimus alpha config

* all configs so far

* fix tests
2025-04-14 08:25:31 +02:00
Zafir Stojanovski
dced3bfc45
fix(curriculum): Make boundaries in curriculum more sensible (#407)
* init

* fix tests

* unify codeio

* filtered for libraries not present in reasoning-gym

* fix more bounds

* puzzle24

* knight swap curriculum

* fix number sorting

* fix attributes

* add validation of config in creation of dataset

* dry run for instantiating and validating the datasets

* remove unused imports

* fix curriculum tests to reference newly updated attribute names
2025-04-04 20:24:14 +02:00
Zafir Stojanovski
ce0a6c4878
fix(envs): Add source dataset and index to metadata (#388)
* add source dataset and index to metadata

* fix typo

* fix coach class and its test
2025-03-20 11:12:14 +00:00
Andreas Köpf
d2c895f1d3
Refactor Curriculum Attributes (#335)
* remove min_value from AttributeDefinition
* remove type from AttributeDefinition
* Add CurriculumContext
* add ensure_interval option for RangeAttributes
* docs: Add legend explaining curriculum indicators in dataset gallery
* update GALLERY.md
2025-03-16 15:40:28 +01:00
Adefioye
8a0cacc054
Add jugs curriculum (#369) 2025-03-14 18:04:33 +01:00
Andreas Köpf
5d7fbac0ad
Minor question template & score_answer improvements (#261)
* math prompt improvements
* ignore brackets in complex_arithmetic results
* improve additional instruction in prompt of polynomial_equations
* more strict tests for score_answer in polynomial_equations
* simplify special reward handling
* fix test_intermediate_integration
* fix sokoban dataset
* add common dataset score_answer consistency test
2025-03-04 21:55:09 +01:00
Andreas Koepf
eeb9fa31d5 more native type hints 2025-02-21 21:23:14 +01:00
Andreas Koepf
c236bcf1b6 store possible answer in entry 'answer' field 2025-02-20 22:47:21 +01:00
Andreas Koepf (aider)
9068e1d1b1 docs: Add descriptive comments for num_jugs and difficulty parameters 2025-02-20 22:39:22 +01:00
Andreas Koepf (aider)
ece040a0d0 feat: Add type hints to generate_puzzle and min_moves_n functions 2025-02-20 22:38:11 +01:00
Andreas Koepf
d7db2ed838 refactor: Simplify jug puzzle dataset generation and solution verification 2025-02-20 22:38:10 +01:00
Rich Jones
0f798457ed jugs jugs jugs lint 2025-02-20 16:15:29 +01:00
Rich Jones
6f00690ae1 basic jugs 2025-02-20 15:24:46 +01:00