Commit graph

238 commits

Author SHA1 Message Date
Zafir Stojanovski
a23c8c3d4e
sentence reordering curriculum (#326) 2025-03-11 00:21:41 +01:00
Zafir Stojanovski
9aeef4ebb0
palindrome generation curriculum (#322) 2025-03-11 00:19:11 +01:00
Zafir Stojanovski
ad48c551f9
feat(env): Number Sorting Curriculum (#321)
* number sorting curriculum

* metadata
2025-03-11 00:18:20 +01:00
Zafir Stojanovski
0bce1a6ae1
feat(env): Letter Jumble Curriculum (#319)
* base curriculum

* tests
2025-03-11 00:16:05 +01:00
Rich Jones
2b8f21c502
Correct Graph Coloring Difficulty (#318)
* correct gcolor difficulty

* refactor test
2025-03-11 00:14:38 +01:00
Rich Jones
d9ef4f4d14
Fix GoL-Halt Determinism (#317)
* test alt case

* fix determinism of gol-halt
2025-03-11 00:13:40 +01:00
Andreas Koepf
a49463c323 use file stem name of palindrome_generation dataset 2025-03-10 00:39:29 +01:00
Zafir Stojanovski
a1dc28aa73
feat(env): String Synthesis Curriculum (#308)
* string synthesis curriculum

* difficulty metadata
2025-03-10 00:27:03 +01:00
Zafir Stojanovski
037905667e
string splitting curriculum (#307) 2025-03-10 00:25:56 +01:00
Zafir Stojanovski
83cd34e21b
letter counting curriculum (#312) 2025-03-10 00:24:42 +01:00
Zafir Stojanovski
b88cadf75a
feat(env): Word Sequence Reversal curriculum (#313)
* word sequence reversal curriculum

* metadata
2025-03-10 00:24:05 +01:00
Zafir Stojanovski
54b216a5dc
string manipulation curriculum (#306) 2025-03-09 18:12:35 +01:00
Zafir Stojanovski
925283f342
string insertion curriculum (#305) 2025-03-09 18:11:29 +01:00
vncntt
af6120c095
add metadata for caesar cipher, graph coloring, decimal arithmetic (#304)
* add metadata for caesar cipher, graph coloring, decimal arithmetic

* delete comma

* clean up variables
2025-03-09 18:08:56 +01:00
vncntt
fc908d4cf4
Caesar cipher curriculum (#302)
* caesar cipher curriculum + tests
2025-03-09 08:23:32 +01:00
vncntt
e0f8ef061d
graph color curriculum (#303) 2025-03-09 08:20:47 +01:00
Zafir Stojanovski
2fca962847
ransom note curriculum (#300)
Co-authored-by: Andreas Köpf <andreas.koepf@xamla.com>
2025-03-08 21:00:13 +01:00
Zafir Stojanovski
bfa3a58829
palindrome partitioning curriculum (#299)
Co-authored-by: Andreas Köpf <andreas.koepf@xamla.com>
2025-03-08 20:58:59 +01:00
Zafir Stojanovski
194f08cad2
pool matrix curriculum (#298) 2025-03-08 20:57:22 +01:00
Zafir Stojanovski
5963cbd59e
rotten oranges curriculum (#297) 2025-03-08 20:56:46 +01:00
Zafir Stojanovski
6270e835bb
spiral matrix curriculum (#296) 2025-03-08 20:56:08 +01:00
Andreas Köpf
6615d8e662
Show curricula (#295)
* feat: Add debug_curricula.py script to generate CURRICULA.md with dataset curriculum details
2025-03-08 14:21:50 +01:00
Zafir Stojanovski
edab0389b6
rotate matrix curriculum (#294) 2025-03-08 01:58:54 +01:00
Zafir Stojanovski
8d4e9030c0
manipulate matrix curriculum (#293) 2025-03-08 01:57:37 +01:00
Zafir Stojanovski
e69ed78c26
feat(env): Isomorphic Strings Curriculum (#292)
* isomorphic strings curriculum

---------

Co-authored-by: Andreas Köpf <andreas.koepf@xamla.com>
2025-03-08 01:56:14 +01:00
joesharratt1229
af5a6533c8
added word sort curriculum (#289) 2025-03-08 01:50:13 +01:00
Zafir Stojanovski
2d05a48f9b
feat(env): Group Anagrams Curriculum (#288)
* group anagrams curriculum
2025-03-08 01:49:12 +01:00
Zafir Stojanovski
9fc9cf4597
feat(env): Count Primes Curriculum (#287)
* count primes curriculum
2025-03-08 01:48:00 +01:00
Zafir Stojanovski
adf8cd8f6d
base conversion curriculum (#286) 2025-03-08 01:46:32 +01:00
Zafir Stojanovski
25b8e35589
feat(env): Binary Matrix Curriculum (#279)
* binary matrix curriculum

* register BinaryMatrixCurriculum

---------

Co-authored-by: Andreas Koepf <andreas.koepf@provisio.com>
2025-03-07 22:58:47 +01:00
Zafir Stojanovski
a8e920b552
feat(env): Binary Alternation Curriculum (#278)
* binary alternation

---------

Co-authored-by: Andreas Koepf <andreas.koepf@provisio.com>
2025-03-07 22:44:32 +01:00
Andreas Koepf
c504efc2c3 use relative import for reasoning_gym.data 2025-03-07 15:56:45 +01:00
Rich Jones
07dc01ad87
[Env] Game of Life Halting Prediction (#272)
This is a variant of the Game of Life task, which rather than trying to test the algorithmic simulation, tests the ability of the model to do explanatory reasoning of the board. The idea is that a model with good explanatory reasoning will be able to see that a game will not halt without simulating it into the future.

The task presents a GoL board, and the model is asked to predict if the board will halt (die, all cells zero) after n steps. Sometimes, the board will be made up of 'oscillators', isolated structures which never die. Othertimes, it is filled with non-oscillators, structures which will always die after a few steps. The model should deduce which case the presented board is.
2025-03-07 10:05:12 +01:00
joesharratt1229
d9638df79c
updated algorithmics dataset (#269)
* updated algorithmic datasets
* added changes to symbolic and power
* updated power function test
2025-03-05 23:32:53 +01:00
Andreas Köpf
5d7fbac0ad
Minor question template & score_answer improvements (#261)
* math prompt improvements
* ignore brackets in complex_arithmetic results
* improve additional instruction in prompt of polynomial_equations
* more strict tests for score_answer in polynomial_equations
* simplify special reward handling
* fix test_intermediate_integration
* fix sokoban dataset
* add common dataset score_answer consistency test
2025-03-04 21:55:09 +01:00
Rich Jones
0ba6119850
Game of Life partial scoring and rule-clarification (#258)
* partial scoring and rule clarification
* better ql scoring
* word seq reverse typos
2025-03-03 22:22:39 +01:00
Zafir Stojanovski
01e1c8f9af
fix: Unify Prompts (#254)
* remove cot
* fix prompt template
* fix pool matrix
* spiral matrix fixed
2025-03-03 21:55:53 +01:00
joesharratt1229
49db4ed761
small change to word sequence reversal prompt (#252)
corrected ansewr format
2025-03-02 17:34:35 +01:00
Zafir Stojanovski
f549909c3d
fix manipulate matrix (#247) 2025-03-01 23:00:29 +01:00
Rich Jones
39f151ad14
more dynamic scoring for jumble (#246) 2025-03-01 18:50:59 +01:00
Rich Jones
a6c90f40a1 rm typo 2025-02-27 13:44:33 +01:00
Rich Jones
1b95cd3206 fix graph color example template 2025-02-27 13:43:01 +01:00
AhmedSaif2
dcdc38b15d Fix primes representation in count_primes dataset metadata 2025-02-26 14:58:21 +02:00
Andreas Köpf
48f082663a
Fix PoolMatrixConfigs::score_answer(), add unit tests (#215) 2025-02-26 00:43:18 +01:00
Andreas Koepf
bba128ffd0 fix score_answer of pool_matrix (if -> elif), remove print 2025-02-25 23:43:29 +01:00
Andreas Koepf
f9e8f8b064 add try-except to GraphColorDataset.score_answer() 2025-02-25 23:43:29 +01:00
Andreas Koepf
65d17b9850 add None/empty check to score_answer of cryptarithm 2025-02-25 23:43:29 +01:00
Andreas Koepf
eeb9fa31d5 more native type hints 2025-02-21 21:23:14 +01:00
Andreas Köpf
2947038557
Merge pull request #182 from zafstojano/env/binary-alternation
feat(env): Binary Alternation
2025-02-21 17:27:16 +01:00
Andreas Köpf
a6a5d30f1c
Merge pull request #175 from AhmedSaif2/fix-format
Add score_answer function to handle comma-formatted numbers
2025-02-21 15:36:21 +01:00