Commit graph

69 commits

Author SHA1 Message Date
Zafir Stojanovski
dc657b5ed4 feat(env): Binary Matrix Curriculum (#279)
* binary matrix curriculum

* register BinaryMatrixCurriculum

---------

Co-authored-by: Andreas Koepf <andreas.koepf@provisio.com>
2025-03-07 22:58:47 +01:00
Zafir Stojanovski
dfc28c94d6 feat(env): Binary Alternation Curriculum (#278)
* binary alternation

---------

Co-authored-by: Andreas Koepf <andreas.koepf@provisio.com>
2025-03-07 22:44:32 +01:00
Rich Jones
11c9790a25 [Env] Game of Life Halting Prediction (#272)
This is a variant of the Game of Life task, which rather than trying to test the algorithmic simulation, tests the ability of the model to do explanatory reasoning of the board. The idea is that a model with good explanatory reasoning will be able to see that a game will not halt without simulating it into the future.

The task presents a GoL board, and the model is asked to predict if the board will halt (die, all cells zero) after n steps. Sometimes, the board will be made up of 'oscillators', isolated structures which never die. Othertimes, it is filled with non-oscillators, structures which will always die after a few steps. The model should deduce which case the presented board is.
2025-03-07 10:05:12 +01:00
Zafir Stojanovski
a168605fc7 pre-commit 2025-02-21 13:39:05 +01:00
Zafir Stojanovski
6c46b93ae2 binary alternation 2025-02-21 13:09:21 +01:00
Andreas Köpf
07587d1647 Merge branch 'main' into env/rotten-oranges 2025-02-20 22:51:07 +01:00
Zafir Stojanovski
51ea7778ee rotten oranges 2025-02-20 22:33:39 +01:00
Rich Jones
3eef5841f6 basic jugs 2025-02-20 15:24:46 +01:00
Zafir Stojanovski
ed606631bb Merge branch 'main' of https://github.com/open-thought/reasoning-gym into env/palindrome-partitioning 2025-02-18 14:08:00 +01:00
Andreas Koepf
446913fee6 import CryptarithmDataset in algorithmic/__init__.py 2025-02-16 16:32:17 +01:00
Zafir Stojanovski
f9e846871b Merge branch 'main' of https://github.com/open-thought/reasoning-gym into env/string-splitting 2025-02-14 17:51:18 +01:00
Zafir Stojanovski
60ebb1329d string splitting 2025-02-14 17:50:07 +01:00
Andreas Koepf
ee2c03ea65 import string_insertion, minor whitespace fixes, gallery update 2025-02-14 07:55:38 +01:00
Andreas Koepf
80719c7201 move game_of_life from games to algorithmic 2025-02-14 07:26:47 +01:00
Andreas Köpf
3ceb7310ca Merge pull request #126 from zafstojano/env/string-synthesis
String Synthesis
2025-02-14 07:11:25 +01:00
Andreas Köpf
25e7bf140a Merge branch 'main' into rich/graphcolor 2025-02-14 07:09:38 +01:00
abdulhakeem
3b1ab9eb86 Fix more conflict 2025-02-13 21:24:05 -06:00
Zafir Stojanovski
c64a32155a lint 2025-02-13 21:15:17 -06:00
Zafir Stojanovski
aaf1df285e string insertion 2025-02-13 21:15:17 -06:00
Zafir Stojanovski
3202908c50 string synthesis 2025-02-13 16:33:28 +01:00
Rich Jones
9004e89bc3 Merge branch 'main' into rich/graphcolor 2025-02-13 13:43:34 +01:00
Andreas Köpf
1a7d248607 Merge branch 'main' into env/string-insertion 2025-02-13 13:07:29 +01:00
Rich Jones
14052133bb add graph coloring 2025-02-13 01:28:09 +01:00
Zafir Stojanovski
b45d777e2c string manipulation 2025-02-12 22:28:23 +01:00
Zafir Stojanovski
4f11003ab7 lint 2025-02-12 17:26:23 +01:00
Zafir Stojanovski
4c5bc58a29 string insertion 2025-02-12 15:18:51 +01:00
Andreas Köpf
1669bba91b Merge branch 'main' into env/pool_matrix 2025-02-12 14:07:25 +01:00
Andreas Köpf
18df4d33e8 Merge branch 'main' into rich/ab 2025-02-11 23:34:48 +01:00
Zafir Stojanovski
b39184d09e pool matrix 2025-02-11 22:22:39 +01:00
Rich Jones
cb4baab029 Add A::B Challenges 2025-02-11 18:08:25 +01:00
Zafir Stojanovski
d647498c43 lint 2025-02-11 14:44:46 +01:00
Zafir Stojanovski
3873c50ac6 count primes 2025-02-11 14:44:38 +01:00
Zafir Stojanovski
696fdf8be7 Merge branch 'main' of https://github.com/open-thought/reasoning-gym into env/matrix-manipulation 2025-02-10 20:40:41 +01:00
Zafir Stojanovski
111f4c9170 matrix manipulation 2025-02-10 13:51:39 +01:00
Zafir Stojanovski
f7836e17d0 binary matrix 2025-02-09 01:10:57 +01:00
Zafir Stojanovski
6815a892e2 Merge branch 'main' of https://github.com/open-thought/reasoning-gym into env/spiral-matrix 2025-02-08 18:52:45 +01:00
Andreas Köpf
b0a133904f Merge branch 'main' into env/rotate-matrix 2025-02-08 17:42:04 +01:00
Zafir Stojanovski
bc96024b45 rotate matrix 2025-02-08 14:27:10 +01:00
Zafir Stojanovski
9e6af4a871 Merge branch 'main' of https://github.com/open-thought/reasoning-gym into env/ransom-note 2025-02-08 13:19:37 +01:00
Zafir Stojanovski
f10f2417b7 Merge branch 'main' of https://github.com/open-thought/reasoning-gym into env/spiral-matrix 2025-02-08 13:15:02 +01:00
Zafir Stojanovski
0fbed2cf04 isomorphic strings 2025-02-07 18:23:34 +01:00
Zafir Stojanovski
1b62e06a77 pre-commit 2025-02-07 14:48:07 +01:00
Zafir Stojanovski
8e968e87fb ransom note 2025-02-07 14:47:00 +01:00
Zafir Stojanovski
ff938f1f26 spiral matrix 2025-02-07 12:46:36 +01:00
Zafir Stojanovski
138e55522e palindrome partitioning 2025-02-06 21:42:39 +01:00
Zafir Stojanovski
7f611c2e0e group anagrams env 2025-02-06 00:11:07 +01:00
Joe Norton
0cc2645027 add palindrome_generation 2025-01-31 18:45:52 -08:00
Andreas Koepf
bf62f631dd lint 2025-01-30 23:14:32 +01:00
Cavit Erginsoy
d57a7947a4 INIT 2025-01-30 21:32:46 +00:00
Andreas Koepf
ad9f0d265c fix unit tests, lower python dependency to 3.9 2025-01-26 16:55:17 +01:00