reasoning-gym/reasoning_gym
Rich Jones 11c9790a25 [Env] Game of Life Halting Prediction (#272)
This is a variant of the Game of Life task that, rather than testing algorithmic simulation, tests the model's ability to reason explanatorily about the board. The idea is that a model with good explanatory reasoning can tell, without simulating the game into the future, that it will not halt.

The task presents a GoL board and asks the model to predict whether the board will halt (die, with all cells zero) after n steps. Sometimes the board is made up of 'oscillators', isolated structures that never die. Other times it is filled with non-oscillators, structures that always die after a few steps. The model should deduce which case the presented board falls into.
2025-03-07 10:05:12 +01:00
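The halting question described in the commit message can be illustrated with a brute-force check. The sketch below is not the code from this commit: the function names `step` and `halts_within`, the list-of-lists board format, and the non-wrapping board edges are all assumptions made for illustration. It simulates the board forward, which is exactly the work the task hopes a model can avoid by recognising oscillators.

```python
def step(board):
    """Advance a Game of Life board by one generation.

    Assumption: cells outside the grid are dead (no wrap-around).
    """
    rows, cols = len(board), len(board[0])
    nxt = [[0] * cols for _ in range(rows)]
    for r in range(rows):
        for c in range(cols):
            # Count live neighbours in the 3x3 window, excluding the cell itself.
            n = sum(
                board[rr][cc]
                for rr in range(max(0, r - 1), min(rows, r + 2))
                for cc in range(max(0, c - 1), min(cols, c + 2))
                if (rr, cc) != (r, c)
            )
            # Standard rules: birth on 3 neighbours, survival on 2 or 3.
            nxt[r][c] = 1 if n == 3 or (board[r][c] and n == 2) else 0
    return nxt


def halts_within(board, max_steps):
    """Return True if every cell is dead after at most max_steps generations."""
    current = board
    for _ in range(max_steps):
        if not any(any(row) for row in current):
            return True
        current = step(current)
    return not any(any(row) for row in current)


# A blinker is an oscillator: it flips between horizontal and vertical forever.
blinker = [
    [0, 0, 0, 0, 0],
    [0, 0, 0, 0, 0],
    [0, 1, 1, 1, 0],
    [0, 0, 0, 0, 0],
    [0, 0, 0, 0, 0],
]

# Two adjacent cells are a non-oscillator: each has one neighbour and dies.
pair = [
    [0, 0, 0, 0, 0],
    [0, 0, 0, 0, 0],
    [0, 1, 1, 0, 0],
    [0, 0, 0, 0, 0],
    [0, 0, 0, 0, 0],
]

print(halts_within(blinker, 20))  # the oscillator never dies
print(halts_within(pair, 1))      # the pair dies after one generation
```

A board containing the blinker never halts no matter how large `max_steps` is, so a model that recognises the pattern can answer without running any generations.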
algebra Minor question template & score_answer improvements (#261) 2025-03-04 21:55:09 +01:00
algorithmic [Env] Game of Life Halting Prediction (#272) 2025-03-07 10:05:12 +01:00
arc Minor question template & score_answer improvements (#261) 2025-03-04 21:55:09 +01:00
arithmetic updated algorithmics dataset (#269) 2025-03-05 23:32:53 +01:00
coaching feat(env): NQueens Curriculum (#262) 2025-03-05 15:05:17 +01:00
code First version of CodeI/O reasoning data (#264) 2025-03-05 22:34:11 +01:00
cognition Minor question template & score_answer improvements (#261) 2025-03-04 21:55:09 +01:00
data First version of CodeI/O reasoning data (#264) 2025-03-05 22:34:11 +01:00
games Added puzzle24 closes #208 (#268) 2025-03-05 22:36:37 +01:00
geometry fix: Unify Prompts (#254) 2025-03-03 21:55:53 +01:00
graphs shortest path curriculum (#271) 2025-03-05 22:46:10 +01:00
induction more native type hints 2025-02-21 21:23:14 +01:00
logic Minor question template & score_answer improvements (#261) 2025-03-04 21:55:09 +01:00
__init__.py update gallery, pypi release, bump version 2025-03-05 23:45:45 +01:00
composite.py use native types List->list, Dict->dict, Set->set, Tuple->tuple 2025-02-21 15:15:38 +01:00
dataset.py Minor question template & score_answer improvements (#261) 2025-03-04 21:55:09 +01:00
factory.py use native types List->list, Dict->dict, Set->set, Tuple->tuple 2025-02-21 15:15:38 +01:00
utils.py Minor question template & score_answer improvements (#261) 2025-03-04 21:55:09 +01:00
version_manager.py use native types List->list, Dict->dict, Set->set, Tuple->tuple 2025-02-21 15:15:38 +01:00