reasoning-gym

1179 commits 21 branches 5 tags 37 MiB

Author	SHA1	Message	Date
Rich Jones	07dc01ad87	[Env] Game of Life Halting Prediction (#272 ) This is a variant of the Game of Life task, which rather than trying to test the algorithmic simulation, tests the ability of the model to do explanatory reasoning of the board. The idea is that a model with good explanatory reasoning will be able to see that a game will not halt without simulating it into the future. The task presents a GoL board, and the model is asked to predict if the board will halt (die, all cells zero) after n steps. Sometimes, the board will be made up of 'oscillators', isolated structures which never die. Othertimes, it is filled with non-oscillators, structures which will always die after a few steps. The model should deduce which case the presented board is.	2025-03-07 10:05:12 +01:00
Andreas Köpf	5d7fbac0ad	Minor question template & score_answer improvements (#261 ) * math prompt improvements * ignore brackets in complex_arithmetic results * improve additional instruction in prompt of polynomial_equations * more strict tests for score_answer in polynomial_equations * simplify special reward handling * fix test_intermediate_integration * fix sokoban dataset * add common dataset score_answer consistency test	2025-03-04 21:55:09 +01:00
Rich Jones	0ba6119850	Game of Life partial scoring and rule-clarification (#258 ) * partial scoring and rule clarification * better ql scoring * word seq reverse typos	2025-03-03 22:22:39 +01:00
Andreas Koepf	3e7ff3b084	use native types List->list, Dict->dict, Set->set, Tuple->tuple	2025-02-21 15:15:38 +01:00
Rich Jones	445b63bbf2	fix weird GoL fmt	2025-02-20 11:09:29 +01:00
abdulhakeem	15140d06e7	Tweaked some question templates	2025-02-17 02:58:42 -06:00
Andreas Koepf	5d16a2193c	import string_insertion, minor whitespace fixes, gallery update	2025-02-14 07:55:38 +01:00
Andreas Koepf	8960d66beb	move game_of_life from games to algorithmic	2025-02-14 07:26:47 +01:00

Renamed from reasoning_gym/games/game_of_life.py (Browse further)

8 commits