Zafir Stojanovski
4c47527130
fix(envs): Add source dataset and index to metadata ( #388 )
...
* add source dataset and index to metadata
* fix typo
* fix coach class and its test
2025-03-20 11:12:14 +00:00
Oliver Stanley
c6d01541aa
include ranges rather than sampled values in difficulty metadata dicts ( #387 )
...
* update difficulty metadata for logic datasets
* update difficulty metadata for graph datasets
* update difficulty metadata for geometry datasets
* update difficulty metadata for games datasets
* update difficulty metadata for cognition datasets
* update difficulty metadata for arithmetic datasets
* update difficulty metadata for arc datasets
* update difficulty metadata for algorithmic datasets
* update difficulty metadata for algebra datasets
* use tuples
* update tests
* update tests
2025-03-20 10:27:03 +01:00
Andreas Koepf
eaef88b45b
don't pass answer value to eval
2025-03-17 23:13:53 +01:00
Jean Kaddour
2e9877ae07
fix: add score_answer() to number_sorting ( #380 )
...
* fix: add score_answer() to number_sorting
* chore: run pre-commit
* fix: use json.loads()
* fix: run isort()
2025-03-17 23:04:13 +01:00
Andreas Köpf
ed766028fb
Refactor Curriculum Attributes ( #335 )
...
* remove min_value from AttributeDefinition
* remove type from AttributeDefinition
* Add CurriculumContext
* add ensure_interval option for RangeAttributes
* docs: Add legend explaining curriculum indicators in dataset gallery
* update GALLERY.md
2025-03-16 15:40:28 +01:00
Adefioye
7bab2ecef8
Add jugs curriculum ( #369 )
2025-03-14 18:04:33 +01:00
Rich Jones
4006200132
GoL-Halt Curricula ( #366 )
...
* GoL-Halt Curricula
* trivial
2025-03-14 16:15:45 +01:00
Oliver Stanley
0fdb8f77dd
add word ladder curriculum ( #361 )
...
* add word ladder curriculum
* add to __init__.py
2025-03-14 16:10:52 +01:00
Adefioye
02e0e3b0c5
Add gol curriculum ( #354 )
...
* Add gol curriculum
* Add difficulty
* Make levels of grid size of x and y be valid
2025-03-13 21:09:09 +01:00
Adefioye
5e0ccf3499
Cryptarithm curriculum ( #346 )
...
* Add curriculum for cryptarithm
* Add difficulty to metadata
2025-03-13 21:03:57 +01:00
Adefioye
454250a4ea
Add curriculum to ab dataset ( #345 )
...
* Add curriculum to ab dataset
* Add difficulty to metadata
2025-03-13 21:03:02 +01:00
Zafir Stojanovski
ede43c58ba
number filtering curriculum ( #333 )
2025-03-11 23:56:06 +01:00
Zafir Stojanovski
b926800267
spell backward curriculum ( #327 )
...
Co-authored-by: Andreas Köpf <andreas.koepf@xamla.com>
2025-03-11 00:22:28 +01:00
Zafir Stojanovski
2cd2ec3dec
sentence reordering curriculum ( #326 )
2025-03-11 00:21:41 +01:00
Zafir Stojanovski
5cd4c57823
palindrome generation curriculum ( #322 )
2025-03-11 00:19:11 +01:00
Zafir Stojanovski
f9fa667d82
feat(env): Number Sorting Curriculum ( #321 )
...
* number sorting curriculum
* metadata
2025-03-11 00:18:20 +01:00
Zafir Stojanovski
80d74d96fb
feat(env): Letter Jumble Curriculum ( #319 )
...
* base curriculum
* tests
2025-03-11 00:16:05 +01:00
Rich Jones
a49ed1342f
Correct Graph Coloring Difficulty ( #318 )
...
* correct gcolor difficulty
* refactor test
2025-03-11 00:14:38 +01:00
Rich Jones
da309310e5
Fix GoL-Halt Determinism ( #317 )
...
* test alt case
* fix determinism of gol-halt
2025-03-11 00:13:40 +01:00
Andreas Koepf
cd634a3755
use file stem name of palindrome_generation dataset
2025-03-10 00:39:29 +01:00
Zafir Stojanovski
9bd4f03dbd
feat(env): String Synthesis Curriculum ( #308 )
...
* string synthesis curriculum
* difficulty metadata
2025-03-10 00:27:03 +01:00
Zafir Stojanovski
01f69e06c2
string splitting curriculum ( #307 )
2025-03-10 00:25:56 +01:00
Zafir Stojanovski
04f0f7deb7
letter counting curriculum ( #312 )
2025-03-10 00:24:42 +01:00
Zafir Stojanovski
6aa7547abd
feat(env): Word Sequence Reversal curriculum ( #313 )
...
* word sequence reversal curriculum
* metadata
2025-03-10 00:24:05 +01:00
Zafir Stojanovski
7c7c783883
string manipulation curriculum ( #306 )
2025-03-09 18:12:35 +01:00
Zafir Stojanovski
e1e05884ee
string insertion curriculum ( #305 )
2025-03-09 18:11:29 +01:00
vncntt
91aa3f3ae2
add metadata for caesar cipher, graph coloring, decimal arithmetic ( #304 )
...
* add metadata for caesar cipher, graph coloring, decimal arithmetic
* delete comma
* clean up variables
2025-03-09 18:08:56 +01:00
vncntt
1f360917fc
Caesar cipher curriculum ( #302 )
...
* caesar cipher curriculum + tests
2025-03-09 08:23:32 +01:00
vncntt
2bcc06717c
graph color curriculum ( #303 )
2025-03-09 08:20:47 +01:00
Zafir Stojanovski
b58371c533
ransom note curriculum ( #300 )
...
Co-authored-by: Andreas Köpf <andreas.koepf@xamla.com>
2025-03-08 21:00:13 +01:00
Zafir Stojanovski
501f2d8322
palindrome partitioning curriculum ( #299 )
...
Co-authored-by: Andreas Köpf <andreas.koepf@xamla.com>
2025-03-08 20:58:59 +01:00
Zafir Stojanovski
9f57132c2c
pool matrix curriculum ( #298 )
2025-03-08 20:57:22 +01:00
Zafir Stojanovski
cbd223d7ca
rotten oranges curriculum ( #297 )
2025-03-08 20:56:46 +01:00
Zafir Stojanovski
e4e516a949
spiral matrix curriculum ( #296 )
2025-03-08 20:56:08 +01:00
Andreas Köpf
d82c73b6f8
Show curricula ( #295 )
...
* feat: Add debug_curricula.py script to generate CURRICULA.md with dataset curriculum details
2025-03-08 14:21:50 +01:00
Zafir Stojanovski
9525f2588b
rotate matrix curriculum ( #294 )
2025-03-08 01:58:54 +01:00
Zafir Stojanovski
f8d4914cf5
manipulate matrix curriculum ( #293 )
2025-03-08 01:57:37 +01:00
Zafir Stojanovski
03ce955eab
feat(env): Isomorphic Strings Curriculum ( #292 )
...
* isomorphic strings curriculum
---------
Co-authored-by: Andreas Köpf <andreas.koepf@xamla.com>
2025-03-08 01:56:14 +01:00
joesharratt1229
88a3d065bd
added word sort curriculum ( #289 )
2025-03-08 01:50:13 +01:00
Zafir Stojanovski
e8601a63b4
feat(env): Group Anagrams Curriculum ( #288 )
...
* group anagrams curriculum
2025-03-08 01:49:12 +01:00
Zafir Stojanovski
07eb434d61
feat(env): Count Primes Curriculum ( #287 )
...
* count primes curriculum
2025-03-08 01:48:00 +01:00
Zafir Stojanovski
488b72f6f1
base conversion curriculum ( #286 )
2025-03-08 01:46:32 +01:00
Zafir Stojanovski
dc657b5ed4
feat(env): Binary Matrix Curriculum ( #279 )
...
* binary matrix curriculum
* register BinaryMatrixCurriculum
---------
Co-authored-by: Andreas Koepf <andreas.koepf@provisio.com>
2025-03-07 22:58:47 +01:00
Zafir Stojanovski
dfc28c94d6
feat(env): Binary Alternation Curriculum ( #278 )
...
* binary alternation
---------
Co-authored-by: Andreas Koepf <andreas.koepf@provisio.com>
2025-03-07 22:44:32 +01:00
Andreas Koepf
2b1f7ce5ee
use relative import for reasoning_gym.data
2025-03-07 15:56:45 +01:00
Rich Jones
11c9790a25
[Env] Game of Life Halting Prediction ( #272 )
...
This is a variant of the Game of Life task, which rather than trying to test the algorithmic simulation, tests the ability of the model to do explanatory reasoning of the board. The idea is that a model with good explanatory reasoning will be able to see that a game will not halt without simulating it into the future.
The task presents a GoL board, and the model is asked to predict if the board will halt (die, all cells zero) after n steps. Sometimes, the board will be made up of 'oscillators', isolated structures which never die. Othertimes, it is filled with non-oscillators, structures which will always die after a few steps. The model should deduce which case the presented board is.
2025-03-07 10:05:12 +01:00
joesharratt1229
1893691c57
updated algorithmics dataset ( #269 )
...
* updated algorithmic datasets
* added changes to symbolic and power
* updated power function test
2025-03-05 23:32:53 +01:00
Andreas Köpf
b2904ccab9
Minor question template & score_answer improvements ( #261 )
...
* math prompt improvements
* ignore brackets in complex_arithmetic results
* improve additional instruction in prompt of polynomial_equations
* more strict tests for score_answer in polynomial_equations
* simplify special reward handling
* fix test_intermediate_integration
* fix sokoban dataset
* add common dataset score_answer consistency test
2025-03-04 21:55:09 +01:00
Rich Jones
e3b7365f50
Game of Life partial scoring and rule-clarification ( #258 )
...
* partial scoring and rule clarification
* better ql scoring
* word seq reverse typos
2025-03-03 22:22:39 +01:00
Zafir Stojanovski
2f9d94c1e7
fix: Unify Prompts ( #254 )
...
* remove cot
* fix prompt template
* fix pool matrix
* spiral matrix fixed
2025-03-03 21:55:53 +01:00