Zafir Stojanovski
7c7c783883
string manipulation curriculum ( #306 )
2025-03-09 18:12:35 +01:00
Zafir Stojanovski
e1e05884ee
string insertion curriculum ( #305 )
2025-03-09 18:11:29 +01:00
vncntt
1f360917fc
Caesar cipher curriculum ( #302 )
...
* caesar cipher curriculum + tests
2025-03-09 08:23:32 +01:00
vncntt
2bcc06717c
graph color curriculum ( #303 )
2025-03-09 08:20:47 +01:00
Zafir Stojanovski
b58371c533
ransom note curriculum ( #300 )
...
Co-authored-by: Andreas Köpf <andreas.koepf@xamla.com>
2025-03-08 21:00:13 +01:00
Zafir Stojanovski
501f2d8322
palindrome partitioning curriculum ( #299 )
...
Co-authored-by: Andreas Köpf <andreas.koepf@xamla.com>
2025-03-08 20:58:59 +01:00
Zafir Stojanovski
9f57132c2c
pool matrix curriculum ( #298 )
2025-03-08 20:57:22 +01:00
Zafir Stojanovski
cbd223d7ca
rotten oranges curriculum ( #297 )
2025-03-08 20:56:46 +01:00
Zafir Stojanovski
e4e516a949
spiral matrix curriculum ( #296 )
2025-03-08 20:56:08 +01:00
Andreas Köpf
d82c73b6f8
Show curricula ( #295 )
...
* feat: Add debug_curricula.py script to generate CURRICULA.md with dataset curriculum details
2025-03-08 14:21:50 +01:00
Zafir Stojanovski
9525f2588b
rotate matrix curriculum ( #294 )
2025-03-08 01:58:54 +01:00
Zafir Stojanovski
f8d4914cf5
manipulate matrix curriculum ( #293 )
2025-03-08 01:57:37 +01:00
Zafir Stojanovski
03ce955eab
feat(env): Isomorphic Strings Curriculum ( #292 )
...
* isomorphic strings curriculum
---------
Co-authored-by: Andreas Köpf <andreas.koepf@xamla.com>
2025-03-08 01:56:14 +01:00
joesharratt1229
e2e094376e
added power function exponent ( #291 )
...
* added power function exponent
* register PowerFunctionCurriculum
---------
Co-authored-by: Andreas Koepf <andreas.koepf@provisio.com>
2025-03-08 01:54:36 +01:00
joesharratt1229
88a3d065bd
added word sort curriculum ( #289 )
2025-03-08 01:50:13 +01:00
Zafir Stojanovski
e8601a63b4
feat(env): Group Anagrams Curriculum ( #288 )
...
* group anagrams curriculum
2025-03-08 01:49:12 +01:00
Zafir Stojanovski
07eb434d61
feat(env): Count Primes Curriculum ( #287 )
...
* count primes curriculum
2025-03-08 01:48:00 +01:00
Zafir Stojanovski
488b72f6f1
base conversion curriculum ( #286 )
2025-03-08 01:46:32 +01:00
vncntt
6d0b219412
dice curriculum ( #284 )
...
* curriculum + unit tests
* add difficulty to metadata
---------
Co-authored-by: Andreas Koepf <andreas.koepf@provisio.com>
2025-03-08 01:43:45 +01:00
vncntt
d1c06e9f98
Calendar arithmetic curriculum ( #283 )
...
* calendar arithmetic curriculum
* add difficulty to metadata
* register CalendarArithmeticCurriculum
---------
Co-authored-by: Andreas Koepf <andreas.koepf@provisio.com>
2025-03-08 01:38:22 +01:00
vncntt
9438cec490
Bitwise arithmetic curriculum ( #282 )
...
* bitwise_arithmetic curriculum
* register BitwiseArithmeticCurriculum
---------
Co-authored-by: Andreas Koepf <andreas.koepf@provisio.com>
2025-03-08 01:32:00 +01:00
joesharratt1229
e304b20e24
added Decimal curriculum ( #280 )
...
* added decimal curricula
* added chain sum decimal curriculum
* register DecimalArithmeticCurriculum & DecimalChainSumCurriculum
---------
Co-authored-by: Andreas Koepf <andreas.koepf@provisio.com>
2025-03-07 23:02:57 +01:00
Zafir Stojanovski
dc657b5ed4
feat(env): Binary Matrix Curriculum ( #279 )
...
* binary matrix curriculum
* register BinaryMatrixCurriculum
---------
Co-authored-by: Andreas Koepf <andreas.koepf@provisio.com>
2025-03-07 22:58:47 +01:00
joesharratt1229
98def56bb4
added basic arith curricula ( #276 )
...
* added basic arith curricula
* register BasicArithmeticCurriculum
---------
Co-authored-by: Andreas Koepf <andreas.koepf@provisio.com>
2025-03-07 22:54:49 +01:00
Zafir Stojanovski
dfc28c94d6
feat(env): Binary Alternation Curriculum ( #278 )
...
* binary alternation
---------
Co-authored-by: Andreas Koepf <andreas.koepf@provisio.com>
2025-03-07 22:44:32 +01:00
Zafir Stojanovski
0fb90ce8c4
feat(env): Leg Counting Curriculum ( #275 )
...
* leg counting curriculum
---------
Co-authored-by: Andreas Koepf <andreas.koepf@provisio.com>
2025-03-07 19:15:18 +01:00
Andreas Köpf
c2263979bc
Basic curriculum ( #198 )
...
* feat: Add optional curriculum support to dataset registration and creation
* docs: Add docstrings to create_curriculum() and register_dataset()
* feat: Add curriculum configuration classes for CurriculumExperiment
* feat: Add weight parameter to CurriculumAttributeConfig and use in DatasetSpec
* refactor: Simplify CurriculumAttributeConfig with "*" attribute level support
* test: Add unit tests for CurriculumExperiment class
* feat: Add from_yaml() method to CurriculumExperimentConfig with unit test
2025-03-07 11:22:12 +01:00
Rich Jones
34889d0517
Add Modulo Grid Task ( #273 )
...
* add modulo_grid dataset
* ensure the pattern is mathematical, not just spatial
---------
Co-authored-by: Andreas Koepf <andreas.koepf@provisio.com>
2025-03-07 11:11:41 +01:00
Rich Jones
11c9790a25
[Env] Game of Life Halting Prediction ( #272 )
...
This is a variant of the Game of Life task, which rather than trying to test the algorithmic simulation, tests the ability of the model to do explanatory reasoning of the board. The idea is that a model with good explanatory reasoning will be able to see that a game will not halt without simulating it into the future.
The task presents a GoL board, and the model is asked to predict if the board will halt (die, all cells zero) after n steps. Sometimes, the board will be made up of 'oscillators', isolated structures which never die. Othertimes, it is filled with non-oscillators, structures which will always die after a few steps. The model should deduce which case the presented board is.
2025-03-07 10:05:12 +01:00
joesharratt1229
1893691c57
updated algorithmics dataset ( #269 )
...
* updated algorithmic datasets
* added changes to symbolic and power
* updated power function test
2025-03-05 23:32:53 +01:00
Zafir Stojanovski
f843ac1b82
shortest path curriculum ( #271 )
2025-03-05 22:46:10 +01:00
Zafir Stojanovski
a048084009
largest island curriculum ( #270 )
2025-03-05 22:45:35 +01:00
Zafir Stojanovski
3d9bb382aa
feat(env): Count Bits Curriculum ( #267 )
...
* add min n
* count bits
2025-03-05 22:44:04 +01:00
Zafir Stojanovski
84158df1c7
feat(env): Course Schedule Curriculum ( #266 )
...
* course schedule curriculum
* update levels
* update comments
* lint
2025-03-05 22:42:46 +01:00
joesharratt1229
2c524c0c6f
Added puzzle24 closes #208 ( #268 )
...
* added puzzle24
2025-03-05 22:36:37 +01:00
Oliver Stanley
3286a68361
First version of CodeI/O reasoning data ( #264 )
...
* notebook for prepping first set of raw code files
* updated codeio processing notebook for repo-level processing
* fix for edge case in codeio scoring
* Add reformat notebook
* filtering pass
* add non-determinism filtering
* Tweak CodeIODataset & include first real data
* add basic codeio test, metadata
2025-03-05 22:34:11 +01:00
joesharratt1229
7458dbc95d
Fixed countdown score_answer ( #265 )
...
* fixed countdown score ans
* checked solution uses all numbers
2025-03-05 22:30:12 +01:00
Zafir Stojanovski
3c544aba20
feat(env): Mahjong Puzzle Curriculum ( #263 )
...
* mahjong curriculum
* typo
* update levels
2025-03-05 22:28:02 +01:00
Zafir Stojanovski
19ca54da72
feat(env): NQueens Curriculum ( #262 )
...
* curriculum & tests
2025-03-05 15:05:17 +01:00
Andreas Köpf
b2904ccab9
Minor question template & score_answer improvements ( #261 )
...
* math prompt improvements
* ignore brackets in complex_arithmetic results
* improve additional instruction in prompt of polynomial_equations
* more strict tests for score_answer in polynomial_equations
* simplify special reward handling
* fix test_intermediate_integration
* fix sokoban dataset
* add common dataset score_answer consistency test
2025-03-04 21:55:09 +01:00
Rich Jones
e3b7365f50
Game of Life partial scoring and rule-clarification ( #258 )
...
* partial scoring and rule clarification
* better ql scoring
* word seq reverse typos
2025-03-03 22:22:39 +01:00
vncntt
8992037ecc
fixed problems in knights_knaves ( #251 )
...
* remove unnecessary variables
* added depth logic
* add depth tests
2025-03-02 08:47:54 +01:00
Andreas Köpf
ece6990709
Remove strip from ProceduralDataset::core score_answer() ( #250 )
...
* remove strip from ProceduralDataset::core score_answer(), strip in extract answer (optional, default=True)
* test: Move test_extract_answer() from test_dataset.py to test_utils.py
* refactor: Improve decimal reward computation with more flexible comparison
* fix: Implement rounding for format_number when round_if_needed is True
* test: Add test case for compute_decimal_reward with sign and zeros
2025-03-02 08:46:36 +01:00
Zafir Stojanovski
1bc9f6f09f
fix manipulate matrix ( #247 )
2025-03-01 23:00:29 +01:00
Rich Jones
80aafda8e5
more dynamic scoring for jumble ( #246 )
2025-03-01 18:50:59 +01:00
Zafir Stojanovski
78c92d7056
Mahjong Puzzle ( #241 )
...
* mahjong
2025-03-01 16:27:26 +01:00
Andreas Köpf
ed90fff3fa
Merge pull request #220 from open-thought/rich/cubeinstructions
...
Make Rubiks Cube Output Format More Explicit
2025-02-27 12:16:09 +01:00
Rich Jones
b2b2311329
seed test config
2025-02-27 10:44:28 +01:00
Rich Jones
9daaccc208
expand more
2025-02-27 10:41:30 +01:00
Andreas Koepf (aider)
a92dcd4a75
feat: Add comprehensive unit tests for parse_string_to_complex() method
2025-02-26 21:44:32 +01:00