Commit graph

117 commits

Author SHA1 Message Date
joesharratt1229
735a3b5cc4 added sokoban dataset (#325) 2025-03-11 00:21:03 +01:00
joesharratt1229
cb6d9dbba0 added tsumego curric (#323) 2025-03-11 00:19:55 +01:00
joesharratt1229
d0faada7a0 added futoshiki and tower of hanoi (#316)
* added futoshiki and tower of hanoi

* corrected failed unit tests
2025-03-11 00:12:32 +01:00
joesharratt1229
5e0f9b4a4b Curriculum/emoji mystery (#315)
* added emoji curriculum

* updated metadata

* added curriculum to register
2025-03-11 00:11:27 +01:00
Zafir Stojanovski
52249f4d20 mini sudoku curriculum (#311) 2025-03-10 00:29:53 +01:00
Andreas Koepf
ce55d528ad register MahjongPuzzleCurriculum 2025-03-07 19:17:04 +01:00
Zafir Stojanovski
a48ff14507 add difficulty where possible (#274) 2025-03-07 19:01:26 +01:00
Andreas Köpf
c2263979bc Basic curriculum (#198)
* feat: Add optional curriculum support to dataset registration and creation
* docs: Add docstrings to create_curriculum() and register_dataset()
* feat: Add curriculum configuration classes for CurriculumExperiment
* feat: Add weight parameter to CurriculumAttributeConfig and use in DatasetSpec
* refactor: Simplify CurriculumAttributeConfig with "*" attribute level support
* test: Add unit tests for CurriculumExperiment class
* feat: Add from_yaml() method to CurriculumExperimentConfig with unit test
2025-03-07 11:22:12 +01:00
joesharratt1229
2c524c0c6f Added puzzle24 closes #208 (#268)
* added puzzle24
2025-03-05 22:36:37 +01:00
joesharratt1229
7458dbc95d Fixed countdown score_answer (#265)
* fixed countdown score ans
* checked solution uses all numbers
2025-03-05 22:30:12 +01:00
Zafir Stojanovski
3c544aba20 feat(env): Mahjong Puzzle Curriculum (#263)
* mahjong curriculum

* typo

* update levels
2025-03-05 22:28:02 +01:00
Zafir Stojanovski
19ca54da72 feat(env): NQueens Curriculum (#262)
* curriculum & tests
2025-03-05 15:05:17 +01:00
Andreas Köpf
b2904ccab9 Minor question template & score_answer improvements (#261)
* math prompt improvements
* ignore brackets in complex_arithmetic results
* improve additional instruction in prompt of polynomial_equations
* more strict tests for score_answer in polynomial_equations
* simplify special reward handling
* fix test_intermediate_integration
* fix sokoban dataset
* add common dataset score_answer consistency test
2025-03-04 21:55:09 +01:00
Zafir Stojanovski
2f9d94c1e7 fix: Unify Prompts (#254)
* remove cot
* fix prompt template
* fix pool matrix
* spiral matrix fixed
2025-03-03 21:55:53 +01:00
Zafir Stojanovski
78c92d7056 Mahjong Puzzle (#241)
* mahjong
2025-03-01 16:27:26 +01:00
Andreas Koepf
2362b52d24 add markdown triple backticks around tsumego board 2025-02-26 19:39:05 +01:00
Andreas Koepf
2ddcb7c3c7 fix & simplify score_answer() of TsumegoDataset 2025-02-26 19:04:30 +01:00
Andreas Koepf
74f590e24f more native type hints 2025-02-21 21:23:14 +01:00
Andreas Koepf
d27ec36c94 add markdown triple backtick code block for emoji_mystry hint 2025-02-21 21:06:07 +01:00
Andreas Köpf
e41b86ec36 Merge branch 'main' into feat/emoji-mystery 2025-02-21 20:58:39 +01:00
joesharratt1229
f7be02abfc added answer format spec in prompt 2025-02-21 18:03:05 +00:00
joesharratt1229
425ae24f3b added emoji dataset 2025-02-21 17:57:41 +00:00
Andreas Köpf
28dc0932c4 Merge pull request #178 from olliestanley/feature/unsloth-train
Add minimal working GRPO training example with Unsloth
2025-02-21 15:37:24 +01:00
Andreas Koepf
ff5b210106 use native types List->list, Dict->dict, Set->set, Tuple->tuple 2025-02-21 15:15:38 +01:00
Oliver
31941d09e6 Answer scoring fixes to address edge cases 2025-02-20 22:04:01 +00:00
Andreas Köpf
9fb231dde9 Merge pull request #161 from olliestanley/fix/sudoku-unique
Fix Sudoku generator for uniqueness, implement scoring
2025-02-18 22:55:43 +01:00
Oliver
47321936a5 Add docstring 2025-02-18 21:38:46 +00:00
Oliver
47b4f29c6a Remove now redundant is_valid function 2025-02-18 21:37:37 +00:00
Oliver
43ccddf1ac Remove comment 2025-02-18 21:32:15 +00:00
Oliver
368d13d470 Optimise Sudoku uniqueness checks 2025-02-18 21:30:59 +00:00
Oliver
c1d2e555ee Fix Sudoku generator uniqueness and scoring 2025-02-18 21:02:49 +00:00
Oliver
8a7c782c73 Tweak mini sudoku config 2025-02-18 18:46:14 +00:00
Oliver
90a77d0f5a Tweak mini sudoku config 2025-02-18 18:43:19 +00:00
Oliver
0ccb3cbdfd Tweak num_empty logic 2025-02-18 18:36:12 +00:00
Oliver
bf4c3d26d3 Ensure unique mini sudokus 2025-02-18 18:31:30 +00:00
Oliver
b40c44059d Cleanup question & add scoring for mini sudoku 2025-02-17 18:37:07 +00:00
Andreas Koepf
99b49f868f fix question templates 2025-02-16 23:04:24 +01:00
joesharratt1229
2c032a2500 restructured maze prompt template 2025-02-16 18:26:24 +00:00
joesharratt1229
94e07ddbf2 updated tower of hanoi question template 2025-02-16 17:54:33 +00:00
joesharratt1229
ca07c8584e updated countdown question template 2025-02-16 17:53:54 +00:00
Andreas Koepf
5803a2962e more tolerant parsing of futoshiki answers 2025-02-16 14:23:40 +01:00
Andreas Koepf
ac1f889d83 import FutoshikiDataset & update GALLERY 2025-02-15 21:26:15 +01:00
Andreas Köpf
ddb334ba20 Merge pull request #60 from olliestanley/feature/futoshiki-generator
Add Futoshiki puzzle generator
2025-02-15 21:24:15 +01:00
Zafir Stojanovski
95d367bc17 fix score function and add test 2025-02-15 16:09:08 +01:00
Oliver
d42b84ef4c Add more instruction to generated questions 2025-02-15 13:47:54 +00:00
Zafir Stojanovski
b8036a4b7d fix prompt 2025-02-15 10:31:03 +01:00
Andreas Koepf
14f67cb7e7 fix handling of walls, add unit test 2025-02-14 23:29:17 +01:00
Andreas Koepf (aider)
e106bd311c refactor: Use Path.open() to read rush_18k.txt line by line 2025-02-14 17:41:09 +01:00
Andreas Koepf
1d1a0367d4 add RushHour to games/__init__.py 2025-02-14 17:35:14 +01:00
Andreas Koepf (aider)
d5ce2bfdf5 feat: Add score_answer method to RushHourDataset with input validation 2025-02-14 17:33:01 +01:00