joesharratt1229
735a3b5cc4
added sokoban dataset ( #325 )
2025-03-11 00:21:03 +01:00
joesharratt1229
cb6d9dbba0
added tsumego curric ( #323 )
2025-03-11 00:19:55 +01:00
joesharratt1229
d0faada7a0
added futoshiki and tower hanou ( #316 )
...
* added futoshiki and tower hanou
* corrected failed unit tests
2025-03-11 00:12:32 +01:00
joesharratt1229
5e0f9b4a4b
Curriculum/emoji mystery ( #315 )
...
* added emoji curriculum
* updated metadata
* added curriculum to register
2025-03-11 00:11:27 +01:00
Zafir Stojanovski
52249f4d20
mini sudoku curriculum ( #311 )
2025-03-10 00:29:53 +01:00
Andreas Koepf
ce55d528ad
register MahjongPuzzleCurriculum
2025-03-07 19:17:04 +01:00
Zafir Stojanovski
a48ff14507
add difficulty where possible ( #274 )
2025-03-07 19:01:26 +01:00
Andreas Köpf
c2263979bc
Basic curriculum ( #198 )
...
* feat: Add optional curriculum support to dataset registration and creation
* docs: Add docstrings to create_curriculum() and register_dataset()
* feat: Add curriculum configuration classes for CurriculumExperiment
* feat: Add weight parameter to CurriculumAttributeConfig and use in DatasetSpec
* refactor: Simplify CurriculumAttributeConfig with "*" attribute level support
* test: Add unit tests for CurriculumExperiment class
* feat: Add from_yaml() method to CurriculumExperimentConfig with unit test
2025-03-07 11:22:12 +01:00
joesharratt1229
2c524c0c6f
Added puzzle24 closes #208 ( #268 )
...
* added puzzle24
2025-03-05 22:36:37 +01:00
joesharratt1229
7458dbc95d
Fixed countdown score_answer ( #265 )
...
* fixed countdown score ans
* checked solution uses all numbers
2025-03-05 22:30:12 +01:00
Zafir Stojanovski
3c544aba20
feat(env): Mahjong Puzzle Curriculum ( #263 )
...
* mahjong curriculum
* typo
* update levels
2025-03-05 22:28:02 +01:00
Zafir Stojanovski
19ca54da72
feat(env): NQueens Curriculum ( #262 )
...
* curriculum & tests
2025-03-05 15:05:17 +01:00
Andreas Köpf
b2904ccab9
Minor question template & score_answer improvements ( #261 )
...
* math prompt improvements
* ignore brackets in complex_arithmetic results
* improve additional instruction in prompt of polynomial_equations
* more strict tests for score_answer in polynomial_equations
* simplify special reward handling
* fix test_intermediate_integration
* fix sokoban dataset
* add common dataset score_answer consistency test
2025-03-04 21:55:09 +01:00
Zafir Stojanovski
2f9d94c1e7
fix: Unify Prompts ( #254 )
...
* remove cot
* fix prompt template
* fix pool matrix
* spiral matrix fixed
2025-03-03 21:55:53 +01:00
Zafir Stojanovski
78c92d7056
Mahjong Puzzle ( #241 )
...
* mahjong
2025-03-01 16:27:26 +01:00
Andreas Koepf
2362b52d24
add markdown tripple backticks around tsumego board
2025-02-26 19:39:05 +01:00
Andreas Koepf
2ddcb7c3c7
fix & simplify score_answer() of TsumegoDataset
2025-02-26 19:04:30 +01:00
Andreas Koepf
74f590e24f
more native type hints
2025-02-21 21:23:14 +01:00
Andreas Koepf
d27ec36c94
add markdown tripple backtick code block for emoji_mystry hint
2025-02-21 21:06:07 +01:00
Andreas Köpf
e41b86ec36
Merge branch 'main' into feat/emoji-mystery
2025-02-21 20:58:39 +01:00
joesharratt1229
f7be02abfc
added answer format spec in prompt
2025-02-21 18:03:05 +00:00
joesharratt1229
425ae24f3b
added emoji dataset
2025-02-21 17:57:41 +00:00
Andreas Köpf
28dc0932c4
Merge pull request #178 from olliestanley/feature/unsloth-train
...
Add minimal working GRPO training example with Unsloth
2025-02-21 15:37:24 +01:00
Andreas Koepf
ff5b210106
use native types List->list, Dict->dict, Set->set, Tuple->tuple
2025-02-21 15:15:38 +01:00
Oliver
31941d09e6
Answer scoring fixes to address edge cases
2025-02-20 22:04:01 +00:00
Andreas Köpf
9fb231dde9
Merge pull request #161 from olliestanley/fix/sudoku-unique
...
Fix Sudoku generator for uniqueness, implement scoring
2025-02-18 22:55:43 +01:00
Oliver
47321936a5
Add docstring
2025-02-18 21:38:46 +00:00
Oliver
47b4f29c6a
Remove now redundant is_valid function
2025-02-18 21:37:37 +00:00
Oliver
43ccddf1ac
Remove comment
2025-02-18 21:32:15 +00:00
Oliver
368d13d470
Optimise Sudoku uniqueness checks
2025-02-18 21:30:59 +00:00
Oliver
c1d2e555ee
Fix Sudoku generator uniqueness and scoring
2025-02-18 21:02:49 +00:00
Oliver
8a7c782c73
Tweak mini sudoku config
2025-02-18 18:46:14 +00:00
Oliver
90a77d0f5a
Tweak mini sudoku config
2025-02-18 18:43:19 +00:00
Oliver
0ccb3cbdfd
Tweak num_empty logic
2025-02-18 18:36:12 +00:00
Oliver
bf4c3d26d3
Ensure unique mini sudokus
2025-02-18 18:31:30 +00:00
Oliver
b40c44059d
Cleanup question & add scoring for mini sudoku
2025-02-17 18:37:07 +00:00
Andreas Koepf
99b49f868f
fix question templates
2025-02-16 23:04:24 +01:00
joesharratt1229
2c032a2500
restrcutured maze prompt template
2025-02-16 18:26:24 +00:00
joesharratt1229
94e07ddbf2
updated tower of hanoi question template
2025-02-16 17:54:33 +00:00
joesharratt1229
ca07c8584e
updated countdown question template
2025-02-16 17:53:54 +00:00
Andreas Koepf
5803a2962e
more tolerant parsing of futoshiki answers
2025-02-16 14:23:40 +01:00
Andreas Koepf
ac1f889d83
import FutoshikiDataset & update GALLERY
2025-02-15 21:26:15 +01:00
Andreas Köpf
ddb334ba20
Merge pull request #60 from olliestanley/feature/futoshiki-generator
...
Add Futoshiki puzzle generator
2025-02-15 21:24:15 +01:00
Zafir Stojanovski
95d367bc17
fix score function and add test
2025-02-15 16:09:08 +01:00
Oliver
d42b84ef4c
Add more instruction to generated questions
2025-02-15 13:47:54 +00:00
Zafir Stojanovski
b8036a4b7d
fix prompt
2025-02-15 10:31:03 +01:00
Andreas Koepf
14f67cb7e7
fix handling of walls, add unit test
2025-02-14 23:29:17 +01:00
Andreas Koepf (aider)
e106bd311c
refactor: Use Path.open() to read rush_18k.txt line by line
2025-02-14 17:41:09 +01:00
Andreas Koepf
1d1a0367d4
add RushHour to games/__init__.py
2025-02-14 17:35:14 +01:00
Andreas Koepf (aider)
d5ce2bfdf5
feat: Add score_answer method to RushHourDataset with input validation
2025-02-14 17:33:01 +01:00