Zafir Stojanovski
3fbfa82afb
Merge branch 'main' of https://github.com/open-thought/reasoning-gym into env/spiral-matrix
2025-02-08 13:15:02 +01:00
joesharratt1229
5d17a6b31c
added testing of rearc
2025-02-08 11:42:49 +00:00
joesharratt1229
4023efd311
added lne length constraint to ea32f347 task
2025-02-08 11:42:49 +00:00
joesharratt1229
a1494c4e5b
adapted score answer
2025-02-08 11:42:49 +00:00
joesharratt1229
ab33c2bcbb
added parse func
2025-02-08 11:42:49 +00:00
joesharratt1229
65b61912ba
fixed rng err
2025-02-08 11:42:49 +00:00
joesharratt1229
4d9ea46c85
added validation
2025-02-08 11:42:49 +00:00
joesharratt1229
4c94975d15
added generator directory
2025-02-08 11:42:49 +00:00
joesharratt1229
b28283a31b
added rearc score answer and visualisation methods
2025-02-08 11:42:49 +00:00
joesharratt1229
4f74d73994
added rearc board format
2025-02-08 11:42:49 +00:00
joesharratt1229
f10236a64c
fixed relative imports
2025-02-08 11:42:49 +00:00
joesharratt1229
184d5b4270
registered rearc
2025-02-08 11:42:49 +00:00
joesharratt1229
e836add270
added rearc impl
2025-02-08 11:42:40 +00:00
Andreas Köpf
def66e0d40
Merge pull request #84 from JeanKaddour/main
...
Refactor: Add more Docstrings and Examples to Tsumego
2025-02-08 10:52:18 +01:00
Jean Kaddour
b34964be06
chore: run pre-commit
2025-02-08 08:33:21 +00:00
Jean Kaddour
8e02b363c1
Update GALLERY.md
2025-02-08 08:26:25 +00:00
Jean Kaddour
a2515ad9c7
make formatting consistent
2025-02-07 23:07:29 +00:00
Jean Kaddour
64b96b5fff
refactor: add more docstrings and examples to tsumego
2025-02-07 23:02:57 +00:00
Andreas Köpf
0c8752c7b1
Fix syllogisms ( #82 )
...
* let o1 write a new is_valid_syllogism() check
* extend unit test
* update gallery
2025-02-07 21:47:59 +01:00
Andreas Koepf
ff74dfb5f2
fix tool.hatch.build section in pyproject.toml
2025-02-07 19:02:43 +01:00
Andreas Koepf
f522cbb349
use full link to gallery for PyPI
2025-02-07 18:29:45 +01:00
Andreas Koepf
d3752a0d76
bump version to 0.1.14
2025-02-07 18:28:06 +01:00
Andreas Köpf
eb8b7afea4
Merge pull request #74 from zafstojano/env/isomorphic-strings
...
Isomorphic Strings
2025-02-07 18:25:09 +01:00
Zafir Stojanovski
d78ce0a9f7
isomorphic strings
2025-02-07 18:23:34 +01:00
Andreas Köpf
0a3d9b6bf2
Merge pull request #79 from open-thought/rich/selfref
...
Adds Self-Reference Logic Puzzles
2025-02-07 17:57:28 +01:00
Andreas Koepf
848997ee47
add complex_arithmetic
2025-02-07 17:53:30 +01:00
Andreas Koepf
51a975e753
Merge branch 'idigitopia-add-complex-number-arithmetic'
2025-02-07 17:49:46 +01:00
Aayam
2170ff1c23
pre commit run changes
2025-02-07 07:42:03 -08:00
Aayam
7ddc6390e6
All number are integers now.
2025-02-07 07:34:17 -08:00
Aayam
e2ce20bcbb
Apply pre-commit fixes
2025-02-07 07:01:20 -08:00
Aayam
f93c00a16b
added explicit check for answer to metadata result match
2025-02-07 07:01:20 -08:00
Andreas Köpf
258f844fc6
Merge branch 'main' into rich/selfref
2025-02-07 15:57:00 +01:00
Andreas Köpf
9cea106a90
Merge pull request #80 from open-thought/tsumego_tweaks
...
Add GO hints, legend, disallow numeric answer, store expected string …
2025-02-07 15:56:07 +01:00
Andreas Koepf
4eff39dde5
adapt answer format to numbering in board output display
2025-02-07 15:54:01 +01:00
Andreas Koepf
3b19bc8469
expect full entry for score_answer
2025-02-07 15:26:39 +01:00
Andreas Koepf
81cb7aa42b
Add GO hints, legend, disallow numeric answer, store expected string answer
2025-02-07 15:20:00 +01:00
Rich Jones
bd8fc9beeb
add self-reference puzzles
2025-02-07 15:09:42 +01:00
Zafir Stojanovski
747f0dbb13
pre-commit
2025-02-07 14:48:07 +01:00
Zafir Stojanovski
b24da41e69
ransom note
2025-02-07 14:47:00 +01:00
Andreas Köpf
2458d3a646
Merge pull request #78 from JeanKaddour/main
...
Feat: Add Tsumego
2025-02-07 14:10:29 +01:00
tohskai
847442ef0a
Add PolynomialMultiplicationDataset ( #64 )
...
* Add PolynomialMultiplicationDataset
2025-02-07 14:06:41 +01:00
Zafir Stojanovski
0a20a2e582
spiral matrix
2025-02-07 12:46:36 +01:00
Jean Kaddour
f625b9a68f
feat: add tsumego
2025-02-07 11:22:33 +00:00
Andreas Köpf
426fa22fcc
Sokoban without pygame ( #77 )
...
* add minified version of https://github.com/xbandrade/sokoban-solver-generator
---------
Co-authored-by: Rich Jones <miserlou@gmail.com>
2025-02-07 11:57:53 +01:00
Andreas Köpf
7b72c3470b
docs: Update TRL README with GRPO example details and usage instructions ( #76 )
2025-02-07 07:56:22 +01:00
joesharratt1229
a8e11e71be
Test training with trl ( #70 )
...
* first trl grpo implementation
* added config yaml file
* added read me and dependencies
* updated reward format func
2025-02-07 07:42:32 +01:00
Oliver
145ceeb109
Finish first draft futoshiki solver/gen
2025-02-07 00:09:35 +00:00
Oliver
238a41db43
Revert "Experiment with alternative solving/generation approach"
...
This reverts commit 06c75ce1a9 .
2025-02-07 00:06:38 +00:00
Oliver
06c75ce1a9
Experiment with alternative solving/generation approach
2025-02-06 23:58:09 +00:00
Andreas Köpf
3f6b2fc807
Add Coaching & ScoreBoard class (result tracking) ( #72 )
...
* feat: Add Coach and ScoreBoard classes for performance tracking and difficulty adjustment
* feat: Add GroupedScores class to wrap aggregated scores
* refactor: Create ScoreStats class with tuple-based score statistics
* feat: Add unit test for Coach with CompositeDataset and multiple datasets
* fix: Add difficulty metadata to leg counting dataset
* feat: Add clear() method to ScoreBoard to reset all stored data
* feat: Add __len__ method to ScoreBoard to return number of scores
* feat: Add update_dataset_config method to CompositeDataset
* cleanup __init__ & imports
2025-02-06 23:15:28 +01:00