Andreas Köpf
e9d87a6933
Merge branch 'main' into env/rotate-matrix
2025-02-08 17:42:04 +01:00
Andreas Köpf
cf4bc09270
Merge pull request #86 from zafstojano/env/ransom-note
Ransom Note
2025-02-08 17:39:57 +01:00
Andreas Köpf
f8449b2e1a
Merge branch 'main' into env/ransom-note
2025-02-08 17:34:31 +01:00
Andreas Koepf
0c3b2c4fef
lint
2025-02-08 17:22:55 +01:00
Andreas Koepf (aider)
ac27508d09
feat: Add inversion probability and logical equivalence to syllogisms
2025-02-08 17:14:35 +01:00
Andreas Koepf
f2e02d6d08
add CONTRIBUTING.md, simplify README.md
2025-02-08 15:59:44 +01:00
Andreas Koepf (aider)
78212450c8
docs: Improve CONTRIBUTING.md with better formatting, clarity, and organization
2025-02-08 15:49:48 +01:00
Andreas Koepf
247ea4d8eb
docs: Add CONTRIBUTING.md guidelines for project contributions
2025-02-08 15:49:46 +01:00
Zafir Stojanovski
208cb3b1c4
remove added empty line in GALLERY.md
2025-02-08 15:22:33 +01:00
Zafir Stojanovski
2c40e655f9
update docs
2025-02-08 15:21:11 +01:00
Zafir Stojanovski
42312fe786
generalize to k rotations
2025-02-08 15:14:04 +01:00
Zafir Stojanovski
49587a9a63
remove GALLERY.md stuff
2025-02-08 14:50:06 +01:00
Zafir Stojanovski
807dad12de
remove stuff from GALLERY.md
2025-02-08 14:47:26 +01:00
Zafir Stojanovski
fe83bee725
rotate matrix
2025-02-08 14:27:10 +01:00
Zafir Stojanovski
b8cc814e7f
Merge branch 'main' of https://github.com/open-thought/reasoning-gym into env/ransom-note
2025-02-08 13:19:37 +01:00
Andreas Köpf
def66e0d40
Merge pull request #84 from JeanKaddour/main
Refactor: Add more Docstrings and Examples to Tsumego
2025-02-08 10:52:18 +01:00
Jean Kaddour
b34964be06
chore: run pre-commit
2025-02-08 08:33:21 +00:00
Jean Kaddour
8e02b363c1
Update GALLERY.md
2025-02-08 08:26:25 +00:00
Jean Kaddour
a2515ad9c7
make formatting consistent
2025-02-07 23:07:29 +00:00
Jean Kaddour
64b96b5fff
refactor: add more docstrings and examples to tsumego
2025-02-07 23:02:57 +00:00
Andreas Köpf
0c8752c7b1
Fix syllogisms (#82)
* let o1 write a new is_valid_syllogism() check
* extend unit test
* update gallery
2025-02-07 21:47:59 +01:00
Andreas Koepf
ff74dfb5f2
fix tool.hatch.build section in pyproject.toml
2025-02-07 19:02:43 +01:00
Andreas Koepf
f522cbb349
use full link to gallery for PyPI
2025-02-07 18:29:45 +01:00
Andreas Koepf
d3752a0d76
bump version to 0.1.14
2025-02-07 18:28:06 +01:00
Andreas Köpf
eb8b7afea4
Merge pull request #74 from zafstojano/env/isomorphic-strings
Isomorphic Strings
2025-02-07 18:25:09 +01:00
Zafir Stojanovski
d78ce0a9f7
isomorphic strings
2025-02-07 18:23:34 +01:00
Andreas Köpf
0a3d9b6bf2
Merge pull request #79 from open-thought/rich/selfref
Adds Self-Reference Logic Puzzles
2025-02-07 17:57:28 +01:00
Andreas Koepf
848997ee47
add complex_arithmetic
2025-02-07 17:53:30 +01:00
Andreas Koepf
51a975e753
Merge branch 'idigitopia-add-complex-number-arithmetic'
2025-02-07 17:49:46 +01:00
Aayam
2170ff1c23
pre commit run changes
2025-02-07 07:42:03 -08:00
Aayam
7ddc6390e6
All numbers are integers now.
2025-02-07 07:34:17 -08:00
Aayam
e2ce20bcbb
Apply pre-commit fixes
2025-02-07 07:01:20 -08:00
Aayam
f93c00a16b
added explicit check for answer to metadata result match
2025-02-07 07:01:20 -08:00
Andreas Köpf
258f844fc6
Merge branch 'main' into rich/selfref
2025-02-07 15:57:00 +01:00
Andreas Köpf
9cea106a90
Merge pull request #80 from open-thought/tsumego_tweaks
Add GO hints, legend, disallow numeric answer, store expected string …
2025-02-07 15:56:07 +01:00
Andreas Koepf
4eff39dde5
adapt answer format to numbering in board output display
2025-02-07 15:54:01 +01:00
Andreas Koepf
3b19bc8469
expect full entry for score_answer
2025-02-07 15:26:39 +01:00
Andreas Koepf
81cb7aa42b
Add GO hints, legend, disallow numeric answer, store expected string answer
2025-02-07 15:20:00 +01:00
Rich Jones
bd8fc9beeb
add self-reference puzzles
2025-02-07 15:09:42 +01:00
Zafir Stojanovski
b24da41e69
ransom note
2025-02-07 14:47:00 +01:00
Andreas Köpf
2458d3a646
Merge pull request #78 from JeanKaddour/main
Feat: Add Tsumego
2025-02-07 14:10:29 +01:00
tohskai
847442ef0a
Add PolynomialMultiplicationDataset (#64)
* Add PolynomialMultiplicationDataset
2025-02-07 14:06:41 +01:00
Jean Kaddour
f625b9a68f
feat: add tsumego
2025-02-07 11:22:33 +00:00
Andreas Köpf
426fa22fcc
Sokoban without pygame (#77)
* add minified version of https://github.com/xbandrade/sokoban-solver-generator
---------
Co-authored-by: Rich Jones <miserlou@gmail.com>
2025-02-07 11:57:53 +01:00
Andreas Köpf
7b72c3470b
docs: Update TRL README with GRPO example details and usage instructions (#76)
2025-02-07 07:56:22 +01:00
joesharratt1229
a8e11e71be
Test training with trl (#70)
* first trl grpo implementation
* added config yaml file
* added README and dependencies
* updated reward format func
2025-02-07 07:42:32 +01:00
Andreas Köpf
3f6b2fc807
Add Coaching & ScoreBoard class (result tracking) (#72)
* feat: Add Coach and ScoreBoard classes for performance tracking and difficulty adjustment
* feat: Add GroupedScores class to wrap aggregated scores
* refactor: Create ScoreStats class with tuple-based score statistics
* feat: Add unit test for Coach with CompositeDataset and multiple datasets
* fix: Add difficulty metadata to leg counting dataset
* feat: Add clear() method to ScoreBoard to reset all stored data
* feat: Add __len__ method to ScoreBoard to return number of scores
* feat: Add update_dataset_config method to CompositeDataset
* cleanup __init__ & imports
2025-02-06 23:15:28 +01:00
Andreas Köpf
7c08c05b1e
Merge pull request #75 from zafstojano/chore/remove-unused-methods
chore(envs): Remove unmodified dunder methods
2025-02-06 23:12:43 +01:00
Zafir Stojanovski
b5a820a8d9
remove unmodified dunder methods
2025-02-06 22:56:11 +01:00
Andreas Koepf
071c22a809
remove redundant methods from GroupAnagramsDataset
2025-02-06 14:21:03 +01:00