Jean Kaddour
4e52a919a8
make formatting consistent
2025-02-07 23:07:29 +00:00
Jean Kaddour
faaede6e8d
refactor: add more docstrings and examples to tsumego
2025-02-07 23:02:57 +00:00
Andreas Koepf
9887a1beed
fix tool.hatch.build section in pyproject.toml
2025-02-07 19:02:43 +01:00
Andreas Koepf
58c3a24ace
use full link to gallery for PyPI
2025-02-07 18:29:45 +01:00
Andreas Koepf
2a363c8610
bump version to 0.1.14
2025-02-07 18:28:06 +01:00
Andreas Köpf
a64fdb8130
Merge pull request #74 from zafstojano/env/isomorphic-strings
Isomorphic Strings
2025-02-07 18:25:09 +01:00
Zafir Stojanovski
0fbed2cf04
isomorphic strings
2025-02-07 18:23:34 +01:00
Andreas Köpf
c4674f9fb0
Merge pull request #79 from open-thought/rich/selfref
Adds Self-Reference Logic Puzzles
2025-02-07 17:57:28 +01:00
Andreas Koepf
f79d4ab5e8
add complex_arithmetic
2025-02-07 17:53:30 +01:00
Andreas Koepf
29ca3a1f2d
Merge branch 'idigitopia-add-complex-number-arithmetic'
2025-02-07 17:49:46 +01:00
Aayam
fa102721f2
pre commit run changes
2025-02-07 07:42:03 -08:00
Aayam
cf25ca94b9
All numbers are integers now.
2025-02-07 07:34:17 -08:00
Aayam
d675ea843e
Apply pre-commit fixes
2025-02-07 07:01:20 -08:00
Aayam
50b96bff73
added explicit check that the answer matches the metadata result
2025-02-07 07:01:20 -08:00
Andreas Köpf
f9ab3abdea
Merge branch 'main' into rich/selfref
2025-02-07 15:57:00 +01:00
Andreas Köpf
23d6bd84d6
Merge pull request #80 from open-thought/tsumego_tweaks
Add GO hints, legend, disallow numeric answer, store expected string …
2025-02-07 15:56:07 +01:00
Andreas Koepf
6883a5b8c6
adapt answer format to numbering in board output display
2025-02-07 15:54:01 +01:00
Andreas Koepf
18a9f3343e
expect full entry for score_answer
2025-02-07 15:26:39 +01:00
Andreas Koepf
b448f553a4
Add GO hints, legend, disallow numeric answer, store expected string answer
2025-02-07 15:20:00 +01:00
Rich Jones
dd53681724
add self-reference puzzles
2025-02-07 15:09:42 +01:00
Andreas Köpf
c3ba476b72
Merge pull request #78 from JeanKaddour/main
Feat: Add Tsumego
2025-02-07 14:10:29 +01:00
tohskai
998d142d9f
Add PolynomialMultiplicationDataset (#64)
* Add PolynomialMultiplicationDataset
2025-02-07 14:06:41 +01:00
Jean Kaddour
741118eb52
feat: add tsumego
2025-02-07 11:22:33 +00:00
Andreas Köpf
1b49713116
Sokoban without pygame (#77)
* add minified version of https://github.com/xbandrade/sokoban-solver-generator
---------
Co-authored-by: Rich Jones <miserlou@gmail.com>
2025-02-07 11:57:53 +01:00
Andreas Köpf
a8f9eafd43
docs: Update TRL README with GRPO example details and usage instructions (#76)
2025-02-07 07:56:22 +01:00
joesharratt1229
d61db3772a
Test training with trl (#70)
* first trl grpo implementation
* added config yaml file
* added README and dependencies
* updated reward format func
2025-02-07 07:42:32 +01:00
Andreas Köpf
a607db79f7
Add Coaching & ScoreBoard class (result tracking) (#72)
* feat: Add Coach and ScoreBoard classes for performance tracking and difficulty adjustment
* feat: Add GroupedScores class to wrap aggregated scores
* refactor: Create ScoreStats class with tuple-based score statistics
* feat: Add unit test for Coach with CompositeDataset and multiple datasets
* fix: Add difficulty metadata to leg counting dataset
* feat: Add clear() method to ScoreBoard to reset all stored data
* feat: Add __len__ method to ScoreBoard to return number of scores
* feat: Add update_dataset_config method to CompositeDataset
* cleanup __init__ & imports
2025-02-06 23:15:28 +01:00
Andreas Köpf
05e2681ada
Merge pull request #75 from zafstojano/chore/remove-unused-methods
chore(envs): Remove unmodified dunder methods
2025-02-06 23:12:43 +01:00
Zafir Stojanovski
2dd53241e5
remove unmodified dunder methods
2025-02-06 22:56:11 +01:00
Andreas Koepf
b90c50e68f
remove redundant methods from GroupAnagramsDataset
2025-02-06 14:21:03 +01:00
Andreas Köpf
b23d25c92a
Merge pull request #65 from zafstojano/env/group-anagrams
Group Anagrams together
2025-02-06 13:03:27 +01:00
Zafir Stojanovski
40aed7fdd6
test malformed json answer
2025-02-06 10:13:02 +01:00
Zafir Stojanovski
cce890e169
use get_data_file_path to read file contents
2025-02-06 10:12:51 +01:00
Zafir Stojanovski
df8a7893dc
add source for words_alpha.txt
2025-02-06 10:12:38 +01:00
Zafir Stojanovski
7bec310591
delete words_alpha.txt
2025-02-06 10:12:25 +01:00
Andreas Köpf
14a600ee40
Merge pull request #68 from open-thought/revert-67-add-complex-number-arithmetic
Revert "feat: Add Complex Arithmetic Dataset and Tests"
2025-02-06 08:16:02 +01:00
Andreas Koepf
1df765e9e9
Revert "update GALLERY.md (complex_arithmetic)"
This reverts commit ff67fc8a51.
2025-02-06 08:14:17 +01:00
Andreas Köpf
3f0fa88a89
Revert "feat: Add Complex Arithmetic Dataset and Tests"
2025-02-06 08:12:52 +01:00
Andreas Koepf
ff67fc8a51
update GALLERY.md (complex_arithmetic)
2025-02-06 08:00:40 +01:00
Andreas Köpf
b0922c9de3
Merge pull request #67 from idigitopia/add-complex-number-arithmetic
feat: Add Complex Arithmetic Dataset and Tests
2025-02-06 07:59:11 +01:00
Aayam
d49c042e64
Apply pre-commit fixes
2025-02-05 22:53:36 -08:00
Andreas Koepf
ac7180d757
move workflow permissions to job level
2025-02-06 07:26:50 +01:00
Zafir Stojanovski
dc6f6a4e7e
docs
2025-02-06 00:12:58 +01:00
Zafir Stojanovski
7f611c2e0e
group anagrams env
2025-02-06 00:11:07 +01:00
Andreas Koepf
bb618d4b87
update dataset gallery
2025-02-05 21:17:37 +01:00
Andreas Köpf
2641033c4c
Merge pull request #63 from open-thought/gsm_symbolic_tests
Gsm symbolic fixes
2025-02-05 21:15:35 +01:00
Andreas Koepf
1e7864eb2a
update gsm cross-check status
2025-02-05 21:14:19 +01:00
Andreas Koepf
3ca9a709e8
gsm_symbolic generator changes
2025-02-05 20:58:01 +01:00
Andreas Köpf
03c57ff569
Merge pull request #61 from open-thought/composite_dataset
Add composite dataset
2025-02-05 19:05:31 +01:00
Aayam
a0e291d066
feat: Add Complex Arithmetic Dataset and Tests
This commit introduces a new dataset for complex number arithmetic operations:
- Implements ComplexArithmeticDataset for generating complex number problems
- Supports addition, subtraction, multiplication, and division operations
Part of the algebra tasks collection in reasoning-gym.
2025-02-05 08:53:06 -08:00