Aayam
2170ff1c23
pre-commit run changes
2025-02-07 07:42:03 -08:00
Aayam
7ddc6390e6
All numbers are integers now.
2025-02-07 07:34:17 -08:00
Aayam
e2ce20bcbb
Apply pre-commit fixes
2025-02-07 07:01:20 -08:00
Aayam
f93c00a16b
added explicit check that the answer matches the metadata result
2025-02-07 07:01:20 -08:00
Andreas Köpf
2458d3a646
Merge pull request #78 from JeanKaddour/main
...
Feat: Add Tsumego
2025-02-07 14:10:29 +01:00
tohskai
847442ef0a
Add PolynomialMultiplicationDataset (#64)
...
* Add PolynomialMultiplicationDataset
2025-02-07 14:06:41 +01:00
Jean Kaddour
f625b9a68f
feat: add tsumego
2025-02-07 11:22:33 +00:00
Andreas Köpf
426fa22fcc
Sokoban without pygame (#77)
...
* add minified version of https://github.com/xbandrade/sokoban-solver-generator
---------
Co-authored-by: Rich Jones <miserlou@gmail.com>
2025-02-07 11:57:53 +01:00
Andreas Köpf
7b72c3470b
docs: Update TRL README with GRPO example details and usage instructions (#76)
2025-02-07 07:56:22 +01:00
joesharratt1229
a8e11e71be
Test training with trl (#70)
...
* first trl grpo implementation
* added config yaml file
* added README and dependencies
* updated reward format func
2025-02-07 07:42:32 +01:00
Andreas Köpf
3f6b2fc807
Add Coaching & ScoreBoard class (result tracking) (#72)
...
* feat: Add Coach and ScoreBoard classes for performance tracking and difficulty adjustment
* feat: Add GroupedScores class to wrap aggregated scores
* refactor: Create ScoreStats class with tuple-based score statistics
* feat: Add unit test for Coach with CompositeDataset and multiple datasets
* fix: Add difficulty metadata to leg counting dataset
* feat: Add clear() method to ScoreBoard to reset all stored data
* feat: Add __len__ method to ScoreBoard to return number of scores
* feat: Add update_dataset_config method to CompositeDataset
* cleanup __init__ & imports
2025-02-06 23:15:28 +01:00
Andreas Köpf
7c08c05b1e
Merge pull request #75 from zafstojano/chore/remove-unused-methods
...
chore(envs): Remove unmodified dunder methods
2025-02-06 23:12:43 +01:00
Zafir Stojanovski
b5a820a8d9
remove unmodified dunder methods
2025-02-06 22:56:11 +01:00
Andreas Koepf
071c22a809
remove redundant methods from GroupAnagramsDataset
2025-02-06 14:21:03 +01:00
Andreas Köpf
d2bef8d30f
Merge pull request #65 from zafstojano/env/group-anagrams
...
Group Anagrams together
2025-02-06 13:03:27 +01:00
Zafir Stojanovski
8244657ab9
test malformed json answer
2025-02-06 10:13:02 +01:00
Zafir Stojanovski
256eb71555
use get_data_file_path to read file contents
2025-02-06 10:12:51 +01:00
Zafir Stojanovski
6ec6876221
add source for words_alpha.txt
2025-02-06 10:12:38 +01:00
Zafir Stojanovski
5cc45d6939
delete words_alpha.txt
2025-02-06 10:12:25 +01:00
Andreas Köpf
34b13e4f74
Merge pull request #68 from open-thought/revert-67-add-complex-number-arithmetic
...
Revert "feat: Add Complex Arithmetic Dataset and Tests"
2025-02-06 08:16:02 +01:00
Andreas Koepf
088e6273fd
Revert "update GALLERY.md (complex_arithmetic)"
...
This reverts commit f512b391ad.
2025-02-06 08:14:17 +01:00
Andreas Köpf
334f43df53
Revert "feat: Add Complex Arithmetic Dataset and Tests"
2025-02-06 08:12:52 +01:00
Andreas Koepf
f512b391ad
update GALLERY.md (complex_arithmetic)
2025-02-06 08:00:40 +01:00
Andreas Köpf
529f83f522
Merge pull request #67 from idigitopia/add-complex-number-arithmetic
...
feat: Add Complex Arithmetic Dataset and Tests
2025-02-06 07:59:11 +01:00
Aayam
9280d22b83
Apply pre-commit fixes
2025-02-05 22:53:36 -08:00
Andreas Koepf
5da9d9a4c8
move workflow permissions to job level
2025-02-06 07:26:50 +01:00
Zafir Stojanovski
7827025f7b
docs
2025-02-06 00:12:58 +01:00
Zafir Stojanovski
b4f412956e
group anagrams env
2025-02-06 00:11:07 +01:00
Andreas Koepf
9ed1fd18d2
update dataset gallery
2025-02-05 21:17:37 +01:00
Andreas Köpf
aa024ce5b9
Merge pull request #63 from open-thought/gsm_symbolic_tests
...
Gsm symbolic fixes
2025-02-05 21:15:35 +01:00
Andreas Koepf
40420a35b9
update gsm cross-check status
2025-02-05 21:14:19 +01:00
Andreas Koepf
afb95508ef
gsm_symbolic generator changes
2025-02-05 20:58:01 +01:00
Andreas Köpf
c4c0b3b2d8
Merge pull request #61 from open-thought/composite_dataset
...
Add composite dataset
2025-02-05 19:05:31 +01:00
Aayam
5be79bcb1b
feat: Add Complex Arithmetic Dataset and Tests
...
This commit introduces a new dataset for complex number arithmetic operations:
- Implements ComplexArithmeticDataset for generating complex number problems
- Supports addition, subtraction, multiplication, and division operations
Part of the algebra tasks collection in reasoning-gym.
2025-02-05 08:53:06 -08:00
Zafir Stojanovski
76a3d4761c
generate all english anagrams
2025-02-05 16:25:23 +01:00
Andreas Köpf
b84e29a8b6
Merge pull request #62 from zafstojano/env/course-schedule
...
Course Schedule (Topological Sort)
2025-02-05 14:18:37 +01:00
Zafir Stojanovski
3121d3768a
typo
2025-02-05 12:08:18 +01:00
Zafir Stojanovski
eee0b36983
course schedule
2025-02-04 23:50:24 +01:00
Andreas Koepf
48999261dd
register composite dataset
2025-02-04 19:17:34 +01:00
Andreas Koepf (aider)
0c08ee5daa
feat: Add pyyaml dependency to project configuration
2025-02-04 19:07:52 +01:00
Andreas Koepf (aider)
2c86860b57
fix: Correct indentation and implementation of create_dataset function
2025-02-04 19:06:41 +01:00
Andreas Koepf (aider)
0387127b30
fix: Move dataset registration after function definition to resolve undefined name error
2025-02-04 19:06:24 +01:00
Andreas Koepf (aider)
c2e77f92aa
feat: Add CompositeDataset for weighted multi-dataset sampling
2025-02-04 19:06:13 +01:00
Andreas Koepf
cd3c95baf0
update notice of 3rd party code import
2025-02-04 13:47:57 +01:00
Andreas Koepf
f5e87856bf
use PYTHONHASHSEED=1 for generate_gallery.py
2025-02-04 12:03:45 +01:00
Andreas Köpf
7506592775
Merge pull request #59 from open-thought/fix_zebra_order
...
Make zebra puzzle clue order deterministic
2025-02-04 11:48:52 +01:00
Andreas Koepf
9bc92952f8
minimize changes
2025-02-04 11:46:19 +01:00
Andreas Koepf
f2b4c3d078
use sorted() and OrderedDict to make zebra puzzle clue order deterministic
2025-02-04 11:24:04 +01:00
Andreas Köpf
0cbd376dc1
Merge pull request #57 from zafstojano/env/largest-island
...
Find Largest Island (BFS)
2025-02-04 00:20:06 +01:00
Andreas Koepf
f5128207a6
minor logic puzzle changes
2025-02-04 00:18:21 +01:00