Commit graph

110 commits

Author SHA1 Message Date
Ritvik Rastogi
49b07130b3
feat: add scoring cascade for reducing false negatives (#526)
* feat: add scoring cascade for reducing false negatives in answer verification

* style: fix black and isort formatting

Run black and isort to satisfy pre-commit checks.

Made-with: Cursor

* docs: add scoring cascade example to Quickstart section

Mention the experimental scoring cascade feature at the end of the
Quickstart section with a disclaimer and complete usage examples
showing both the dataset method and standalone function.

Made-with: Cursor

* docs: shorten scoring cascade section in README

Trim to a concise standalone example per review feedback.

Made-with: Cursor

* docs: simplify scoring cascade description in README

Made-with: Cursor

* update readme

---------

Co-authored-by: Zafir Stojanovski <zaf.stojano@gmail.com>
2026-04-17 21:39:15 +02:00
Zafir Stojanovski
9a91d92ca6
Update README.md with new project (#519)
Add Apple's Multilingual Reasoning Gym paper to the projects list.

Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
2026-03-15 17:13:56 +01:00
Zafir Stojanovski
51bbe8c62b
Update README.md with new project (#518)
Add NVIDIA Nemotron 3 Super to the list of projects using Reasoning Gym.

Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>
2026-03-15 13:53:52 +01:00
Zafir Stojanovski
bcc68c5fee
Update README.md with new project (#510) 2025-10-02 17:43:32 +02:00
Zafir Stojanovski
15d7f027e4
add mila projects (#508) 2025-09-29 14:37:13 +01:00
Zafir Stojanovski
3fcb8642c6
(README): add gensyn paper (#506)
* gensyn paper
2025-09-11 17:11:04 +02:00
Rich Jones
b399c658ca
Add OptimalThinkingBench to Projects Using RG (#503)
* Add OptimalThinkingBench to Projects Using RG

* Update README.md
2025-08-24 21:36:11 +02:00
Zafir Stojanovski
b8aa55704b
add discord link (#500) 2025-08-05 09:57:46 +01:00
Zafir Stojanovski
86c4f8552f
add GEM to projects using RG (#498) 2025-08-02 09:09:53 +01:00
Zafir Stojanovski
bf451d5197
Update README.md (#483) 2025-07-05 01:57:21 +02:00
Rich Jones
d9cd20c174
Update README.md (RLSwarm GenRL) (#480) 2025-06-26 10:20:45 +01:00
Oliver Stanley
1c9ed2e0eb
better usage demo in readme (#477)
* better usage demo in readme

* example of non-default configs
2025-06-25 13:38:25 -07:00
Zafir Stojanovski
c2ac6fae32
Update README.md (#475) 2025-06-24 14:19:11 +01:00
Zafir Stojanovski
0159b1b571
Update README.md - Star History (#463) 2025-06-08 11:51:43 +01:00
Andreas Köpf
be2babea9c
Use raw URLs for images in README.md (#459)
On pypi images were not correctly rendered because the old img src urls in README.md
pointed to files on github with UI.
2025-06-06 21:23:59 +01:00
Oliver Stanley
1232a7d1e5
simplify training setup instructions (#454)
* simplify training setup instructions

* tweaks

* update cfgs

* readme update

* readme update
2025-06-06 09:51:29 +01:00
Zafir Stojanovski
0ebabf709b
Update README.md with Atropos (#458) 2025-06-06 09:24:25 +01:00
Zafir Stojanovski
0699e2f507
Update README.md (#451) 2025-06-04 12:45:23 +02:00
Zafir Stojanovski
84958baa69
abs path for images (#449)
* abs path for images

* width and height outside of style
2025-06-04 10:33:13 +02:00
Oliver Stanley
2a57a95ca2
add minimal example for building training datasets (#448) 2025-06-03 19:28:45 +01:00
Zafir Stojanovski
b3f81a6609
fix(README): Arxiv link (#447) 2025-06-02 12:20:38 +02:00
Zafir Stojanovski
af2548f8f2
Add README assets (#446)
* add assets

* pre-commit

* remove bg
2025-06-02 10:33:54 +02:00
Zafir Stojanovski
ec485cce24
Add banner in README.md (#399) 2025-04-01 21:29:41 +02:00
Oliver Stanley
ea10a0f932
update task count in readme (#400)
* update task count in readme

* fix link
2025-04-01 10:51:36 +02:00
Rich Jones
0d84a8ecf5 fix mkd 2025-02-21 15:14:36 +01:00
Rich Jones
0730664eb6 link to eval readme 2025-02-21 15:13:29 +01:00
Rich Jones
84583b450a edits 2025-02-21 15:09:11 +01:00
Rich Jones
2cb395ba30 polish readme a bit 2025-02-21 15:04:04 +01:00
Andreas Koepf
f2e02d6d08 add CONTRIBUTING.md, simplify README.md 2025-02-08 15:59:44 +01:00
Andreas Koepf
f522cbb349 use full link to gallery for PyPI 2025-02-07 18:29:45 +01:00
Andreas Köpf
eb8b7afea4
Merge pull request #74 from zafstojano/env/isomorphic-strings
Isomorphic Strings
2025-02-07 18:25:09 +01:00
Zafir Stojanovski
d78ce0a9f7 isomorphic strings 2025-02-07 18:23:34 +01:00
Rich Jones
bd8fc9beeb add self-reference puzzles 2025-02-07 15:09:42 +01:00
Andreas Köpf
2458d3a646
Merge pull request #78 from JeanKaddour/main
Feat: Add Tsumego
2025-02-07 14:10:29 +01:00
tohskai
847442ef0a
Add PolynomialMultiplicationDataset (#64)
* Add PolynomialMultiplicationDataset
2025-02-07 14:06:41 +01:00
Jean Kaddour
f625b9a68f feat: add tsumego 2025-02-07 11:22:33 +00:00
Andreas Köpf
426fa22fcc
Sokoban without pygame (#77)
* add minified version of https://github.com/xbandrade/sokoban-solver-generator

---------

Co-authored-by: Rich Jones <miserlou@gmail.com>
2025-02-07 11:57:53 +01:00
Andreas Köpf
d2bef8d30f
Merge pull request #65 from zafstojano/env/group-anagrams
Group Anagrams together
2025-02-06 13:03:27 +01:00
Andreas Köpf
334f43df53
Revert "feat: Add Complex Arithmetic Dataset and Tests" 2025-02-06 08:12:52 +01:00
Zafir Stojanovski
b4f412956e group anagrams env 2025-02-06 00:11:07 +01:00
Aayam
5be79bcb1b feat: Add Complex Arithmetic Dataset and Tests
This commit introduces a new dataset for complex number arithmetic operations:

- Implements ComplexArithmeticDataset for generating complex number problems
- Supports addition, subtraction, multiplication, and division operations

Part of the algebra tasks collection in reasoning-gym.
2025-02-05 08:53:06 -08:00
Zafir Stojanovski
eee0b36983 course schedule 2025-02-04 23:50:24 +01:00
Andreas Köpf
0cbd376dc1
Merge pull request #57 from zafstojano/env/largest-island
Find Largest Island (BFS)
2025-02-04 00:20:06 +01:00
Zafir Stojanovski
f7ebdd0622 update readmes 2025-02-03 22:51:35 +01:00
Rich Jones
4a19aa8f14 readme 2025-02-03 16:49:18 +01:00
Andreas Köpf
3dd5a4df2e
Merge pull request #47 from zafstojano/feat/n-queens
feat(env): N Queens
2025-02-02 21:54:02 +01:00
rishabhranawat
b69cb27f75 Merge branch 'aiw' of https://github.com/rishabhranawat/reasoning-gym into aiw 2025-02-02 08:26:30 -08:00
rishabhranawat
519999ff89 Update dataset list w/ some missing logic datasets 2025-02-02 08:26:05 -08:00
Zafir Stojanovski
c74b600085 n queens 2025-02-02 16:47:21 +01:00
benjamrio
943651c15b added calendar-arithmetic tasks 2025-02-02 14:54:32 +01:00