Commit graph

614 commits

Author SHA1 Message Date
tohskai
28fcf4d481 Refactor PolynomialMultiplicationDataset and fix issues with score_answer 2025-02-17 17:04:48 +01:00
tohskai
7bad77b426 Improve support for multivariate polynomials 2025-02-11 01:58:07 +01:00
Adefioye
767c34297f Add score_answer method to word_ladder (#93)
* Add score_answer method to word_ladder
* add unit test for WordLadderDataset::score_answer()

---------

Co-authored-by: Andreas Koepf <andreas.koepf@provisio.com>
2025-02-10 15:15:23 +01:00
Andreas Köpf
3150f9d9aa Merge pull request #97 from rishabhranawat/eval-v1
[eval-basic] initial scripts for evaluating models on reasoning gym
2025-02-10 11:59:49 +01:00
rishabhranawat
03f87dbc07 [eval-basic] remove large results files, add gitignore, only leave summary 2025-02-09 22:52:10 -08:00
rishabhranawat
2308ed99fb [eval-basic] run precommit formatting 2025-02-09 22:40:45 -08:00
rishabhranawat
94f07ed35d [eval-basic] initial scripts for evaluating models on reasoning gym 2025-02-09 22:36:27 -08:00
Andreas Koepf
c59db00196 reduce default zero probability for binary matrix 2025-02-09 20:05:56 +01:00
Andreas Köpf
0605c0cbe4 Merge pull request #91 from zafstojano/env/binary-matrix
Binary Matrix
2025-02-09 19:55:36 +01:00
Andreas Köpf
7444f774c6 Merge pull request #92 from rishabhranawat/poly-reward
Add score_answer() for PolynomialEquationsDataset
2025-02-09 19:30:24 +01:00
rishabhranawat
04b3323844 [poly-reward] run pre-commit hooks 2025-02-09 07:30:18 -08:00
Zafir Stojanovski
e5862371ed update instruction and shuffle numbers 2025-02-09 13:00:46 +01:00
Andreas Köpf
70c731e9fb Merge pull request #94 from zafstojano/fix/prime-factorization-scoring
fix(env): Prime Factorization scoring
2025-02-09 12:02:11 +01:00
Zafir Stojanovski
6cc5d0dd63 normalize answer and partial reward 2025-02-09 11:13:23 +01:00
rishabhranawat
7a6f7ea9da [poly-reward] minor updates to the docstrings 2025-02-08 21:41:18 -08:00
rishabhranawat
0f4ab53bd3 Merge branch 'main' of https://github.com/rishabhranawat/reasoning-gym into poly-reward 2025-02-08 21:37:21 -08:00
rishabhranawat
0dd4c05897 [poly-reward] add a greedy strategy scoring function for polynomial equations 2025-02-08 21:36:21 -08:00
Zafir Stojanovski
89fd56f8e9 RotateMatrix typo 2025-02-09 01:11:06 +01:00
Zafir Stojanovski
f7836e17d0 binary matrix 2025-02-09 01:10:57 +01:00
Andreas Koepf
0c7fbb5001 bump version 2025-02-09 00:39:48 +01:00
Andreas Koepf
04bffd8f59 update GALLERY.md after merging knight_swap 2025-02-09 00:35:56 +01:00
Andreas Köpf
17ea7dd975 Merge pull request #89 from JeanKaddour/feat-swap-knights-puzzles
Feat swap knights puzzles
2025-02-09 00:33:48 +01:00
Andreas Köpf
f5a6dabb8b Merge pull request #90 from open-thought/arc_agi_1_dataset
ARC-AGI-1 dataset with augmentations
2025-02-09 00:19:20 +01:00
Andreas Koepf (aider)
ec8036c099 feat: Add configurable rotation and mirror augmentation variants 2025-02-09 00:16:41 +01:00
Andreas Koepf
b73040b066 refactor: Reorganize ArcAgiConfig class attributes for better readability 2025-02-09 00:12:59 +01:00
Andreas Koepf
e56316ebb2 formatting 2025-02-09 00:04:42 +01:00
Andreas Koepf (aider)
8d8d85e6b2 fix: Add missing Callable import to arc_agi.py 2025-02-08 23:59:53 +01:00
Andreas Koepf (aider)
cdb9d8d8f8 feat: Add configurable augmentations to ArcAgiDataset with consistent application 2025-02-08 23:59:45 +01:00
Andreas Koepf
1795cd815c add rotate, mirror & color-mapping augmentation functions 2025-02-08 23:51:38 +01:00
Andreas Koepf (aider)
f72bd8d6a5 test: Add comprehensive unit tests for ArcAgiDataset 2025-02-08 23:20:45 +01:00
Andreas Koepf
4e49806d22 add ArcAgiDataset class, fix score_entry() metadata params 2025-02-08 23:18:18 +01:00
Andreas Koepf
2ad0965fdc move arc_1d into from cognition into arc folder 2025-02-08 19:37:26 +01:00
Andreas Koepf
15fecda148 clarify number_filtering task 2025-02-08 19:32:45 +01:00
Andreas Koepf
aaa89b0b3f update gallery spiral_matrix 2025-02-08 19:15:26 +01:00
Andreas Köpf
8512a024fd Merge pull request #85 from zafstojano/env/spiral-matrix
Spiral Matrix
2025-02-08 19:14:02 +01:00
Andreas Koepf
8160665990 remove unnecessary newline from arc prompt 2025-02-08 19:12:41 +01:00
Andreas Koepf
052c983cd5 re-arc cleanup 2025-02-08 19:07:28 +01:00
Zafir Stojanovski
df896b0f6c signle digit numbers, better explanation, max_cols == max_rows == max_n 2025-02-08 18:53:25 +01:00
Zafir Stojanovski
6815a892e2 Merge branch 'main' of https://github.com/open-thought/reasoning-gym into env/spiral-matrix 2025-02-08 18:52:45 +01:00
Andreas Köpf
9fe245200c Merge pull request #88 from joesharratt1229/feat/re-arc
Feat/re arc
2025-02-08 18:20:17 +01:00
Andreas Köpf
401348ea8c Merge pull request #87 from zafstojano/env/rotate-matrix
Rotate Matrix k times
2025-02-08 17:46:58 +01:00
Andreas Köpf
b0a133904f Merge branch 'main' into env/rotate-matrix 2025-02-08 17:42:04 +01:00
Andreas Köpf
7062158f4f Merge pull request #86 from zafstojano/env/ransom-note
Ransom Note
2025-02-08 17:39:57 +01:00
Andreas Köpf
e69c3f5e0d Merge branch 'main' into env/ransom-note 2025-02-08 17:34:31 +01:00
Andreas Koepf
7bc0d00aa9 lint 2025-02-08 17:22:55 +01:00
Andreas Koepf (aider)
38d5caa928 feat: Add inversion probability and logical equivalence to syllogisms 2025-02-08 17:14:35 +01:00
Jean Kaddour
3fec09c9ca chore: run isort 2025-02-08 15:53:29 +00:00
Jean Kaddour
5be7f51cd6 Update GALLERY.md to include Knight Swap 2025-02-08 15:49:10 +00:00
Jean Kaddour
ff6d8b08cf feat: add knight_swap 2025-02-08 15:38:45 +00:00
Andreas Koepf
1cf8981299 add CONTRIBUTING.md, simplify README.md 2025-02-08 15:59:44 +01:00