joesharratt1229
|
39ee099a86
|
reimplemented prop logic
|
2025-02-20 23:59:31 +00:00 |
|
joesharratt1229
|
ed10c5f9bc
|
added testing func for prop logic
|
2025-02-20 23:59:07 +00:00 |
|
Oliver
|
f16dd9a7d4
|
Better progress tracking
|
2025-02-20 23:32:54 +00:00 |
|
Oliver
|
90547f30c7
|
Set log level
|
2025-02-20 22:36:07 +00:00 |
|
Oliver
|
e50444b924
|
Add requirements for Unsloth example
|
2025-02-20 22:29:13 +00:00 |
|
Oliver
|
29d116b034
|
Call for_inference before eval
|
2025-02-20 22:27:33 +00:00 |
|
Oliver
|
a01110aa7c
|
Add eval
|
2025-02-20 22:18:52 +00:00 |
|
Andreas Koepf
|
bedee59616
|
fix jugs unit test
|
2025-02-20 23:09:46 +01:00 |
|
Oliver
|
e26161713e
|
Answer scoring fixes to address edge cases
|
2025-02-20 22:04:01 +00:00 |
|
Andreas Koepf
|
f1daa8d1da
|
update GALLERY.my, bump version
|
2025-02-20 23:03:54 +01:00 |
|
Oliver
|
8bd18f5387
|
Add minimal unsloth GRPO example
|
2025-02-20 22:03:40 +00:00 |
|
Andreas Köpf
|
cd1a989530
|
Merge pull request #177 from zafstojano/env/rotten-oranges
feat(env): Rotten Oranges
|
2025-02-20 22:51:14 +01:00 |
|
Andreas Köpf
|
000e179781
|
Merge branch 'main' into env/rotten-oranges
|
2025-02-20 22:51:07 +01:00 |
|
Andreas Köpf
|
a8ce2747c1
|
Merge pull request #172 from open-thought/rich/jugs
Add Water Jug Puzzles
|
2025-02-20 22:48:12 +01:00 |
|
Andreas Koepf
|
c236bcf1b6
|
store possible answer in entry 'answer' field
|
2025-02-20 22:47:21 +01:00 |
|
Andreas Koepf (aider)
|
9068e1d1b1
|
docs: Add descriptive comments for num_jugs and difficulty parameters
|
2025-02-20 22:39:22 +01:00 |
|
Andreas Koepf (aider)
|
ece040a0d0
|
feat: Add type hints to generate_puzzle and min_moves_n functions
|
2025-02-20 22:38:11 +01:00 |
|
Andreas Koepf
|
d7db2ed838
|
refactor: Simplify jug puzzle dataset generation and solution verification
|
2025-02-20 22:38:10 +01:00 |
|
Zafir Stojanovski
|
8709ec1ca3
|
remove empty space
|
2025-02-20 22:35:53 +01:00 |
|
Zafir Stojanovski
|
0d65bf3668
|
rotten oranges
|
2025-02-20 22:33:39 +01:00 |
|
Andreas Köpf
|
994ffa8459
|
Merge pull request #170 from open-thought/rich/needle
Adds Needle in a Haystack problems
|
2025-02-20 22:12:47 +01:00 |
|
Oliver
|
b7ee70995e
|
Prompt tweak for code preprocessing
|
2025-02-20 20:07:32 +00:00 |
|
Oliver
|
6f9b81b879
|
Add initial CodeI/O experiment notebook
|
2025-02-20 20:03:36 +00:00 |
|
AhmedSaif2
|
73dfaa7f58
|
remove added instructions
|
2025-02-20 21:29:01 +02:00 |
|
AhmedSaif2
|
dcdef3f9ec
|
Add score answer to support comma format
|
2025-02-20 20:52:31 +02:00 |
|
Rich Jones
|
2188c53308
|
wiggle imports
|
2025-02-20 16:23:40 +01:00 |
|
Rich Jones
|
0f798457ed
|
jugs jugs jugs lint
|
2025-02-20 16:15:29 +01:00 |
|
Rich Jones
|
6f00690ae1
|
basic jugs
|
2025-02-20 15:24:46 +01:00 |
|
Rich Jones
|
39b739917e
|
move assert to config
|
2025-02-20 12:45:09 +01:00 |
|
Andreas Köpf
|
e25973b118
|
Merge pull request #158 from open-thought/rich/decimalmath
Decimal Arithmetic
|
2025-02-20 12:35:41 +01:00 |
|
Rich Jones
|
2eaffb9837
|
lint
|
2025-02-20 12:29:12 +01:00 |
|
Rich Jones
|
621c20d8d8
|
adds Needle in a Haystack problems
|
2025-02-20 12:28:30 +01:00 |
|
Rich Jones
|
0b7c221cb6
|
type hints
blind roboting
|
2025-02-20 12:10:28 +01:00 |
|
Andreas Koepf
|
147088051d
|
exclude PropositionalLogicDataset from auto-import (needs to be improved)
|
2025-02-20 12:08:48 +01:00 |
|
Andreas Koepf
|
50eab616f8
|
use correct signature for CryptarithmDataset.score_answer() method
|
2025-02-20 11:55:32 +01:00 |
|
Andreas Köpf
|
8b03bf6a92
|
Merge pull request #169 from AhmedSaif2/products
Add allow-negation in products dataset
|
2025-02-20 11:30:21 +01:00 |
|
Andreas Köpf
|
6bbe8886e5
|
Merge pull request #155 from theblackcat102/cryptarithm
Cryptarithm add score_answer function
|
2025-02-20 11:28:03 +01:00 |
|
Andreas Köpf
|
f9f3cc518b
|
Merge pull request #168 from zafstojano/fix/contributing-docs
fix(docs): CONTRIBUTING.md
|
2025-02-20 11:26:13 +01:00 |
|
Rich Jones
|
a9bbdd292a
|
rc gallery format
|
2025-02-20 11:26:05 +01:00 |
|
Rich Jones
|
445b63bbf2
|
fix weird GoL fmt
|
2025-02-20 11:09:29 +01:00 |
|
ahmedsaif
|
14b2f1f99d
|
Add allow-negation in products dataset
|
2025-02-20 12:01:09 +02:00 |
|
Rich Jones
|
da9b215278
|
add to init
|
2025-02-20 10:51:00 +01:00 |
|
Rich Jones
|
0f965c157f
|
Merge branch 'rich/decimalmath' of github.com:open-thought/reasoning-gym into rich/decimalmath
|
2025-02-20 10:46:13 +01:00 |
|
Rich Jones
|
556860d5b1
|
add random paren grouping
|
2025-02-20 10:46:01 +01:00 |
|
Rich Jones
|
edba52d2a2
|
Merge branch 'main' into rich/decimalmath
|
2025-02-20 10:07:28 +01:00 |
|
theblackcat102
|
44559aac95
|
[fix] precommit not happy
|
2025-02-20 17:00:18 +08:00 |
|
theblackcat102
|
407b212326
|
[feat] remove answer parsing since its already handled
|
2025-02-20 16:57:51 +08:00 |
|
Zafir Stojanovski
|
d557b1b4f9
|
contribution updates
|
2025-02-20 09:54:26 +01:00 |
|
Andreas Köpf
|
6cd8da9338
|
Merge pull request #166 from joesharratt1229/fix/envs
Fix/envs
|
2025-02-20 00:40:27 +01:00 |
|
Andreas Köpf
|
47edc62de8
|
Merge pull request #167 from vncntt/decimal_chain_sum
Register decimal chain sum
|
2025-02-20 00:37:56 +01:00 |
|