Rich Jones
eb64e3a2b8
add random paren grouping
2025-02-20 10:46:01 +01:00
Rich Jones
a76e56fccc
implement decimal precision
2025-02-19 12:30:07 +01:00
Rich Jones
0cd2eb50d7
Merge branch 'rich/decimalmath' of github.com:open-thought/reasoning-gym into rich/decimalmath
2025-02-19 03:34:57 +01:00
Rich Jones
c3bc7937dc
[wip] more flexible api
2025-02-19 03:29:40 +01:00
Rich Jones
36b2bceda3
Merge branch 'main' into rich/decimalmath
2025-02-18 20:53:36 +01:00
Rich Jones
bd56a7b1fe
lint
2025-02-18 19:22:03 +01:00
Rich Jones
e2bdfb4c1c
decimal math
2025-02-18 19:20:33 +01:00
Andreas Koepf
aa794253fe
minor formatting changes
2025-02-17 18:20:18 +01:00
abdulhakeem
5b1e42f878
Tweaked some question templates
2025-02-17 02:58:42 -06:00
Andreas Koepf
99b49f868f
fix question templates
2025-02-16 23:04:24 +01:00
joesharratt1229
cf6c15d0ee
Merge pull request #144 from joesharratt1229/fix/arithmetic
...
Added fixes for arithmetic environments
2025-02-16 16:34:09 +00:00
joesharratt1229
fff40f4f36
adjusted gsm symbolic question template
2025-02-16 15:28:44 +00:00
joesharratt1229
72886c0668
changed products question template
2025-02-16 15:18:41 +00:00
Andreas Koepf
6bf2dfa36c
formatting
2025-02-16 16:18:39 +01:00
joesharratt1229
e00fc81ba8
added reasoning gtm gcd
2025-02-16 15:10:29 +00:00
joesharratt1229
3f731029dd
added fraction simplifications score answer impl
2025-02-16 15:08:24 +00:00
joesharratt1229
1a33fba608
fixed chain sum
2025-02-16 14:09:16 +00:00
joesharratt1229
cc7d422d78
merged remote
...
Merge remote-tracking branch 'origin/main' into feat/score-answer-impl
2025-02-16 12:03:25 +00:00
joesharratt1229
569517664f
corrected failing arithmetic test
2025-02-16 12:01:54 +00:00
joesharratt1229
a59e4cc918
reformatted prompt
2025-02-16 09:27:21 +00:00
joesharratt1229
c28688cb96
reformatted basic arithmetic question template
2025-02-16 09:07:56 +00:00
Zafir Stojanovski
b9bd7a1162
fix leg counting prompt template
2025-02-16 09:58:36 +01:00
Zafir Stojanovski
662c4e16c2
fix prompts
2025-02-15 20:59:07 +01:00
Andreas Koepf
1996ffa6d8
add ProductsDataset (multiplication tasks)
2025-02-13 17:59:02 +01:00
Andreas Koepf (aider)
583ebfd5de
feat: Add ProductsDataset with configurable terms and digits
2025-02-13 17:50:19 +01:00
Andreas Koepf
321b2a88e0
feat: Add PowerFunctionConfig and PowerFunctionDataset to arithmetic module exports
2025-02-13 17:50:17 +01:00
Andreas Koepf
c6a9dae0c6
use *args param for _define_attributes()
2025-02-12 16:59:09 +01:00
Andreas Köpf
8d917d133d
Merge pull request #110 from open-thought/rich/dice
...
Adds Dice Probability Dataset
2025-02-11 22:54:02 +01:00
Andreas Köpf
fa2a11ae56
Merge pull request #99 from open-thought/curriculum_basics
...
Add foundation for auto-curriculum
2025-02-11 22:52:14 +01:00
Rich Jones
16bf151786
clarity
2025-02-11 16:22:53 +01:00
Rich Jones
9cd4e825d4
fmt
2025-02-11 12:54:23 +01:00
Rich Jones
93a7a58023
add dice dataset
2025-02-11 12:53:13 +01:00
Andreas Koepf
eb25ab9656
update gallery, lower default config values for PowerFunctionDataset
2025-02-10 22:42:04 +01:00
Zafir Stojanovski
3e42d9588e
count bits (#101)
2025-02-10 22:12:50 +01:00
Zafir Stojanovski
178895ab1b
Power Function (#102)
...
* power function dataset + tests
2025-02-10 22:04:58 +01:00
Andreas Koepf
357a89fe8c
Add attributes for curriculum
...
Co-authored-by: EduardDurech <39579228+EduardDurech@users.noreply.github.com>
2025-02-10 18:58:07 +01:00
Zafir Stojanovski
6cc5d0dd63
normalize answer and partial reward
2025-02-09 11:13:23 +01:00
Andreas Köpf
a607db79f7
Add Coaching & ScoreBoard class (result tracking) (#72)
...
* feat: Add Coach and ScoreBoard classes for performance tracking and difficulty adjustment
* feat: Add GroupedScores class to wrap aggregated scores
* refactor: Create ScoreStats class with tuple-based score statistics
* feat: Add unit test for Coach with CompositeDataset and multiple datasets
* fix: Add difficulty metadata to leg counting dataset
* feat: Add clear() method to ScoreBoard to reset all stored data
* feat: Add __len__ method to ScoreBoard to return number of scores
* feat: Add update_dataset_config method to CompositeDataset
* cleanup __init__ & imports
2025-02-06 23:15:28 +01:00
Andreas Koepf
3ca9a709e8
gsm_symbolic generator changes
2025-02-05 20:58:01 +01:00
Andreas Koepf
94f877d17a
use explicit rng for zebra generation (not yet fully deterministic)
2025-02-04 00:00:54 +01:00
Andreas Koepf
156b09951e
black formatting
2025-02-03 22:57:24 +01:00
Adefioye
a80339a0e6
Merge branch 'main' into koko/gsm-symbolic-task-1
2025-02-03 01:23:26 -06:00
abdulhakeem
5e71fb2694
Refactor gsm symbolic dataset
2025-02-03 01:21:21 -06:00
Andreas Koepf
94eeff3255
post merge formatting
2025-02-02 15:24:39 +01:00
benjamrio
7acd4cb1e5
added calendar-arithmetic tasks
2025-02-02 14:54:32 +01:00
Andreas Koepf (aider)
2c979c3913
refactor: Use field default_factory TimeIntervalsConfig, AdvancedGeometryConfig
2025-02-02 09:55:51 +01:00
abdulhakeem
548b8fab15
Remove data
2025-02-01 20:31:44 -06:00
abdulhakeem
1e0dbc9875
Create python generator files for gsm symbolic templates
2025-02-01 20:23:30 -06:00
Andreas Koepf
90e448342f
add pull-request: write permission for pre-commit checks
2025-02-01 02:36:20 +01:00
Andreas Koepf
44f32e3862
Add time interval dataset class
2025-02-01 02:10:48 +01:00