Commit graph

177 commits

Author SHA1 Message Date
Rich Jones
edba52d2a2
Merge branch 'main' into rich/decimalmath 2025-02-20 10:07:28 +01:00
vncntt
992896af29
Merge branch 'open-thought:main' into decimal_chain_sum 2025-02-19 15:31:20 -08:00
vncntt
cdcd79ae2b register decimal_chain_sum 2025-02-19 15:30:30 -08:00
Andreas Köpf
1cf44267f3
Merge pull request #164 from vncntt/decimal_chain_sum
decimal_chain_sum
2025-02-20 00:14:23 +01:00
vncntt
83e5e92126 decimal refactor 2025-02-19 14:46:27 -08:00
Rich Jones
17ba950c1a implement decimal precision 2025-02-19 12:30:07 +01:00
Zafir Stojanovski
f9a3fd4ddf number format 2025-02-19 11:41:06 +01:00
vncntt
60cedc62dd decimal_chain_sum 2025-02-19 01:51:10 -08:00
Andreas Koepf
667d088f55 remove redundant assert in ChainSumConfig.validate() 2025-02-19 09:42:32 +01:00
Rich Jones
59229bd2d2 Merge branch 'rich/decimalmath' of github.com:open-thought/reasoning-gym into rich/decimalmath 2025-02-19 03:34:57 +01:00
Rich Jones
19b697c89a [wip] more flexible api 2025-02-19 03:29:40 +01:00
Rich Jones
2532cdce8e
Merge branch 'main' into rich/decimalmath 2025-02-18 20:53:36 +01:00
Rich Jones
46cdfc71cf lint 2025-02-18 19:22:03 +01:00
Rich Jones
80d8dbcc36 decimal math 2025-02-18 19:20:33 +01:00
Andreas Koepf
203394314e minor formatting changes 2025-02-17 18:20:18 +01:00
abdulhakeem
15140d06e7 Tweaked some question templates 2025-02-17 02:58:42 -06:00
Andreas Koepf
2cbaab2918 fix question templates 2025-02-16 23:04:24 +01:00
joesharratt1229
95f179f34e
Merge pull request #144 from joesharratt1229/fix/arithmetic
Added fixes for arithmetic environments
2025-02-16 16:34:09 +00:00
joesharratt1229
6b82654074 adjusted gsm symbolic question template 2025-02-16 15:28:44 +00:00
joesharratt1229
44bc04c00b changed products question template 2025-02-16 15:18:41 +00:00
Andreas Koepf
44f8d7b6af formatting 2025-02-16 16:18:39 +01:00
joesharratt1229
3ffab158c7 added reasoning gtm gcd 2025-02-16 15:10:29 +00:00
joesharratt1229
86814d260c added fraction simplifications score answer impl 2025-02-16 15:08:24 +00:00
joesharratt1229
2071ad42c2 fixed chain sum 2025-02-16 14:09:16 +00:00
joesharratt1229
73910f0f61 merged remoted
Merge remote-tracking branch 'origin/main' into feat/score-answer-impl
2025-02-16 12:03:25 +00:00
joesharratt1229
ba58ecf8ea corrected failing airthmetic test 2025-02-16 12:01:54 +00:00
joesharratt1229
52869e3f89 reformatted prompt 2025-02-16 09:27:21 +00:00
joesharratt1229
68be1f6177 reformatted basic airth question template 2025-02-16 09:07:56 +00:00
Zafir Stojanovski
881c48b145 fix leg counting prompt template 2025-02-16 09:58:36 +01:00
Zafir Stojanovski
9ca18f07e0 fix prompts 2025-02-15 20:59:07 +01:00
Andreas Koepf
5410bb78a0 add ProductsDataset (multiplication tasks) 2025-02-13 17:59:02 +01:00
Andreas Koepf (aider)
bdcaeff42a feat: Add ProductsDataset with configurable terms and digits 2025-02-13 17:50:19 +01:00
Andreas Koepf
3ead141db5 feat: Add PowerFunctionConfig and PowerFunctionDataset to arithmetic module exports 2025-02-13 17:50:17 +01:00
Andreas Koepf
ab9f781d97 use *args param for _define_attributes() 2025-02-12 16:59:09 +01:00
Andreas Köpf
411a1789b1
Merge pull request #110 from open-thought/rich/dice
Adds Dice Probability Dataset
2025-02-11 22:54:02 +01:00
Andreas Köpf
3f8380d93a
Merge pull request #99 from open-thought/curriculum_basics
Add foundation for auto-curriculum
2025-02-11 22:52:14 +01:00
Rich Jones
0a4799d99a clarity 2025-02-11 16:22:53 +01:00
Rich Jones
945207da43 fmt 2025-02-11 12:54:23 +01:00
Rich Jones
852ddfcea3 add dice dataset 2025-02-11 12:53:13 +01:00
Andreas Koepf
4abcd1f1df update gallery, lower default config values for PowerFunctionDataset 2025-02-10 22:42:04 +01:00
Zafir Stojanovski
ed10111834
count bits (#101) 2025-02-10 22:12:50 +01:00
Zafir Stojanovski
a8c39ddcfb
Power Function (#102)
* power function dataset + tests
2025-02-10 22:04:58 +01:00
Andreas Koepf
8772041afb Add attributes for curriculum
Co-authored-by: EduardDurech <39579228+EduardDurech@users.noreply.github.com>
2025-02-10 18:58:07 +01:00
Zafir Stojanovski
ef2a412c8b normalize answer and partial reward 2025-02-09 11:13:23 +01:00
Andreas Köpf
3f6b2fc807
Add Coaching & ScoreBoard class (result tracking) (#72)
* feat: Add Coach and ScoreBoard classes for performance tracking and difficulty adjustment
* feat: Add GroupedScores class to wrap aggregated scores
* refactor: Create ScoreStats class with tuple-based score statistics
* feat: Add unit test for Coach with CompositeDataset and multiple datasets
* fix: Add difficulty metadata to leg counting dataset
* feat: Add clear() method to ScoreBoard to reset all stored data
* feat: Add __len__ method to ScoreBoard to return number of scores
* feat: Add update_dataset_config method to CompositeDataset
* cleanup __init__ & imports
2025-02-06 23:15:28 +01:00
Andreas Koepf
afb95508ef gsm_symbolic generator changes 2025-02-05 20:58:01 +01:00
Andreas Koepf
04cd81dd76 use explicit rng for zebra generation (not yet fully deterministic) 2025-02-04 00:00:54 +01:00
Andreas Koepf
c8fcb6ca02 black formatting 2025-02-03 22:57:24 +01:00
Adefioye
40b5c5583c
Merge branch 'main' into koko/gsm-symbolic-task-1 2025-02-03 01:23:26 -06:00
abdulhakeem
f2b5e88390 Refactor gsm symbolic dataset 2025-02-03 01:21:21 -06:00