Commit graph

104 commits

Author SHA1 Message Date
Andreas Koepf
6bf2dfa36c formatting 2025-02-16 16:18:39 +01:00
joesharratt1229
cc7d422d78 merged remoted
Merge remote-tracking branch 'origin/main' into feat/score-answer-impl
2025-02-16 12:03:25 +00:00
joesharratt1229
569517664f corrected failing airthmetic test 2025-02-16 12:01:54 +00:00
joesharratt1229
a59e4cc918 reformatted prompt 2025-02-16 09:27:21 +00:00
joesharratt1229
c28688cb96 reformatted basic airth question template 2025-02-16 09:07:56 +00:00
Zafir Stojanovski
b9bd7a1162 fix leg counting prompt template 2025-02-16 09:58:36 +01:00
Zafir Stojanovski
662c4e16c2 fix prompts 2025-02-15 20:59:07 +01:00
Andreas Koepf
1996ffa6d8 add ProductsDataset (multiplication tasks) 2025-02-13 17:59:02 +01:00
Andreas Koepf (aider)
583ebfd5de feat: Add ProductsDataset with configurable terms and digits 2025-02-13 17:50:19 +01:00
Andreas Koepf
321b2a88e0 feat: Add PowerFunctionConfig and PowerFunctionDataset to arithmetic module exports 2025-02-13 17:50:17 +01:00
Andreas Koepf
c6a9dae0c6 use *args param for _define_attributes() 2025-02-12 16:59:09 +01:00
Andreas Köpf
8d917d133d Merge pull request #110 from open-thought/rich/dice
Adds Dice Probability Dataset
2025-02-11 22:54:02 +01:00
Andreas Köpf
fa2a11ae56 Merge pull request #99 from open-thought/curriculum_basics
Add foundation for auto-curriculum
2025-02-11 22:52:14 +01:00
Rich Jones
16bf151786 clarity 2025-02-11 16:22:53 +01:00
Rich Jones
9cd4e825d4 fmt 2025-02-11 12:54:23 +01:00
Rich Jones
93a7a58023 add dice dataset 2025-02-11 12:53:13 +01:00
Andreas Koepf
eb25ab9656 update gallery, lower default config values for PowerFunctionDataset 2025-02-10 22:42:04 +01:00
Zafir Stojanovski
3e42d9588e count bits (#101) 2025-02-10 22:12:50 +01:00
Zafir Stojanovski
178895ab1b Power Function (#102)
* power function dataset + tests
2025-02-10 22:04:58 +01:00
Andreas Koepf
357a89fe8c Add attributes for curriculum
Co-authored-by: EduardDurech <39579228+EduardDurech@users.noreply.github.com>
2025-02-10 18:58:07 +01:00
Zafir Stojanovski
6cc5d0dd63 normalize answer and partial reward 2025-02-09 11:13:23 +01:00
Andreas Köpf
a607db79f7 Add Coaching & ScoreBoard class (result tracking) (#72)
* feat: Add Coach and ScoreBoard classes for performance tracking and difficulty adjustment
* feat: Add GroupedScores class to wrap aggregated scores
* refactor: Create ScoreStats class with tuple-based score statistics
* feat: Add unit test for Coach with CompositeDataset and multiple datasets
* fix: Add difficulty metadata to leg counting dataset
* feat: Add clear() method to ScoreBoard to reset all stored data
* feat: Add __len__ method to ScoreBoard to return number of scores
* feat: Add update_dataset_config method to CompositeDataset
* cleanup __init__ & imports
2025-02-06 23:15:28 +01:00
Andreas Koepf
3ca9a709e8 gsm_symbolic generator changes 2025-02-05 20:58:01 +01:00
Andreas Koepf
94f877d17a use explicit rng for zebra generation (not yet fully deterministic) 2025-02-04 00:00:54 +01:00
Andreas Koepf
156b09951e black formatting 2025-02-03 22:57:24 +01:00
Adefioye
a80339a0e6 Merge branch 'main' into koko/gsm-symbolic-task-1 2025-02-03 01:23:26 -06:00
abdulhakeem
5e71fb2694 Refactor gsm symbolic dataset 2025-02-03 01:21:21 -06:00
Andreas Koepf
94eeff3255 post merge formatting 2025-02-02 15:24:39 +01:00
benjamrio
7acd4cb1e5 added calendar-arithmetic tasks 2025-02-02 14:54:32 +01:00
Andreas Koepf (aider)
2c979c3913 refactor: Use field default_factory TimeIntervalsConfig, AdvancedGeometryConfig 2025-02-02 09:55:51 +01:00
abdulhakeem
548b8fab15 Remove data 2025-02-01 20:31:44 -06:00
abdulhakeem
1e0dbc9875 Create python generator files for gsm symbolic templates 2025-02-01 20:23:30 -06:00
Andreas Koepf
90e448342f add pull-request: write permission for pre-commit checks 2025-02-01 02:36:20 +01:00
Andreas Koepf
44f32e3862 Add time interval dataset class 2025-02-01 02:10:48 +01:00
abdulhakeem
ff13dc6825 Still experimenting 2025-01-30 23:51:31 -06:00
Andreas Koepf
d90548658e fix chain_sum unit test 2025-01-30 10:57:55 +01:00
Andreas Koepf (aider)
a0c9c6b4fb test: Add comprehensive unit tests for QuantumLockDataset 2025-01-30 01:21:15 +01:00
Andreas Koepf
ad9f0d265c fix unit tests, lower python dependency to 3.9 2025-01-26 16:55:17 +01:00
Andreas Koepf
519e411fa5 add reasoning_gym.create_dataset({name}, ...) global factory function 2025-01-25 00:58:34 +01:00
Andreas Koepf
0d2d8ba6a0 pass config to ProceduralDataset base 2025-01-25 00:23:05 +01:00
Andreas Koepf (aider)
b3a2b719c7 feat: Register chain_sum dataset with register_dataset function 2025-01-25 00:01:41 +01:00
Andreas Koepf
5c5d46b4bd formatting, cleanup 2025-01-24 17:12:42 +01:00
Andreas Koepf
9d436af2b9 Add support for integer division in basic arithmetic dataset 2025-01-24 15:07:31 +01:00
Andreas Koepf (aider)
3ac07f6558 feat: Add support for integer division in BasicArithmeticDataset 2025-01-24 11:30:38 +01:00
Andreas Koepf (aider)
88fe366ef6 refactor: Simplify GCD number generation logic with break statement 2025-01-24 11:10:06 +01:00
Andreas Koepf (aider)
1bf62cbac2 refactor: Simplify LCM number generation with break and single return 2025-01-24 11:08:30 +01:00
Andreas Koepf
aaabc05ace formatting 2025-01-24 10:34:07 +01:00
Andreas Koepf
0e9250bce0 Rename ArithmeticDataset to BasicArithmeticDataset 2025-01-24 10:31:26 +01:00
Andreas Koepf (aider)
4817c33b65 refactor: Inherit LegCountingDataset from ProceduralDataset 2025-01-24 10:19:11 +01:00
Andreas Koepf
fef97fd2bb refactor: Simplify imports in leg_counting.py 2025-01-24 10:19:10 +01:00