Commit graph

27 commits

Author SHA1 Message Date
Andreas Koepf
476e37e70b use Decimal class for numeric comparison e.g. +0123.100 == 123.1 2025-02-21 15:36:06 +01:00
Andreas Koepf
ff5b210106 use native types List->list, Dict->dict, Set->set, Tuple->tuple 2025-02-21 15:15:38 +01:00
AhmedSaif2
6b5c7a8637 add a helper function to handle redundant code 2025-02-21 15:54:00 +02:00
AhmedSaif2
1da1909666 Add score answer to support comma format 2025-02-20 20:52:31 +02:00
Andreas Koepf
e9a2097a71 remove redundant assert in ChainSumConfig.validate() 2025-02-19 09:42:32 +01:00
joesharratt1229
1a33fba608 fixed chain sum 2025-02-16 14:09:16 +00:00
Andreas Koepf
1996ffa6d8 add ProductsDataset (multiplication tasks) 2025-02-13 17:59:02 +01:00
Andreas Koepf
c6a9dae0c6 use *args param for _define_attributes() 2025-02-12 16:59:09 +01:00
Andreas Koepf
357a89fe8c Add attributes for curriculum
Co-authored-by: EduardDurech <39579228+EduardDurech@users.noreply.github.com>
2025-02-10 18:58:07 +01:00
Andreas Köpf
a607db79f7 Add Coaching & ScoreBoard class (result tracking) (#72)
* feat: Add Coach and ScoreBoard classes for performance tracking and difficulty adjustment
* feat: Add GroupedScores class to wrap aggregated scores
* refactor: Create ScoreStats class with tuple-based score statistics
* feat: Add unit test for Coach with CompositeDataset and multiple datasets
* fix: Add difficulty metadata to leg counting dataset
* feat: Add clear() method to ScoreBoard to reset all stored data
* feat: Add __len__ method to ScoreBoard to return number of scores
* feat: Add update_dataset_config method to CompositeDataset
* cleanup __init__ & imports
2025-02-06 23:15:28 +01:00
Andreas Koepf
d90548658e fix chain_sum unit test 2025-01-30 10:57:55 +01:00
Andreas Koepf (aider)
a0c9c6b4fb test: Add comprehensive unit tests for QuantumLockDataset 2025-01-30 01:21:15 +01:00
Andreas Koepf
519e411fa5 add reasoning_gym.create_dataset({name}, ...) global factory function 2025-01-25 00:58:34 +01:00
Andreas Koepf
0d2d8ba6a0 pass config to ProceduralDataset base 2025-01-25 00:23:05 +01:00
Andreas Koepf (aider)
b3a2b719c7 feat: Register chain_sum dataset with register_dataset function 2025-01-25 00:01:41 +01:00
Andreas Koepf
aaabc05ace formatting 2025-01-24 10:34:07 +01:00
Andreas Koepf (aider)
d1296630b4 refactor: Inherit ChainSum from ProceduralDataset base class 2025-01-24 09:57:26 +01:00
Andreas Koepf (aider)
562dfb1813 refactor: Rename chain_sum to chain_sum_dataset for consistency 2025-01-23 22:27:48 +01:00
Andreas Koepf (aider)
8d1dac9e62 feat: Add arithmetic_dataset() factory function to basic_arithmetic.py 2025-01-23 12:47:01 +01:00
Andreas Koepf (aider)
5c5cbf3194 feat: Add iterator support to ChainSum with size-respecting iteration 2025-01-23 12:23:35 +01:00
Andreas Koepf (aider)
95fdca5a92 feat: Add special case handling for min_digits=1 in ChainSum generation 2025-01-23 12:07:56 +01:00
Andreas Koepf (aider)
e1734b0b11 refactor: Move min_value and max_value calculations to __getitem__ 2025-01-23 12:05:55 +01:00
Andreas Koepf (aider)
1ff01627cc feat: Implement allow_negation to generate both positive and negative numbers in ChainSum 2025-01-23 12:01:21 +01:00
Andreas Koepf
358829cc60 refactor: Replace Random import with random module and update type hints 2025-01-23 12:01:20 +01:00
Andreas Koepf (aider)
654bed3146 refactor: Simplify ChainSum random number generation with base seed 2025-01-23 11:56:36 +01:00
Andreas Koepf (aider)
d46f338160 feat: Add digit-based number range generation for chain sum tasks 2025-01-23 11:46:55 +01:00
Andreas Koepf (aider)
3d9b48ba31 feat: Add ChainSum class for generating simple arithmetic tasks 2025-01-23 11:40:00 +01:00