Commit graph

27 commits

Author SHA1 Message Date
Andreas Koepf
acde58a200 use Decimal class for numeric comparison e.g. +0123.100 == 123.1 2025-02-21 15:36:06 +01:00
Andreas Koepf
3e7ff3b084 use native types List->list, Dict->dict, Set->set, Tuple->tuple 2025-02-21 15:15:38 +01:00
AhmedSaif2
5d02064b5a add a helper function to handle redundant code 2025-02-21 15:54:00 +02:00
AhmedSaif2
dcdef3f9ec Add score answer to support comma format 2025-02-20 20:52:31 +02:00
Andreas Koepf
667d088f55 remove redundant assert in ChainSumConfig.validate() 2025-02-19 09:42:32 +01:00
joesharratt1229
2071ad42c2 fixed chain sum 2025-02-16 14:09:16 +00:00
Andreas Koepf
5410bb78a0 add ProductsDataset (multiplication tasks) 2025-02-13 17:59:02 +01:00
Andreas Koepf
ab9f781d97 use *args param for _define_attributes() 2025-02-12 16:59:09 +01:00
Andreas Koepf
8772041afb Add attributes for curriculum 2025-02-10 18:58:07 +01:00
Co-authored-by: EduardDurech <39579228+EduardDurech@users.noreply.github.com>
Andreas Köpf
3f6b2fc807 Add Coaching & ScoreBoard class (result tracking) (#72) 2025-02-06 23:15:28 +01:00
* feat: Add Coach and ScoreBoard classes for performance tracking and difficulty adjustment
* feat: Add GroupedScores class to wrap aggregated scores
* refactor: Create ScoreStats class with tuple-based score statistics
* feat: Add unit test for Coach with CompositeDataset and multiple datasets
* fix: Add difficulty metadata to leg counting dataset
* feat: Add clear() method to ScoreBoard to reset all stored data
* feat: Add __len__ method to ScoreBoard to return number of scores
* feat: Add update_dataset_config method to CompositeDataset
* cleanup __init__ & imports
Andreas Koepf
5b35ea51a7 fix chain_sum unit test 2025-01-30 10:57:55 +01:00
Andreas Koepf (aider)
9480c18e16 test: Add comprehensive unit tests for QuantumLockDataset 2025-01-30 01:21:15 +01:00
Andreas Koepf
0dcff77b37 add reasoning_gym.create_dataset({name}, ...) global factory function 2025-01-25 00:58:34 +01:00
Andreas Koepf
e9549f2a63 pass config to ProceduralDataset base 2025-01-25 00:23:05 +01:00
Andreas Koepf (aider)
df2b8d2809 feat: Register chain_sum dataset with register_dataset function 2025-01-25 00:01:41 +01:00
Andreas Koepf
20069b2a7d formatting 2025-01-24 10:34:07 +01:00
Andreas Koepf (aider)
d191e78a28 refactor: Inherit ChainSum from ProceduralDataset base class 2025-01-24 09:57:26 +01:00
Andreas Koepf (aider)
45330da122 refactor: Rename chain_sum to chain_sum_dataset for consistency 2025-01-23 22:27:48 +01:00
Andreas Koepf (aider)
48492c4fd8 feat: Add arithmetic_dataset() factory function to basic_arithmetic.py 2025-01-23 12:47:01 +01:00
Andreas Koepf (aider)
7cce205c5d feat: Add iterator support to ChainSum with size-respecting iteration 2025-01-23 12:23:35 +01:00
Andreas Koepf (aider)
516d4d20d4 feat: Add special case handling for min_digits=1 in ChainSum generation 2025-01-23 12:07:56 +01:00
Andreas Koepf (aider)
4777e6b435 refactor: Move min_value and max_value calculations to __getitem__ 2025-01-23 12:05:55 +01:00
Andreas Koepf (aider)
d2825f41ce feat: Implement allow_negation to generate both positive and negative numbers in ChainSum 2025-01-23 12:01:21 +01:00
Andreas Koepf
c3bce305c1 refactor: Replace Random import with random module and update type hints 2025-01-23 12:01:20 +01:00
Andreas Koepf (aider)
4aeb76ae8c refactor: Simplify ChainSum random number generation with base seed 2025-01-23 11:56:36 +01:00
Andreas Koepf (aider)
626fd78bda feat: Add digit-based number range generation for chain sum tasks 2025-01-23 11:46:55 +01:00
Andreas Koepf (aider)
c8aa98f4e8 feat: Add ChainSum class for generating simple arithmetic tasks 2025-01-23 11:40:00 +01:00