Andreas Koepf
acde58a200
use Decimal class for numeric comparison e.g. +0123.100 == 123.1
2025-02-21 15:36:06 +01:00
Andreas Koepf
3e7ff3b084
use native types List->list, Dict->dict, Set->set, Tuple->tuple
2025-02-21 15:15:38 +01:00
AhmedSaif2
5d02064b5a
add a helper function to handle redundant code
2025-02-21 15:54:00 +02:00
AhmedSaif2
dcdef3f9ec
Add score answer to support comma format
2025-02-20 20:52:31 +02:00
Andreas Koepf
667d088f55
remove redundant assert in ChainSumConfig.validate()
2025-02-19 09:42:32 +01:00
joesharratt1229
2071ad42c2
fixed chain sum
2025-02-16 14:09:16 +00:00
Andreas Koepf
5410bb78a0
add ProductsDataset (multiplication tasks)
2025-02-13 17:59:02 +01:00
Andreas Koepf
ab9f781d97
use *args param for _define_attributes()
2025-02-12 16:59:09 +01:00
Andreas Koepf
8772041afb
Add attributes for curriculum
...
Co-authored-by: EduardDurech <39579228+EduardDurech@users.noreply.github.com>
2025-02-10 18:58:07 +01:00
Andreas Köpf
3f6b2fc807
Add Coaching & ScoreBoard class (result tracking) ( #72 )
...
* feat: Add Coach and ScoreBoard classes for performance tracking and difficulty adjustment
* feat: Add GroupedScores class to wrap aggregated scores
* refactor: Create ScoreStats class with tuple-based score statistics
* feat: Add unit test for Coach with CompositeDataset and multiple datasets
* fix: Add difficulty metadata to leg counting dataset
* feat: Add clear() method to ScoreBoard to reset all stored data
* feat: Add __len__ method to ScoreBoard to return number of scores
* feat: Add update_dataset_config method to CompositeDataset
* cleanup __init__ & imports
2025-02-06 23:15:28 +01:00
Andreas Koepf
5b35ea51a7
fix chain_sum unit test
2025-01-30 10:57:55 +01:00
Andreas Koepf (aider)
9480c18e16
test: Add comprehensive unit tests for QuantumLockDataset
2025-01-30 01:21:15 +01:00
Andreas Koepf
0dcff77b37
add reasoning_gym.create_dataset({name}, ...) global factory function
2025-01-25 00:58:34 +01:00
Andreas Koepf
e9549f2a63
pass config to ProceduralDataset base
2025-01-25 00:23:05 +01:00
Andreas Koepf (aider)
df2b8d2809
feat: Register chain_sum dataset with register_dataset function
2025-01-25 00:01:41 +01:00
Andreas Koepf
20069b2a7d
formatting
2025-01-24 10:34:07 +01:00
Andreas Koepf (aider)
d191e78a28
refactor: Inherit ChainSum from ProceduralDataset base class
2025-01-24 09:57:26 +01:00
Andreas Koepf (aider)
45330da122
refactor: Rename chain_sum to chain_sum_dataset for consistency
2025-01-23 22:27:48 +01:00
Andreas Koepf (aider)
48492c4fd8
feat: Add arithmetic_dataset() factory function to basic_arithmetic.py
2025-01-23 12:47:01 +01:00
Andreas Koepf (aider)
7cce205c5d
feat: Add iterator support to ChainSum with size-respecting iteration
2025-01-23 12:23:35 +01:00
Andreas Koepf (aider)
516d4d20d4
feat: Add special case handling for min_digits=1 in ChainSum generation
2025-01-23 12:07:56 +01:00
Andreas Koepf (aider)
4777e6b435
refactor: Move min_value and max_value calculations to __getitem__
2025-01-23 12:05:55 +01:00
Andreas Koepf (aider)
d2825f41ce
feat: Implement allow_negation to generate both positive and negative numbers in ChainSum
2025-01-23 12:01:21 +01:00
Andreas Koepf
c3bce305c1
refactor: Replace Random import with random module and update type hints
2025-01-23 12:01:20 +01:00
Andreas Koepf (aider)
4aeb76ae8c
refactor: Simplify ChainSum random number generation with base seed
2025-01-23 11:56:36 +01:00
Andreas Koepf (aider)
626fd78bda
feat: Add digit-based number range generation for chain sum tasks
2025-01-23 11:46:55 +01:00
Andreas Koepf (aider)
c8aa98f4e8
feat: Add ChainSum class for generating simple arithmetic tasks
2025-01-23 11:40:00 +01:00