Commit graph

6 commits

Author SHA1 Message Date
Andreas Koepf
9d436af2b9 Add support for integer division in basic arithmetic dataset 2025-01-24 15:07:31 +01:00
Andreas Koepf
aaabc05ace formatting 2025-01-24 10:34:07 +01:00
Andreas Koepf
0e9250bce0 Rename ArithmeticDataset to BasicArithmeticDataset 2025-01-24 10:31:26 +01:00
Andreas Koepf (aider)
05ba566a51 feat: Add iteration support to ArithmeticDataset with size-based termination 2025-01-23 12:34:54 +01:00
Andreas Koepf (aider)
6b41e3cc47 fix: Prevent infinite loop in arithmetic dataset space generation 2025-01-23 11:34:18 +01:00
Andreas Koepf (aider)
473a2be31d feat: Unify arithmetic task generation with configurable dataset class
This commit introduces a new `ArithmeticDataset` class that:
- Combines complex and simple arithmetic task generation approaches
- Provides configurable task generation via `ArithmeticDatasetConfig`
- Supports deterministic task generation
- Implements dataset interface for easy use with HuggingFace datasets
- Adds comprehensive test coverage for the new implementation

Changes include:
- Refactored `basic_arithmetic.py` to use a unified dataset approach
- Added configuration validation and flexible generation options
- Created test suite to validate dataset behavior
- Removed file generation methods in favor of in-memory generation
2025-01-23 11:30:09 +01:00