Commit graph

6 commits

Author SHA1 Message Date
Andreas Köpf
5d7fbac0ad Minor question template & score_answer improvements (#261) 2025-03-04 21:55:09 +01:00
* math prompt improvements
* ignore brackets in complex_arithmetic results
* improve additional instruction in prompt of polynomial_equations
* more strict tests for score_answer in polynomial_equations
* simplify special reward handling
* fix test_intermediate_integration
* fix sokoban dataset
* add common dataset score_answer consistency test
ahmedsaif
14b2f1f99d Add allow-negation in products dataset 2025-02-20 12:01:09 +02:00
Andreas Koepf
044da9687f adapt unit tests to partial match changes 2025-02-14 21:30:50 +01:00
Andreas Koepf
5410bb78a0 add ProductsDataset (multiplication tasks) 2025-02-13 17:59:02 +01:00
Andreas Koepf (aider)
ce30536627 test: Add scoring tests for Products dataset 2025-02-13 17:52:32 +01:00
Andreas Koepf (aider)
bdcaeff42a feat: Add ProductsDataset with configurable terms and digits 2025-02-13 17:50:19 +01:00