Commit graph

6 commits

Author SHA1 Message Date
Andreas Köpf
b2904ccab9 Minor question template & score_answer improvements (#261)
* math prompt improvements
* ignore brackets in complex_arithmetic results
* improve additional instruction in prompt of polynomial_equations
* more strict tests for score_answer in polynomial_equations
* simplify special reward handling
* fix test_intermediate_integration
* fix sokoban dataset
* add common dataset score_answer consistency test
2025-03-04 21:55:09 +01:00
ahmedsaif
df4bb0b897 Add allow-negation in products dataset 2025-02-20 12:01:09 +02:00
Andreas Koepf
9a21e6776d adapt unit tests to partial match changes 2025-02-14 21:30:50 +01:00
Andreas Koepf
1996ffa6d8 add ProductsDataset (multiplication tasks) 2025-02-13 17:59:02 +01:00
Andreas Koepf (aider)
17485fad67 test: Add scoring tests for Products dataset 2025-02-13 17:52:32 +01:00
Andreas Koepf (aider)
583ebfd5de feat: Add ProductsDataset with configurable terms and digits 2025-02-13 17:50:19 +01:00