reasoning-gym/reasoning_gym/arithmetic
Andreas Köpf 5d7fbac0ad
Minor question template & score_answer improvements (#261)
* math prompt improvements
* ignore brackets in complex_arithmetic results
* improve additional instruction in prompt of polynomial_equations
* more strict tests for score_answer in polynomial_equations
* simplify special reward handling
* fix test_intermediate_integration
* fix sokoban dataset
* add common dataset score_answer consistency test
2025-03-04 21:55:09 +01:00
..
gsm_symbolic more native type hints 2025-02-21 21:23:14 +01:00
__init__.py add to init 2025-02-21 12:07:17 +01:00
basic_arithmetic.py use Decimal class for numeric comparison e.g. +0123.100 == 123.1 2025-02-21 15:36:06 +01:00
bitwise_arithmetic.py Minor question template & score_answer improvements (#261) 2025-03-04 21:55:09 +01:00
calendar_arithmetic.py Minor question template & score_answer improvements (#261) 2025-03-04 21:55:09 +01:00
chain_sum.py Remove strip from ProceduralDataset::core score_answer() (#250) 2025-03-02 08:46:36 +01:00
count_bits.py count bits (#101) 2025-02-10 22:12:50 +01:00
decimal_arithmetic.py Minor question template & score_answer improvements (#261) 2025-03-04 21:55:09 +01:00
decimal_chain_sum.py Minor question template & score_answer improvements (#261) 2025-03-04 21:55:09 +01:00
dice.py Minor question template & score_answer improvements (#261) 2025-03-04 21:55:09 +01:00
fraction_simplification.py use native types List->list, Dict->dict, Set->set, Tuple->tuple 2025-02-21 15:15:38 +01:00
gcd.py use native types List->list, Dict->dict, Set->set, Tuple->tuple 2025-02-21 15:15:38 +01:00
lcm.py use native types List->list, Dict->dict, Set->set, Tuple->tuple 2025-02-21 15:15:38 +01:00
leg_counting.py fix: Unify Prompts (#254) 2025-03-03 21:55:53 +01:00
number_format.py Minor question template & score_answer improvements (#261) 2025-03-04 21:55:09 +01:00
power_function.py Minor question template & score_answer improvements (#261) 2025-03-04 21:55:09 +01:00
prime_factorization.py use native types List->list, Dict->dict, Set->set, Tuple->tuple 2025-02-21 15:15:38 +01:00
products.py Remove strip from ProceduralDataset::core score_answer() (#250) 2025-03-02 08:46:36 +01:00
time_intervals.py Minor question template & score_answer improvements (#261) 2025-03-04 21:55:09 +01:00