reasoning-gym/reasoning_gym
Andreas Köpf b2904ccab9 Minor question template & score_answer improvements (#261)
* math prompt improvements
* ignore brackets in complex_arithmetic results
* improve additional instruction in prompt of polynomial_equations
* more strict tests for score_answer in polynomial_equations
* simplify special reward handling
* fix test_intermediate_integration
* fix sokoban dataset
* add common dataset score_answer consistency test
2025-03-04 21:55:09 +01:00
algebra Minor question template & score_answer improvements (#261) 2025-03-04 21:55:09 +01:00
algorithmic Minor question template & score_answer improvements (#261) 2025-03-04 21:55:09 +01:00
arc Minor question template & score_answer improvements (#261) 2025-03-04 21:55:09 +01:00
arithmetic Minor question template & score_answer improvements (#261) 2025-03-04 21:55:09 +01:00
coaching use native types List->list, Dict->dict, Set->set, Tuple->tuple 2025-02-21 15:15:38 +01:00
code Minor question template & score_answer improvements (#261) 2025-03-04 21:55:09 +01:00
cognition Minor question template & score_answer improvements (#261) 2025-03-04 21:55:09 +01:00
data Move data file & load into memory on first object creation 2025-02-25 22:36:38 +00:00
games Minor question template & score_answer improvements (#261) 2025-03-04 21:55:09 +01:00
geometry fix: Unify Prompts (#254) 2025-03-03 21:55:53 +01:00
graphs Minor question template & score_answer improvements (#261) 2025-03-04 21:55:09 +01:00
induction more native type hints 2025-02-21 21:23:14 +01:00
logic Minor question template & score_answer improvements (#261) 2025-03-04 21:55:09 +01:00
__init__.py bump version, pypi release of 0.1.12 2025-02-26 18:25:16 +01:00
composite.py use native types List->list, Dict->dict, Set->set, Tuple->tuple 2025-02-21 15:15:38 +01:00
dataset.py Minor question template & score_answer improvements (#261) 2025-03-04 21:55:09 +01:00
factory.py use native types List->list, Dict->dict, Set->set, Tuple->tuple 2025-02-21 15:15:38 +01:00
utils.py Minor question template & score_answer improvements (#261) 2025-03-04 21:55:09 +01:00
version_manager.py use native types List->list, Dict->dict, Set->set, Tuple->tuple 2025-02-21 15:15:38 +01:00