reasoning-gym/reasoning_gym
Ritvik Rastogi 49b07130b3
feat: add scoring cascade for reducing false negatives (#526)
* feat: add scoring cascade for reducing false negatives in answer verification

* style: fix black and isort formatting

Run black and isort to satisfy pre-commit checks.

Made-with: Cursor

* docs: add scoring cascade example to Quickstart section

Mention the experimental scoring cascade feature at the end of the
Quickstart section with a disclaimer and complete usage examples
showing both the dataset method and standalone function.

Made-with: Cursor

* docs: shorten scoring cascade section in README

Trim to a concise standalone example per review feedback.

Made-with: Cursor

* docs: simplify scoring cascade description in README

Made-with: Cursor

* update readme

---------

Co-authored-by: Zafir Stojanovski <zaf.stojano@gmail.com>
2026-04-17 21:39:15 +02:00
..
algebra make task entries json serializable (#443) 2025-06-02 08:57:15 +02:00
algorithmic Fix/cryptarithm multiple solutions (#517) 2026-03-15 13:53:26 +01:00
arc (evals): Medium configs (#415) 2025-04-14 08:25:31 +02:00
arithmetic [fix #484] resolve basic_arithmetic fails when size is large (#485) 2025-07-07 09:46:23 +01:00
coaching Curr exp (#487) 2025-07-25 20:38:47 +01:00
code Codeio prompt fix (#513) 2025-11-13 11:48:20 +01:00
cognition fix color_cubes answer strings, update gallery with latest envs (#464) 2025-06-08 13:16:54 +02:00
data fix encoding to be able to run on win (#502) 2025-08-18 09:19:45 +01:00
games Fix impossible_ratio not being respected in knight_swap (#521) (#524) 2026-03-27 15:18:08 +00:00
geometry make task entries json serializable (#443) 2025-06-02 08:57:15 +02:00
graphs add path-star task environment (#499) 2026-03-28 01:07:49 +01:00
induction fix(envs): Add source dataset and index to metadata (#388) 2025-03-20 11:12:14 +00:00
logic feat(curriculum): Knights and Knaves configs (#488) 2025-07-31 10:18:05 +02:00
probability Add probability dataset (initial: Coin Flip dataset + curriculum) (#505) 2025-09-06 15:59:23 +01:00
__init__.py feat: add scoring cascade for reducing false negatives (#526) 2026-04-17 21:39:15 +02:00
composite.py Feat/curr adj (#394) 2025-04-02 06:39:14 +01:00
dataset.py feat: add scoring cascade for reducing false negatives (#526) 2026-04-17 21:39:15 +02:00
factory.py Feat: expose score_answer function without needing to instantiate a dataset (#422) 2025-04-18 10:36:44 +02:00
scoring.py feat: add scoring cascade for reducing false negatives (#526) 2026-04-17 21:39:15 +02:00
utils.py support python 3.10 (#450) 2025-06-04 10:34:01 +01:00
version_manager.py use native types List->list, Dict->dict, Set->set, Tuple->tuple 2025-02-21 15:15:38 +01:00