mirror of
https://github.com/open-thought/reasoning-gym.git
synced 2026-04-19 12:58:07 +00:00
* configs * reduce complexity of curriculum * update lower bound * add failure threshold * update last_k * update thresholds for success and failure * update curriculum file as well * update run name for noncurriculum * lint * dtype model eval * return binary scoring * set eval repeats to 3 * fix tests |
||
|---|---|---|
| .. | ||
| curriculum | ||
| inter_generalisation | ||
| intra-generalisation | ||
| lmeh | ||
| evaluate_model.py | ||