reasoning-gym

mirror of https://github.com/open-thought/reasoning-gym.git synced 2026-04-22 16:49:06 +00:00

Author	SHA1	Message	Date
joesharratt1229	d0ef136d5b	Feat/intragen experiments (#414 ) * added curriculum * readapted readme * corrected small errors * Delete eval/eval/r1/algorithmic/word_sorting.json * removed redundant argument * added spell * removed duplicated fit * changed config * added composite changes * added composite changes * updated yaml * added spell backward * updated read me * added qwen2.5 * added * Add files via upload * updated missing trainer func * updated curr * updated spell back * updated correctness score func * updated configs * added local evals * added updates * updated datasets * added fsdp to hf utility * added algorithmic qwen 3b yaml * updated read me * updated configs * added preappend token * updated with thinking token * updated test score board * resolved comments * added evaluation scripts * removed results from pr * added config * added partial reward scoring * added evaluation composites * added training configs * added games eval * added rubriks cube * resolved merge cinflicts * added games config * added latest eval configs * updated strucutre * Delete training/evaluations/eval_graphs_composite.yaml --------- Co-authored-by: joesharratt1229 <joesharrat1229@gmail.com>	2025-04-16 08:04:52 +02:00
Zafir Stojanovski	dced3bfc45	fix(curriculum): Make boundaries in curriculum more sensible (#407 ) * init * fix tests * unify codeio * filtered for libraries not present in reasoning-gym * fix more bounds * puzzle24 * knight swap curriculum * fix number sorting * fix attributes * add validation of config in creation of dataset * dry run for instantiating and validating the datasets * remove unused imports * fix curriculum tests to reference newly updated attribute names	2025-04-04 20:24:14 +02:00
Zafir Stojanovski	ce0a6c4878	fix(envs): Add source dataset and index to metadata (#388 ) * add source dataset and index to metadata * fix typo * fix coach class and its test	2025-03-20 11:12:14 +00:00
Oliver Stanley	7475a20700	include ranges rather than sampled values in difficulty metadata dicts (#387 ) * update difficulty metadata for logic datasets * update difficulty metadata for graph datasets * update difficulty metadata for geometry datasets * update difficulty metadata for games datasets * update difficulty metadata for cognition datasets * update difficulty metadata for arithmetic datasets * update difficulty metadata for arc datasets * update difficulty metadata for algorithmic datasets * update difficulty metadata for algebra datasets * use tuples * update tests * update tests	2025-03-20 10:27:03 +01:00
Andreas Köpf	d2c895f1d3	Refactor Curriculum Attributes (#335 ) * remove min_value from AttributeDefinition * remove type from AttributeDefinition * Add CurriculumContext * add ensure_interval option for RangeAttributes * docs: Add legend explaining curriculum indicators in dataset gallery * update GALLERY.md	2025-03-16 15:40:28 +01:00
joesharratt1229	0dce7adbad	Curriculum/cognition (#314 ) * added rectangle count curriculum * added number sequences * registered curriculum	2025-03-11 00:10:28 +01:00
Andreas Köpf	5d7fbac0ad	Minor question template & score_answer improvements (#261 ) * math prompt improvements * ignore brackets in complex_arithmetic results * improve additional instruction in prompt of polynomial_equations * more strict tests for score_answer in polynomial_equations * simplify special reward handling * fix test_intermediate_integration * fix sokoban dataset * add common dataset score_answer consistency test	2025-03-04 21:55:09 +01:00
Zafir Stojanovski	01e1c8f9af	fix: Unify Prompts (#254 ) * remove cot * fix prompt template * fix pool matrix * spiral matrix fixed	2025-03-03 21:55:53 +01:00
Andreas Koepf	3e7ff3b084	use native types List->list, Dict->dict, Set->set, Tuple->tuple	2025-02-21 15:15:38 +01:00
Rich Jones	a9bbdd292a	rc gallery format	2025-02-20 11:26:05 +01:00
Zafir Stojanovski	c344a4de35	fix prompt	2025-02-17 16:12:50 +01:00
Rich Jones	c2fb8bb6cc	add rectangle count dataset	2025-02-11 13:56:27 +01:00

12 commits