reasoning-gym

mirror of https://github.com/open-thought/reasoning-gym.git synced 2026-04-29 17:35:16 +00:00

Author	SHA1	Message	Date
joesharratt1229	9f9f816902	added updates	2025-03-29 08:07:57 +00:00
joesharratt1229	c952f31a61	updated missing trainer func	2025-03-26 19:51:45 +00:00
joesharratt1229	b8a2ac6ba3	removed duplicated fit	2025-03-25 04:07:59 +00:00
joesharratt1229	9335b56252	corrected small errors	2025-03-25 03:45:18 +00:00
joesharratt1229	6fa76f11b5	added curriculum	2025-03-23 20:25:42 +00:00
Oliver Stanley	eb69916c1b	initial verl training codebase (#389): fixes for latest verl; composite dataset training experiment; use stateful dataloaders to match verl changes; training readme; add formatting reward; length reward impl; standalone reasoning_gym config section; curriculum learning, new length reward, more config	2025-03-20 15:04:57 +00:00