Commit graph

5 commits

Author             SHA1        Message                                                      Date
Rich Jones         ce3e9bebea  prompts - no reasoning in answers                            2025-02-14 12:42:24 +01:00
Zafir Stojanovski  52a56cbc4f  system prompt for structured output, and parse such outputs  2025-02-12 10:44:42 +01:00
Andreas Koepf      3ca9a709e8  gsm_symbolic generator changes                               2025-02-05 20:58:01 +01:00
Andreas Koepf      1bc56b8559  extract answer from last answer tag                          2025-01-28 16:37:19 +00:00
Andreas Koepf      655de7a7f3  add first example with OpenRLHF                              2025-01-28 14:40:06 +00:00