mirror of
https://github.com/open-thought/reasoning-gym.git
synced 2026-04-26 17:13:17 +00:00
add quantum lock answer format hint
This commit is contained in:
parent
63a8c94d85
commit
6ecd25c283
1 changed files with 1 additions and 1 deletions
|
|
@ -28,7 +28,7 @@ class QuantumLockDataset(ProceduralDataset):
|
|||
self._prompt_templates = [
|
||||
"""\
|
||||
In front of you are some buttons, a light, and a number. The light will toggle between red and green whenever you press a button. Each button performs a mathematical operation to the number, but the operation may depend on the state of the light.
|
||||
You must press the shortest correct sequence of buttons to reach the target value.
|
||||
You must press the shortest correct sequence of buttons to reach the target value. Your answer should be a sequence of buttons separated by '→', for example: A → B → C
|
||||
|
||||
Start: {initial_value} ({initial_state})
|
||||
Target: {target_value}
|
||||
|
|
|
|||
Loading…
Add table
Add a link
Reference in a new issue