Update README.md

2026-04-19 12:57:58 +00:00 · 2025-05-18 20:50:53 -04:00 · 2025-05-18 20:50:53 -04:00 · baa6a1feef
commit baa6a1feef
parent 320614e294
1 changed files with 2 additions and 2 deletions
--- a/environments/hack0/README.md
+++ b/environments/hack0/README.md
@ -44,7 +44,7 @@ visualization_dir: "./rubiks_visualizations/"

 ## Performance Metrics & Training (150 words)

-[View WandB Run Results](https://wandb.ai/team/project/runs/abc123)
+[View WandB Run Results]([https://wandb.ai/team/project/runs/abc123](https://wandb.ai/joshuaxjerin-uc/atropos-environments?nw=nwuserjoshuaxjerin))

 Our environment tracks several key metrics:

@ -78,4 +78,4 @@ Our reward function combines:
 3. Move efficiency compared to optimal solve
 4. Quality of reasoning in "thinking aloud" steps

-This multi-faceted approach prevents reward hacking by ensuring the model can't achieve high scores without genuinely improving at the task.
+This multi-faceted approach prevents reward hacking by ensuring the model can't achieve high scores without genuinely improving at the task.