mirror of
https://github.com/NousResearch/atropos.git
synced 2026-04-19 12:57:58 +00:00
linting
This commit is contained in:
parent
2efb690a24
commit
e7e747a396
6 changed files with 8 additions and 9 deletions
|
|
@ -46,4 +46,4 @@ The environment implements a reward function that balances:
|
|||
2. Spatial relationship constraints
|
||||
3. Task completion verification
|
||||
|
||||
The current implementation focuses on basic spatial tasks but is designed to be extensible for more complex scenarios. The reward function is structured to prevent common reward hacking strategies by requiring both position accuracy and spatial relationship satisfaction.
|
||||
The current implementation focuses on basic spatial tasks but is designed to be extensible for more complex scenarios. The reward function is structured to prevent common reward hacking strategies by requiring both position accuracy and spatial relationship satisfaction.
|
||||
|
|
|
|||
Loading…
Add table
Add a link
Reference in a new issue