Update README.md

This commit is contained in:
Aditya Mehta 2025-05-18 12:58:58 -07:00 committed by GitHub
parent f052f14484
commit b776d19f7f
No known key found for this signature in database
GPG key ID: B5690EEEBB952194

View file

@ -1,7 +1,30 @@
# Readme
a link to a 1 minute youtube video
an explanation of your env design and motivation
quickstart docs
a link to a public wandb run from process and explanations of added metrics
additional details about your env, e.g. reward hacking
##1-Minute Demo Video
Watch the demo on YouTube
##Environment Design & Motivation
> NousWhiteHouse is a reinforcement learning (RL) project focused on improving agent tool calls using the Model Context Protocol (MCP). The goal is to enable agents to dynamically discover and invoke tools more effectively, leveraging MCP for context-aware decision-making.
> After replicating RESTGPT, we noticed that LLMs struggled to find the right tools to call, such as finding Gims songs on Spotify. Instead of manually matching multiple APIs, the recent advent of Modex Context Protocol inspires us to double down on tool-calling efforts
> Our main task or challenge that our environment presented
> Why is this environment interesting or useful for RL research
> What inspired our design choices
🔖 Environment Snapshot
| Field | Entry |
| ------------- | ------------- |
| Environment Name | Content Cell |
| Short Description | Content Cell |
| Category | Content Cell |
| Dataset Needed? | Content Cell |
| External Deps | Content Cell |
| Environmental Variables | Content Cell |
| Compute Footprint Estimate | Content Cell |
Estimate
🧪 Zero-Training Test Results
Details
W&B Link:
Examples of the Environment scoring a good example and a bad example