diff --git a/README.md b/README.md
index b9e5ebc..4a278aa 100644
--- a/README.md
+++ b/README.md
@@ -1,14 +1,16 @@
-# YC-Bench
+# YC-Bench logo YC-Bench
 
 A long-horizon deterministic benchmark for LLM agents. The agent plays CEO of an AI startup over a simulated 1–3 year run, operating exclusively through a CLI tool against a SQLite-backed discrete-event simulation.
 
-The benchmark tests whether agents can manage compounding decisions: prestige specialisation, employee allocation, cash flow, and deadline risk — sustained over hundreds of turns.
+The benchmark tests whether agents can manage compounding decisions: prestige specialisation, employee allocation, cash flow, and deadline risk - sustained over hundreds of turns.
 
 ---
 
 ## Simulation Dynamics
 
-```
+![YC Bench Architecture](imgs/arch.png "Architecture YC-Bench")
+
+
 ### Core loop
diff --git a/imgs/arch.png b/imgs/arch.png
new file mode 100644
index 0000000..fb33770
Binary files /dev/null and b/imgs/arch.png differ
diff --git a/imgs/yc_bench.png b/imgs/yc_bench.png
new file mode 100644
index 0000000..cc026e2
Binary files /dev/null and b/imgs/yc_bench.png differ