Skip to content

Architecture

StevenOS: one system, 9 repos, five layers.

Every project in this portfolio rolls up into a single Personal AI Operating System. This map shows how they connect — from infrastructure plumbing to the public proof a visitor can inspect.

visitor sees this firstinfrastructure underneath

Evaluation

Quality gates that decide whether agents deserve more trust.

How data flows through the stack

Knowledge → Runtime: The digital-twin identity files and knowledge-harness vault provide context to agents before they act.

Runtime → Evaluation: Agent work produces traces. The scorecard inspects those traces for tool use, verification, and durable artifacts.

Evaluation → Public Proof: Passing checks produce receipts, build-log entries, and shipped-receipt pages that visitors can verify independently.

Infrastructure → Runtime: CLIProxyAPI routes model requests. Manage-up provides manager-facing MCP skills. Both feed into the agent runtime.

What is not shown

Production work systems, internal tooling, private vault content, and employer-specific code are intentionally excluded. The map shows only public-safe repos and contributions.