Proof receipts

Shipped this week.

Sixty-four shipped artifacts across five repos in seven days. Each link below is a public PR or commit a visitor can inspect.

Total shipped: 64
Repos touched: 5
Week of: 2026-05-14
Listed receipts: 32

By repo

personalWebsite: 12
knowledge-harness: 9
agent-scorecard: 5
digital-twin: 5
stevenchouai: 1
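The per-repo counts can be checked against the "Listed receipts" figure with simple arithmetic. A minimal sketch, using only the numbers shown on this page (the variable names are illustrative):

```python
# Per-repo receipt counts as listed on this page.
by_repo = {
    "personalWebsite": 12,
    "knowledge-harness": 9,
    "agent-scorecard": 5,
    "digital-twin": 5,
    "stevenchouai": 1,
}

listed_receipts = 32  # "Listed receipts" from the summary above
repos_touched = 5     # "Repos touched" from the summary above

total = sum(by_repo.values())
print(f"{len(by_repo)} repos, {total} receipts")

# Both summary figures should agree with the breakdown.
assert total == listed_receipts
assert len(by_repo) == repos_touched
```

The breakdown covers the 32 listed receipts, not the 64 total shipped artifacts.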

2026-05-14

PR · personalWebsite

Weekly shipped receipts page

A public page that lists every shipped PR and commit from the past week, with links a visitor can verify.

Verify: Open the route /proof-chain/shipped and click any link.

2026-05-13

PR · personalWebsite

Three-agent daily builder log

Public blog post documenting the Product Owner / Builder / Reviewer loop for May 13.

Verify: Read the MDX frontmatter and compare with the day's PRs.

PR · personalWebsite

Builder pulse dashboard

A /pulse page with real engineering metrics: shipped count, repos touched, streak days.

Verify: Open /pulse and check that the numbers match the PR list.
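One way the three pulse metrics could be derived from a list of receipts is sketched below. The record shape, the sample data, and the `pulse_metrics` helper are all hypothetical; the site's actual data model is not described here.

```python
from datetime import date, timedelta

# Hypothetical receipt records: (ship date, repo). Sample data only.
receipts = [
    (date(2026, 5, 14), "personalWebsite"),
    (date(2026, 5, 13), "agent-scorecard"),
    (date(2026, 5, 13), "knowledge-harness"),
    (date(2026, 5, 12), "personalWebsite"),
    (date(2026, 5, 10), "agent-scorecard"),
]


def pulse_metrics(receipts, today):
    """Return shipped count, distinct repos, and the current daily streak."""
    ship_days = {day for day, _ in receipts}
    streak = 0
    cursor = today
    # Walk backwards from today while each day has at least one receipt.
    while cursor in ship_days:
        streak += 1
        cursor -= timedelta(days=1)
    return {
        "shipped": len(receipts),
        "repos_touched": len({repo for _, repo in receipts}),
        "streak_days": streak,
    }


metrics = pulse_metrics(receipts, today=date(2026, 5, 14))
print(metrics)
```

With this sample data the streak stops at May 12, because nothing in the list shipped on May 11.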

PR · personalWebsite

Public proof control room

A /proof-chain/control-room page showing live build status and shipped artifacts.

Verify: Open the route and verify links resolve to real PRs.

PR · agent-scorecard

Evidence intake checklist

A checklist for deciding whether an agent trace has enough evidence to score.

Verify: Run the examples and check that the checklist gates scoring.

PR · agent-scorecard

Agent trust contract proof page

A trust contract page: what the scorecard checks, what it does not, and where the boundary is.

Verify: Read the page and verify the contract matches the score logic.

PR · agent-scorecard

Failure replay proof surface

A failure replay that shows what the agent did wrong and what the scorecard caught.

Verify: Run the replay example and compare output with the trace.

Change classification gate

A gate that classifies agent changes by risk level before they land.

Verify: Read the doc and verify the classification criteria are explicit.

Capability contract demo

A capability contract: what the twin can do, what it cannot, and what needs human review.

Verify: Open the demo and verify the contract boundaries match reality.

Agent control plane demo

A control plane demo for auto-run / review / stop decisions on agent work.

Verify: Read the decision criteria and verify they are testable.

PR · knowledge-harness

Doctor JSON output

A doctor command with JSON output for programmatic health checks.

Verify: Run knowledge-harness doctor --json and verify the schema.

PR · knowledge-harness

Redact sensitive roots in run metadata

Redacts local vault paths from run metadata before public output.

Verify: Run a demo and verify no local paths appear in the output.
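The redaction idea can be sketched as a recursive pass over the metadata that replaces any sensitive root prefix with a placeholder. This is an illustration, not the knowledge-harness implementation; the `redact_paths` helper, the metadata keys, and the example paths are hypothetical.

```python
def redact_paths(value, roots, placeholder="<redacted>"):
    """Recursively replace sensitive root prefixes inside string values."""
    if isinstance(value, dict):
        return {k: redact_paths(v, roots, placeholder) for k, v in value.items()}
    if isinstance(value, list):
        return [redact_paths(v, roots, placeholder) for v in value]
    if isinstance(value, str):
        # Longest roots first, so a nested root is replaced in full
        # instead of leaving part of its path behind.
        for root in sorted(roots, key=len, reverse=True):
            value = value.replace(root, placeholder)
        return value
    # Numbers, booleans, and None pass through unchanged.
    return value


# Hypothetical run metadata with a local vault path.
metadata = {
    "vault": "/Users/example/vault",
    "files": ["/Users/example/vault/notes/a.md"],
    "exit_code": 0,
}

clean = redact_paths(metadata, roots=["/Users/example/vault"])
print(clean)
```

A check in the spirit of the Verify step is to serialize the cleaned metadata and assert that none of the root strings appear in it.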

PR · knowledge-harness

Proposal convention doc

A doc convention for proposals: context, scope, acceptance criteria, risk.

Verify: Read the doc and verify the convention is self-consistent.

PR · knowledge-harness

Prompt command read-only mode

Keeps the prompt command read-only: it shows what would run without executing.

Verify: Run knowledge-harness prompt and verify no vault writes occur.

2026-05-12

PR · knowledge-harness

Redact command metadata paths

Redacts local file paths from command metadata in public run receipts.

Verify: Run a command and verify the receipt has no local paths.

PR · personalWebsite

Agent scorecard demo route

A /demo/agent-scorecard route on the public site showing a live walkthrough.

Verify: Open the route and follow the demo steps.

PR · personalWebsite

Bug feature gate note

A blog note on when to treat a bug as a feature gate vs a defect.

Verify: Read the post and verify the framing is from real experience.

2026-05-11

PR · personalWebsite

Open-source agent proof

A page showing open-source contributions as proof of real-world agent work.

Verify: Open the page and verify the linked PRs are real.

Decision receipt demo

A decision receipt: what the agent decided, why, and what evidence it used.

Verify: Read the receipt and verify the evidence links are public.

Agent work receipt demo

An agent work receipt template showing inputs, actions, outputs, and verification.

Verify: Read the template and verify it matches the scoring rubric.

PR · agent-scorecard

Visitor scorecard walkthrough

A self-contained HTML walkthrough explaining how a trace becomes checks, a score, and a trust decision.

Verify: Open the HTML file in a browser and follow the steps.

PR · knowledge-harness

Demo evidence bundle

A demo command that bundles evidence from a vault run into a single reviewable artifact.

Verify: Run knowledge-harness demo and inspect the output bundle.

PR · knowledge-harness

Custom demo questions

Supports custom questions in the demo command for flexible proof generation.

Verify: Run knowledge-harness demo --question '...' and verify the output.

PR · knowledge-harness

Markdown receipt output

Markdown output format for demo receipts, suitable for public docs.

Verify: Run knowledge-harness demo --markdown and verify the format.

PR · knowledge-harness

JSON receipt output

JSON output format for demo receipts, suitable for programmatic use.

Verify: Run knowledge-harness demo --json and verify the schema.

2026-05-10

PR · agent-scorecard

Delegation policy simulator

A static simulator that maps agent scores to autonomy decisions.

Verify: Run the simulator with different scores and verify the output.
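A score-to-autonomy mapping of this kind can be sketched as a small threshold function. The thresholds and the 0 to 1 score range below are assumptions, and the decision labels are borrowed from the control plane demo entry on this page; the simulator's real rules may differ.

```python
# Hypothetical cut-offs; the simulator's actual thresholds are not stated here.
AUTO_RUN_MIN = 0.9
REVIEW_MIN = 0.6


def autonomy_decision(score: float) -> str:
    """Map an agent score in [0, 1] to an autonomy decision."""
    if not 0.0 <= score <= 1.0:
        raise ValueError(f"score must be between 0 and 1, got {score}")
    if score >= AUTO_RUN_MIN:
        return "auto-run"
    if score >= REVIEW_MIN:
        return "review"
    return "stop"


for score in (0.95, 0.7, 0.2):
    print(score, autonomy_decision(score))
```

Keeping the thresholds as named constants makes the policy easy to vary when running the simulator with different scores.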

Profile proof links

A GitHub profile README with proof links, 60-second proof route, and StevenOS stack.

Verify: Visit github.com/stevenchouai and verify the links resolve.

PR · personalWebsite

Builder note: A demo needs a stopwatch

A blog post arguing that demos without measurable proof are just marketing.

Verify: Read the post and verify the argument is concrete.

PR · personalWebsite

Proof-first homepage

Homepage restructured to lead with proof metrics instead of biography.

Verify: Visit the homepage and verify the first screen shows proof.

2026-05-09

PR · personalWebsite

Site quality gate CI

A CI workflow that checks site quality on every PR: lint, build, link validation.

Verify: Check the PR's CI status and verify the workflow runs.

PR · personalWebsite

Now page proof ledger

A /now page with a proof ledger: what's active, what shipped, what's next.

Verify: Open /now and verify the entries link to real artifacts.

2026-05-08

PR · personalWebsite

Proof-chain link validator

A script that validates all proof-chain links resolve to real artifacts.

Verify: Run npm run validate:proof-links and verify the output.
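The validation idea can be sketched in two steps: check that each link has the shape of a public GitHub PR or commit URL, then check that it resolves. The sketch below is in Python for illustration and is not the script behind the npm command; the URL pattern, the `validate_links` helper, and the injected `resolves` function are assumptions.

```python
import re

# Assumed link shape: a public GitHub pull request or commit URL.
GITHUB_ARTIFACT = re.compile(
    r"^https://github\.com/[\w.-]+/[\w.-]+/(pull/\d+|commit/[0-9a-f]{7,40})$"
)


def validate_links(links, resolves):
    """Return (link, reason) for every link that fails a check.

    `resolves` is a caller-supplied function returning True when the URL
    is reachable, so the network layer can be swapped out in tests.
    """
    failures = []
    for link in links:
        if not GITHUB_ARTIFACT.match(link):
            failures.append((link, "not a PR or commit URL"))
        elif not resolves(link):
            failures.append((link, "does not resolve"))
    return failures


# Placeholder URLs and a fake resolver stand in for real HTTP requests.
links = [
    "https://github.com/example/repo/pull/1",
    "https://github.com/example/repo/issues/2",
    "https://github.com/example/repo/commit/abc1234",
]
known = {"https://github.com/example/repo/pull/1"}

failures = validate_links(links, resolves=lambda url: url in known)
for link, reason in failures:
    print(f"FAIL {link}: {reason}")
```

In a real run, `resolves` would issue an HTTP request per link and treat a successful response as reachable.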

How to verify

Every link above is a public GitHub PR or commit. Click it, read the diff, check the CI status, and decide for yourself. The autonomous loop that produced these artifacts runs every 30 minutes while Steven is idle. Each artifact was independently reviewed before shipping.
