Protocol Coverage
When deploying agent commerce, every protocol interface must be provably correct — coverage maps protocols to contracts to tests so trust is computed, not claimed.
Why should I care?
Five cards that sell the dream
Same five positions. Different seat.
The customer sees trust as a guarantee. The builder sees trust as a test suite. The Commerce Authorization Chain is both the product promise and the engineering spec.
How does this get built?
Five cards that sell the process
The pitch names the gap. The flow diagrams prove the dependency. The VV stories validate each link.
Problem
49 protocols mapped across 5 domains. 20 tested (41%). Commerce Authorization Chain has untested links at every stage after USER AUTH. Verifiable Intent has zero tests. On-Chain Trust has zero tests. Dream repo documents 8 agent protocols but no bridge to engineering verification results. Trust is claimed in documentation, not computed from test evidence.
Every protocol has a Zod contract and a passing test spec. Coverage percentage computed from test results and surfaced in the feature matrix. Commerce Authorization Chain verified end-to-end: USER AUTH → INTENT CAPTURE → AGENT ACTION → SETTLEMENT → AUDIT.
29 protocols lack tests. 10 have no contracts at all. Three distinct test tiers (Orchestration, Protocol, Network) need different infrastructure. Verifiable Intent requires FIDO keys. On-Chain Trust requires Sui testnet. Payment Execution requires x402 handshake servers. Each domain has different external dependencies.
Verifiable Intent and On-Chain Trust require real external infrastructure — FIDO authenticators, Sui testnet wallets, escrow contracts. A test that mocks the protocol is not a protocol test. The hardest part is standing up real infrastructure for protocols that don't have commodity test tooling yet.
Scorecard
Priority (5P)
Readiness (5R)
What Exists
| Component | State |
|---|---|
| intents-e2e test app | Working |
| A2A Protocol tests (Domain 1) | Partial |
| Auth tests (Domain 4) | Partial |
| Stripe webhook test (Domain 3) | Working |
| Verifiable Intent (Domain 2) | Missing |
| On-Chain Trust (Domain 5) | Missing |
| Coverage computation | Missing |
| HITL oversight patterns | Missing |
| MCP documentation | Working |
| MCP servers (active) | Working |
Relationships
| PRD | Contributes |
|---|---|
| Agent Platform | Peer — Agent Platform is the PUMP (identity, memory, scaffolds). Protocol Coverage is the GAUGE (trust verification). Different feature IDs, same foundation. |
| Automated Commissioning | Enables — Protocol Coverage produces test results that Automated Commissioning consumes to compute feature states. |
| CLI Platform | Depends on — test commands and coverage reporting run through the unified CLI. |
| Prediction Game (Sui) | Prediction Game needs PROT-005 (On-Chain Trust) for atomic settlement. |
| Sales CRM | Sales CRM needs PROT-003 (Payment Execution) for Stripe + payment intent flows. |
After 60 days of active development, if protocol coverage has not moved from 41% to 60%, the contract-first approach is too slow. Switch to integration-test-only coverage or re-scope to Commerce Authorization Chain only.
Questions
If trust is computed not claimed, what happens when a protocol test fails in production — does the system degrade gracefully or does the whole Commerce Authorization Chain break?
- Which of the 29 untested protocols would cause the most damage if it failed silently in a real transaction?
- Is contract-first testing (Zod schemas before specs) faster or slower than writing specs first and deriving contracts?
- When Verifiable Intent requires FIDO keys, is the test proving the protocol or proving the FIDO implementation?