Skip to main content

Validate Outcomes

Did you deliver what you said you would?

The gap between intention and reality is the honest error signal. The builder knows what they intended. The commissioner checks what actually shipped. These are never the same person.

Commissioning closes the feedback loop. Without it, the pipeline from pain to spec produces artifacts nobody verifies. Specs accumulate. Confidence erodes. The loop runs open.

Three credibility loops run through this pipeline. The inner loop (L1-L3) proves the code works. The story loop (L3 bridge) proves your predictions match your results. The market loop (L4) proves others validate with their behavior. Market credibility is the greatest force — but it can only land on a foundation where inner and story loops are tight. The SPEC-MAP is the shared traceability artifact that keeps both inner loops honest.

Fidelity Levels

Common LanguageOur TermWhat It Means
PrototypeL0-L1Idea captured, code exists but unverified
AlphaL2Core flow works, engineer verified
BetaL3Independent commissioner verified against spec
ProductionL4External users validate with their behavior

Dig Deeper

  • Validate Internal Standards — Engineering checklist: types, tests, performance gates, security. Does the code meet its own contracts?
  • Validate Results — L0-L4 commissioning protocol. Does the deployed capability match the PRD spec? Independent verification with evidence

Commissioning Workflow

The commissioner reads. Then walks. Then records.

StepActionOutput
1Read PRD and SPEC-MAPKnow what was promised
2Open the deployed capabilitySee what exists
3Walk every feature rowHappy path, error path, edge cases
4Record evidenceScreenshot, GIF, console output, measurement
5Update SPEC-MAP L-levelGap, L0, L1, L2, L3, or L4
6Mark L4 if all features passOr document the gap and route it back

Authority

The commissioner is never the builder. Three powers:

PowerWhenEffect
HOLDFeature fails specBlocks promotion until gap is closed
Re-specSpec was wrong, not buildRoutes signal back to Dream Team
KillEffort exceeds valueRecommends PRD status change to STOP

Cadence

TriggerAction
Feature reaches L3Commission in same cycle
L4 verificationNext cycle (independent commissioner)
5 PRDs reach L2+Run integral calibration
Quarterly minimumFull scoring recalibration regardless of volume

Return Signal

Commission finds gap → SPEC-MAP updated → Dream reads gap → spec evolves → next build cycle. This IS the VVFL Reflect station. Without it, the loop runs open.

Gap TypeOwnerAction
Build gap (code doesn't match spec)EngineeringFix and re-deploy
Spec gap (spec was wrong)Dream TeamRe-spec, re-score, possibly kill
Instrument gap (can't verify)PlatformBuild the MCP or tool needed

Context

  • VVFL — The loop this validates — commissioning is the Reflect station made mechanical
  • Feature Matrix — Live commissioning status for every capability
  • Flow Engineering — The build process this validates
  • Create PRD Stories — The spec this verifies against
  • Commissioning — The principle: why independent verification matters
  • Verifiable Intent — L4 commissioning IS verifiable intent for software
  • Credibility — Commissioning evidence feeds all three credibility loops
  • Predictions — Gap between predicted and actual IS the learning signal

Questions

When the gap between spec and reality is large, is the spec wrong or the build wrong — and how do you tell?

  • At what maturity level does a capability start generating value — is L4 necessary for first customers?
  • What's the cost of the builder commissioning their own work — and how often does it happen without anyone noticing?
  • When commissioning reveals a spec gap (not a build gap), how does that signal flow back to the Dream Team?