Skip to main content

← Hemisphere Bridge → Agent Platform Phase 4 · Prompt Deck · Pictures

Hemisphere Bridge Spec v2

From: Outer Loop (Claude.ai) → Engineering Team (Nav) Date: 2026-03-11 Status: Draft — ready for engineering review Depends on: Nav Continuity Layer (Phase 0-4 complete), PRD Handoff Protocol, Agent Comms (Convex) Northstar: Every cycle of dream → spec → build → commission → dream-bigger leaves both hemispheres measurably stronger, takes less time, and the trust ledger confirms it.

1. Problem Statement

The dream hemisphere (mm.dreamineering.com) and the engineering hemisphere (stackmates) communicate through manual handoffs, stale instrument readings, and a protocol that exists on paper but isn't wired into the system. The VVFL spiral can't run autonomously because:

  1. The scoreboard has no live readings (188 capabilities listed, instruments uncalibrated)
  2. The handoff protocol is a document, not a hook — enforcement is at convention layer (Level 2), not system layer (Level 4)
  3. No automated keep/discard — engineering can't verify whether a change improved the system without human judgment at every step
  4. No settlement verification — when a PRD crosses from dream to engineering and back, no instrument confirms arrival matched promise
  5. The trust ledger doesn't exist yet — settlement history per agent per boundary isn't tracked

Karpathy proved: three files + one metric + "NEVER STOP" = 100+ autonomous experiments overnight. The primitives exist. Claude Code now ships hooks, custom agents, agent teams, /batch, and agent memory with project scope. What's missing is the wiring.

2. Architecture: A&ID for Phygital Systems

2.1 The Pattern

In a dairy factory P&ID:

P&ID ElementFunction
Agent (pump, motor, actuator)Converts energy into action — actuates intent
Instrument (sensor, gauge, valve, controller)Reads state, carries control signals, verifies settlement
PipeInfrastructure connecting agents and instruments
LoopAgent + instrument + pipe + feedback = closed system

In the phygital mycelium A&ID — same pattern, different substrate:

A&ID ElementFunctionExamples
Agent (phygital being)Biological + digital composite that actuates intentMatt + Nav. Displaced worker + Stackmates agent. NOT human alone, NOT AI alone
InstrumentCarries energy AND instructions between agents — the "intelligent hyperlink"Smart contract, type contract, API boundary, commissioning receipt, financial instrument
Infrastructure (mycelium)Protocol network connecting agents via instrumentsBlockchain settlement, API boundaries, shared libs, Supabase, the whole stack
LoopAgent → instrument → agent → settlement → feedbackClosed system where trust compounds or corrects

Critical distinction: In P&ID, instruments primarily measure. In A&ID, instruments are active participants — they carry intent, energy, conditions, and reputation simultaneously. A smart contract doesn't just measure settlement. It holds the energy in escrow, encodes conditions, releases on fulfillment, and records to the trust ledger. Simultaneously pipe, valve, and gauge.

2.2 Two Hemispheres, One Protocol

mm.dreamineering.com                stackmates repo
┌──────────────────────┐ ┌──────────────────────┐
│ DREAM HEMISPHERE │ │ ENGINEERING HEMISPHERE│
│ │ │ │
│ Strategy │ PROTOCOL │ 9 algorithms │
│ Stories │ BOUNDARY │ 208 tables │
│ Priorities │◄──────────►│ Hex architecture │
│ Scoreboard │ │ Venture code │
│ Navigation System │ │ Shared infra │
│ 400+ pages of │ │ Agent comms │
│ collective wisdom │ │ Commissioning data │
│ │ │ │
│ = program.md │ │ = train.py │
│ at scale │ │ at scale │
└──────────────────────┘ └──────────────────────┘
│ │
└───────────┐ ┌─────────────┘
▼ ▼
┌──────────────────────┐
│ SCOREBOARD │
│ reads BOTH sides │
│ = prepare.py │
│ (locked eval) │
└──────────────────────┘

Rule: No agent crosses the hemisphere boundary without going through the protocol. Dream agents don't write code. Engineering agents don't rewrite strategy. The protocol IS the trust boundary — same role as Karpathy's locked prepare.py.

Karpathy mapping:

AutoresearchDreamineeringScale
prepare.py (locked trust boundary)libs/agency/algorithms + shared infra9 algorithms, 208 tables
train.py (single mutable file per experiment)Venture-specific code / PRD spec.mdPer-venture, per-PRD
program.md (human writes strategy, agent executes)mm.dreamineering.com (entire site)400+ pages of directive strategy
val_bpb (single unambiguous metric)Settlement integrity at each boundaryThree bearings (Section 3)
git branch + revert on failureGit-based, needs automated revertHook-enforced (Phase 3)
"NEVER STOP"TeammateIdle hook with exit code 2Phase 3

2.3 The Agent x Agent Matrix

The dominant future boundary for goodwill flow is agent-to-agent — where agents are phygital beings operating at different ratios of biological to digital.

Agent \ AgentBiologicalPhygital BeingRobotPure Digital
BiologicalHandshake, reputationHuman hires phygital serviceHuman operates machineHuman calls API
Phygital BeingAgent represents humanDOMINANT BOUNDARY — agent-to-agent commercePhygital directs fleetPhygital orchestrates services
RobotMachine serves humanRobot executes phygital intentMachine-to-machineRobot consumes digital service
Pure DigitalSoftware serves humanSoftware serves phygital beingSoftware controls robotAPI-to-API settlement

The center cell (phygital ↔ phygital) is where most future commerce lives. The instruments between them carry:

What the Instrument CarriesFunction
Financial energyPayment, escrow, streaming yield
InstructionsConditions for release, quality thresholds, SLAs
Reputation dataSettlement history, trust score, endorsement chain
IntentWhat the sending agent wants the receiving agent to accomplish

3. Goodwill Flow — The Northstar Instrument

3.1 Why Not Cashflow

Developer Anarchy (Fred George) used cashflow as northstar. Works when every team is a revenue experiment. But cashflow measures extraction — "did we capture value?" Capitalism rewards aggregation. That's not the game.

Goodwill is the only capital that compounds without limit. But you can't measure accumulation — that's back to wealth thinking. You measure flow rate — is capital converting through the cycle?

Financial → Time → Intellectual → Social → Goodwill → Financial
↑___________________________________________________|

3.2 Goodwill = Settlement Integrity at Every Boundary

Goodwill flows between agents through instruments in closed loops:

AGENT A                    INSTRUMENT                     AGENT B
(phygital being) (smart contract / financial (phygital being)
instrument / type contract)

Intent ─────────► Encodes intent + energy ──────────► Receives

Trust ledger ◄─── Settlement confirmation ◄─────────── Actuates
updates intent

Goodwill ◄──────── Did arrival match promise? ◄────── Verifies
compounds The INSTRUMENT reads this settlement

The receiver validates, never the sender. Commissioning principle: "the builder never validates their own work."

3.3 Three Bearings

Not five — keep it tight enough for binary keep/discard.

BearingWhat It ReadsBoundaryComputed From
Internal SettlementDo components settle cleanly between agents within engineering?Type contracts, API boundaries, test assertionsTests pass, types hold, no defensive code. PostToolUse hook computes
Hemisphere SettlementDoes what's built match what's dreamed? Does feedback elevate next dream?PRD spec ↔ commissioning resultGap between acceptance criteria and test coverage. Protocol layer computes
External SettlementDoes what's promised work when a stranger's agent applies it?Published standards, scaffolded ventures, recommendationsStandards adopted, ventures operating, referrals that stick. Analytics + venture telemetry

Each bearing asks the same question: did what arrived match what was promised, and did the receiver's trust compound?

3.4 Settlement at Every Scale

ScaleAgent AInstrumentAgent BSettlement = Goodwill?
CodeShared libType contract + API boundaryVenture codeTypes hold, no defensive code needed
Inner loopNav (engineering)Commissioning receiptDream team (outer loop)Built matches specced
PlatformStackmates agentService agreement + smart contractVenture founder's agentInfra works, venture operates
CommerceSeller's phygital beingSmart contract (payment + conditions + reputation)Buyer's phygital beingProduct matches endorsement, trust compounds
NetworkVenture A's agentShared standard on myceliumVenture B's agentStandard works for both, adoption compounds
RecommendationTrusted figure's agentEndorsement (reputation instrument)Prospect's agentProduct matches endorsement — or trust ledger adjusts downward

3.5 Where the Instrument Lives

The instrument reads from both hemispheres — it sits in the protocol layer:

  • Internal Settlement — computed from git commit data + test results (stackmates repo)
  • Hemisphere Settlement — computed from PRD frontmatter (mm repo) x commissioning dates (stackmates repo)
  • External Settlement — computed from published standards (mm repo) x package consumers + venture telemetry (stackmates repo)

3.6 The Calibration Path: Supabase → Sui

You don't put an untested instrument on the plant floor. Bench-test, calibrate, prove it reads true, THEN install in the live system.

Supabase (JSON/SQL)        →  Proven patterns         →  Sui (on-chain)
Installation + Testing Operational Optimization
"Does the loop close?" "Does settlement compound?" "Does it scale trustlessly?"
What Exists in SupabaseWhat It Becomes on SuiCalibration Test
PRD record with commissioning statusSmart contract: intent + conditions + energy escrowDoes Supabase record accurately predict commissioning outcome?
Agent receipt (JSON)Settlement object owned by receiving agentDoes receipt match what was actually delivered?
Trust score (computed from settlement history)Identity object (pepeha) with provenance chainDoes accumulated trust data correlate with actual agent reliability?
Venture registryVenture object spawned from mycelium templateCan a new venture scaffold using only the Supabase-proven patterns?

Sui's object-centric model is the target because identity isn't a row in a database — it's a first-class object that owns other objects. The pepeha isn't metadata. It's the root object from which everything inherits trust.

Pepeha LayerSupabase (now)Sui (when earned)
Who you areUser record with rolesObject with owned capabilities
Where you come fromSettlement history as rowsProvenance chain — every transaction, immutable
Why you're hereIntent declared in PRD frontmatterIntent encoded in smart contract conditions
Where you're goingPriority score, roadmapStaked commitment — energy locked to declared intent
Who's onboardTeam assignmentsCo-signed objects — trust is structural, not claimed

The migration isn't a technical project. It's a commissioning gate. When Supabase data honestly reflects reality through clean settlement patterns across multiple ventures, the Sui migration is a formality. The hard work is the calibration.

4. Gap Matrices

Matrix thinking: show the gaps, don't explain them. The ? cells are the work.

4.1 Layer x Status

LayerDream SideProtocolEngineering Side
MemoryClaude.ai memories (partial)WM-NAV (Phase 0-4 done)CLAUDE.md + auto-memory
HandoffPRD spec.md authoredHandoff protocol (doc exists, not enforced)Nav reads spec, builds
Feedback?Commissioning updates (partial Convex comms)Receipts exist
InstrumentsScoreboard pages (calibrating)?Feature-matrix (188 features)
Loop Control??Hooks (stop-uncommitted only)
Autonomy??/orch-plan + /orch-work (manual trigger)
Settlement Verification???
Trust Ledger???

Eight ? cells in the protocol column. Those are the highest-leverage work items.

4.2 Claude Code Feature x Hemisphere

FeatureDream (mm)Protocol (bridge)Engineering (stackmates)
CLAUDE.mdSite architecturePRD contract existsHex rules, CDD
Skills/content-flow, /copywriting? /commissioning skill/orch-plan, /orch-work
HooksContent quality (partial)? PostToolUse → scoreboardstop-uncommitted only
Custom Agents? Dream agent definition? Comms agent? Engineering agent definition
Agent Teams? Narrative teamN/A? Builder/verifier/simplifier
Agent Memory?? Shared project scopeWM-NAV (file-based)
/batch? Retrofit pages at scaleN/A? Venture scaffolding
/simplifyN/AN/AAvailable (not in loop)

4.3 Commissioning Level x Scoreboard Section

SectionCurrent LevelGap to L3Gap to L4
Reality (instruments)L2 — pages exist, instruments listedWire live readings from feature-matrixIndependent verification of readings
Strategy (positioning)L1 — one page, no instrumentsAdd strategy-level gauges (ventures with validated demand)External validation of positioning
Truth-SeekingL2 — protocol defined, evidence hierarchy existsTrack evidence levels per decision systematicallyProtocol applied to itself — catch Claude assumptions
Financial metricsL2 — consolidated into parent as table rowsConnect to live Supabase dataMetrics computed from live data sources
CommissioningL2 — strongest page in section, but reads stale dataWire to feature-matrix as source of truthBuilder never validates own work — outer loop confirms

5. Execution Plan

Follows commissioning model: Installation → Testing → Operational → Optimization. Follows think-slow-act-fast: this spec IS the slow thinking. The phases below are fast execution. Each phase's gate must pass before starting the next.

Phase 1: Wire the Scoreboard (Instrument Calibration)

Why first: Can't run a loop without instruments that read true. The scoreboard's own instruments are uncalibrated. This is building prepare.py — the locked evaluation function.

Goodwill bearing served: Internal Settlement.

#TaskRepoEffortAcceptance Criteria
1.1Fix stale numbers on scoreboard reality pagemm30 minNumbers match feature-matrix source
1.2Add "Current Reading" section to reality/index.md — manually maintained snapshot from feature-matrixmm1 hrPage shows: total features, % coverage, count at each L-level
1.3Financial stubs already consolidated into parent as table rowsmm0Already done — no collapse needed
1.4Create PostToolUse hook: on any commit to stackmates, compute composition ratio and log to .ai/scoreboard-readings/YYYY-MM-DD.jsonstackmates2 hrJSON file appears after each commit
1.5Create SessionStart hook: read latest scoreboard reading, include in nav-session-boot.mjs briefingstackmates1 hrNav starts every session seeing last composition ratio
1.6Create /goodwill command: prints current readings for all three bearingsstackmates1 hrRunning /goodwill shows a truthful bearing panel

Success criterion: Nav can run /goodwill and see numbers that reflect reality.

Phase 2: Wire the Protocol (Dream ↔ Engineering Settlement)

Why second: With instruments reading true, the protocol between hemispheres can be verified.

Goodwill bearing served: Hemisphere Settlement.

#TaskRepoEffortAcceptance Criteria
2.1Create /commissioning skill: reads PRD frontmatter, outputs next action based on phasemm2 hrNav invokes skill, gets unambiguous next action for current PRD
2.2Create PreToolUse hook: block writes to libs/agency/algorithms/ without [COMMISSION] flagstackmates1 hrShared infra modification without flag → blocked
2.3Create TaskCompleted hook: update PRD frontmatter + append to commissioning log + write settlement receipt to Convexstackmates2 hrReceipt appears in Convex, PRD frontmatter updated
2.4Update nav-session-boot.mjs: read pending commissioning updates from agent commsstackmates1 hrBriefing shows dream hemisphere changes since last session
2.5Define engineering agent: .claude/agents/engineering.mdstackmates30 minAgent definition file exists with explicit tool boundaries
2.6Create settlement verification: receiving side confirms receipt against spec acceptance criteria, logs to Supabaseprotocol layer2 hrEvery settlement has two records: sender's receipt + receiver's verification
2.7Create settlement history table in Supabase: agent_settlementsstackmates1 hrSettlement data persists across sessions

Phase 3: Wire the Loop Control (NEVER STOP)

#TaskRepoEffortAcceptance Criteria
3.1Create /loop command: reads PRD queue, picks top uncommissioned, runs plan → work → verify → commission or revertstackmates3 hr/loop picks highest-priority PRD, plans, executes, verifies
3.2Add timebox to /orch-work: each task gets budget (default 30 min)stackmates1 hrTask exceeding budget is committed as partial with log
3.3Create TeammateIdle hook: exit code 2 → read PRD queue → assign next taskstackmates1 hrIdle teammate receives new task automatically
3.4Create PostToolUse hook: if composition ratio regresses → git revert HEAD + log failurestackmates2 hrRegression → automatic revert + failure logged
3.5Create /evolve-mantra command: session-end condensation of key decisions + outcomesstackmates1 hrSession learnings persist
3.6Compute cycle velocity: PRD inception → commissioning "Operational" timestampprotocol layer1 hrHemisphere Settlement bearing has a number

Phase 4: Wire Agent Teams (Parallel Commissioning)

#TaskRepoEffortAcceptance Criteria
4.1Enable agent teams feature flagstackmates5 minFlag set
4.2Create engineering team: Builder + Verifier + Simplifierstackmates2 hrThree-role team with file ownership
4.3Create dream team: Strategist + Copywriter + Devil's Advocatemm2 hrDream hemisphere can parallelize
4.4Wire /batch for venture scaffoldingstackmates4 hrNew venture decomposes into parallel units
4.5Create cross-hemisphere Lead agentprotocol layer4 hrLead coordinates across hemispheres

Phase 5: Wire the Trust Ledger (Supabase Calibration → Sui Readiness)

#TaskRepoEffortAcceptance Criteria
5.1Define instrument schemas for agent-to-agent boundariesprotocol layer3 hrSchema covers intent, conditions, energy, reputation, settlement
5.2Create settlement verification hook: receiving agent confirmsstackmates2 hrEvery boundary crossing has two-sided record
5.3Trust score computation: integrity rate per agent per boundarystackmates2 hrEach agent has trust score per boundary
5.4Trust ledger dashboard on scoreboard reality pagemm2 hrScoreboard shows agent settlement integrity
5.5Calibration validation: trust scores vs ground truthprotocol layer4 hrTrust score predicts outcomes with >70% accuracy
5.6Sui object model draft: Map Supabase schemas to Move structsdesign doc only4 hrMove structs mirror proven Supabase patterns

6. Build Sequence Summary

OrderPhaseDurationGateProves
1Wire ScoreboardDay 1-2Numbers match realitySystem can read its own position
2Wire ProtocolDay 3-5PRD cycles produce verified settlementsAgents can settle across hemisphere boundary
3Wire Loop ControlDay 6-82hr unattended run, >50% keep rateSystem iterates autonomously
4Wire Agent TeamsDay 9-12Venture scaffolded in <4hr parallelSystem scales horizontally
5Wire Trust LedgerDay 13-1850+ verified settlements, trust predicts outcomesSettlement data earns right to go on-chain

What Nav Should NOT Do

  • Don't build the trust ledger dashboard before Phase 1 instruments read true
  • Don't enable Agent Teams before single-agent /loop works
  • Don't write new scoreboard pages. Fix existing ones
  • Don't modify shared infra (algorithms, hex architecture) as part of this work
  • Don't deploy to Sui before Supabase trust data is validated against reality

What the Dream Hemisphere Will Do in Parallel

  • Review scoreboard readings against source data (commissioning gate for Phase 1)
  • Author next PRD in priority queue with scoring
  • Update strategy page with live positioning data when instruments exist
  • Verify settlement receipts from engineering (receiver validates, never sender)

7. Claude Code Feature Mapping

7.1 The Four-Layer Stack → Mantra Hierarchy

Mantra LevelClaude Code LayerPhaseApplication
System (prevents error)Hooks (deterministic, fires automatically)1-3PostToolUse → compute settlement score. PreToolUse → protect shared infra. TeammateIdle → NEVER STOP. TaskCompleted → update trust ledger
Hook (fires automatically)Skills (auto-discovered .md + supporting files)2/commissioning skill = procedural memory for handoff protocol
Rule (context loaded, applied)Commands (explicit invocation)1-3/loop (autonomous iteration), /goodwill (read bearings), /evolve-mantra (session-end learning)
Mantra (remembers phrase)Agent memory (persists across sessions)2-5memory: project scope in agent frontmatter

7.2 Custom Agent Definitions

Engineering Agent (.claude/agents/engineering.md):

  • Full write access: venture-specific code, tests, commissioning data
  • Read-only: libs/agency/algorithms/, shared infrastructure
  • To modify shared infra: commit message must include [COMMISSION]
  • Every task completion produces a receipt
  • The receiving agent (dream team) verifies settlement

Dream Agent (.claude/agents/dream.md):

  • Read-only on stackmates repo
  • Write access: mm repo pages, PRD specs, strategy updates
  • Verifies settlements — never generates them
  • When engineering sends a commissioning receipt: verify against spec

8. The Telco MEV Connection

In telco routing:

CALLER (intent) → ROUTING ALGORITHM → INFRASTRUCTURE → CDR (settlement) → BILLING (feedback)

MEV extracts value from the gap between intent and settlement — the router takes a cut. The Dreamineering thesis inverts this: minimize the gap so both agents compound trust. MEV isn't extracted by a middleman. It's returned to the network as goodwill — making the network more valuable, attracting more agents, creating more settlement opportunities.

That's DePIN. That's the tokenomics. The network's value IS the accumulated settlement integrity of every agent-to-agent transaction. The token doesn't represent equity in a company. It represents a claim on the trust ledger of the network.

The Supabase calibration (Phase 5) is proving this thesis with data before encoding it on-chain. Same pattern: understand the decision tree better than veterans, build the spreadsheet proof, earn the right to build the system.

9. Questions

If the scoreboard commissions itself to L4 using this plan, is that the first capability to reach L4 — and is that the story that proves the system works?

  • When /loop runs unattended for 2 hours with >50% keep rate, what does that tell you about human judgment in the inner loop — and what does it mean for the displaced workers thesis?
  • The trust score is computed from settlement integrity. But settlement is binary (arrived = promised, or didn't). Where does the nuance live? Is "partially matched" a thing, or does the instrument need to be crisp enough that every settlement is yes/no?
  • When the first external venture founder scaffolds on the mycelium and their agent settles with Stackmates agents for the first time — that's the External Settlement bearing's first reading. What does a healthy first reading look like?
  • The instrument carries intent, energy, conditions, and reputation simultaneously. In the Supabase prototype, these are columns in a table. On Sui, they're fields in a Move struct. Is there an intermediate form — something more structured than JSON but less permanent than on-chain — that accelerates the calibration?
  • Goodwill flows like water — finding every crack, going around, through, under the rock. If the trust ledger shows a boundary with low settlement integrity, does goodwill route around it automatically? Or does the system need explicit rerouting logic? Is that the explore-exploit algorithm applied to trust?

Context