← Hemisphere Bridge → Agent Platform Phase 4 · Prompt Deck · Pictures

Hemisphere Bridge Spec v2

From: Outer Loop (Claude.ai) → Engineering Team (Nav) Date: 2026-03-11 Status: Draft — ready for engineering review Depends on: Nav Continuity Layer (Phase 0-4 complete), PRD Handoff Protocol, Agent Comms (Convex) Northstar: Every cycle of dream → spec → build → commission → dream-bigger leaves both hemispheres measurably stronger, takes less time, and the trust ledger confirms it.

1. Problem Statement

The dream hemisphere (mm.dreamineering.com) and the engineering hemisphere (stackmates) communicate through manual handoffs, stale instrument readings, and a protocol that exists on paper but isn't wired into the system. The VVFL spiral can't run autonomously because:

The scoreboard has no live readings (188 capabilities listed, instruments uncalibrated)
The handoff protocol is a document, not a hook — enforcement is at convention layer (Level 2), not system layer (Level 4)
No automated keep/discard — engineering can't verify whether a change improved the system without human judgment at every step
No settlement verification — when a PRD crosses from dream to engineering and back, no instrument confirms arrival matched promise
The trust ledger doesn't exist yet — settlement history per agent per boundary isn't tracked

Karpathy proved: three files + one metric + "NEVER STOP" = 100+ autonomous experiments overnight. The primitives exist. Claude Code now ships hooks, custom agents, agent teams, /batch, and agent memory with project scope. What's missing is the wiring.

2. Architecture: A&ID for Phygital Systems

2.1 The Pattern

In a dairy factory P&ID:

P&ID Element	Function
Agent (pump, motor, actuator)	Converts energy into action — actuates intent
Instrument (sensor, gauge, valve, controller)	Reads state, carries control signals, verifies settlement
Pipe	Infrastructure connecting agents and instruments
Loop	Agent + instrument + pipe + feedback = closed system

In the phygital mycelium A&ID — same pattern, different substrate:

A&ID Element	Function	Examples
Agent (phygital being)	Biological + digital composite that actuates intent	Matt + Nav. Displaced worker + Stackmates agent. NOT human alone, NOT AI alone
Instrument	Carries energy AND instructions between agents — the "intelligent hyperlink"	Smart contract, type contract, API boundary, commissioning receipt, financial instrument
Infrastructure (mycelium)	Protocol network connecting agents via instruments	Blockchain settlement, API boundaries, shared libs, Supabase, the whole stack
Loop	Agent → instrument → agent → settlement → feedback	Closed system where trust compounds or corrects

Critical distinction: In P&ID, instruments primarily measure. In A&ID, instruments are active participants — they carry intent, energy, conditions, and reputation simultaneously. A smart contract doesn't just measure settlement. It holds the energy in escrow, encodes conditions, releases on fulfillment, and records to the trust ledger. Simultaneously pipe, valve, and gauge.

2.2 Two Hemispheres, One Protocol

mm.dreamineering.com                stackmates repo
┌──────────────────────┐            ┌──────────────────────┐
│   DREAM HEMISPHERE   │            │ ENGINEERING HEMISPHERE│
│                      │            │                      │
│  Strategy            │  PROTOCOL  │  9 algorithms        │
│  Stories             │  BOUNDARY  │  208 tables           │
│  Priorities          │◄──────────►│  Hex architecture    │
│  Scoreboard          │            │  Venture code        │
│  Navigation System   │            │  Shared infra        │
│  400+ pages of       │            │  Agent comms         │
│  collective wisdom   │            │  Commissioning data  │
│                      │            │                      │
│  = program.md        │            │  = train.py          │
│    at scale          │            │    at scale          │
└──────────────────────┘            └──────────────────────┘
          │                                    │
          └───────────┐          ┌─────────────┘
                      ▼          ▼
               ┌──────────────────────┐
               │    SCOREBOARD        │
               │    reads BOTH sides  │
               │    = prepare.py      │
               │      (locked eval)   │
               └──────────────────────┘

Rule: No agent crosses the hemisphere boundary without going through the protocol. Dream agents don't write code. Engineering agents don't rewrite strategy. The protocol IS the trust boundary — same role as Karpathy's locked prepare.py.

Karpathy mapping:

Autoresearch	Dreamineering	Scale
`prepare.py` (locked trust boundary)	libs/agency/algorithms + shared infra	9 algorithms, 208 tables
`train.py` (single mutable file per experiment)	Venture-specific code / PRD spec.md	Per-venture, per-PRD
`program.md` (human writes strategy, agent executes)	mm.dreamineering.com (entire site)	400+ pages of directive strategy
`val_bpb` (single unambiguous metric)	Settlement integrity at each boundary	Three bearings (Section 3)
`git branch + revert on failure`	Git-based, needs automated revert	Hook-enforced (Phase 3)
`"NEVER STOP"`	TeammateIdle hook with exit code 2	Phase 3

2.3 The Agent x Agent Matrix

The dominant future boundary for goodwill flow is agent-to-agent — where agents are phygital beings operating at different ratios of biological to digital.

Agent \ Agent	Biological	Phygital Being	Robot	Pure Digital
Biological	Handshake, reputation	Human hires phygital service	Human operates machine	Human calls API
Phygital Being	Agent represents human	DOMINANT BOUNDARY — agent-to-agent commerce	Phygital directs fleet	Phygital orchestrates services
Robot	Machine serves human	Robot executes phygital intent	Machine-to-machine	Robot consumes digital service
Pure Digital	Software serves human	Software serves phygital being	Software controls robot	API-to-API settlement

The center cell (phygital ↔ phygital) is where most future commerce lives. The instruments between them carry:

What the Instrument Carries	Function
Financial energy	Payment, escrow, streaming yield
Instructions	Conditions for release, quality thresholds, SLAs
Reputation data	Settlement history, trust score, endorsement chain
Intent	What the sending agent wants the receiving agent to accomplish

3. Goodwill Flow — The Northstar Instrument

3.1 Why Not Cashflow

Developer Anarchy (Fred George) used cashflow as northstar. Works when every team is a revenue experiment. But cashflow measures extraction — "did we capture value?" Capitalism rewards aggregation. That's not the game.

Goodwill is the only capital that compounds without limit. But you can't measure accumulation — that's back to wealth thinking. You measure flow rate — is capital converting through the cycle?

Financial → Time → Intellectual → Social → Goodwill → Financial
         ↑___________________________________________________|

3.2 Goodwill = Settlement Integrity at Every Boundary

Goodwill flows between agents through instruments in closed loops:

AGENT A                    INSTRUMENT                     AGENT B
(phygital being)     (smart contract / financial       (phygital being)
                      instrument / type contract)

  Intent ─────────► Encodes intent + energy ──────────► Receives
                                                         │
  Trust ledger ◄─── Settlement confirmation ◄─────────── Actuates
  updates                                                intent
                                                         │
  Goodwill ◄──────── Did arrival match promise? ◄────── Verifies
  compounds           The INSTRUMENT reads this          settlement

The receiver validates, never the sender. Commissioning principle: "the builder never validates their own work."

3.3 Three Bearings

Not five — keep it tight enough for binary keep/discard.

Bearing	What It Reads	Boundary	Computed From
Internal Settlement	Do components settle cleanly between agents within engineering?	Type contracts, API boundaries, test assertions	Tests pass, types hold, no defensive code. PostToolUse hook computes
Hemisphere Settlement	Does what's built match what's dreamed? Does feedback elevate next dream?	PRD spec ↔ commissioning result	Gap between acceptance criteria and test coverage. Protocol layer computes
External Settlement	Does what's promised work when a stranger's agent applies it?	Published standards, scaffolded ventures, recommendations	Standards adopted, ventures operating, referrals that stick. Analytics + venture telemetry

Each bearing asks the same question: did what arrived match what was promised, and did the receiver's trust compound?

3.4 Settlement at Every Scale

Scale	Agent A	Instrument	Agent B	Settlement = Goodwill?
Code	Shared lib	Type contract + API boundary	Venture code	Types hold, no defensive code needed
Inner loop	Nav (engineering)	Commissioning receipt	Dream team (outer loop)	Built matches specced
Platform	Stackmates agent	Service agreement + smart contract	Venture founder's agent	Infra works, venture operates
Commerce	Seller's phygital being	Smart contract (payment + conditions + reputation)	Buyer's phygital being	Product matches endorsement, trust compounds
Network	Venture A's agent	Shared standard on mycelium	Venture B's agent	Standard works for both, adoption compounds
Recommendation	Trusted figure's agent	Endorsement (reputation instrument)	Prospect's agent	Product matches endorsement — or trust ledger adjusts downward

3.5 Where the Instrument Lives

The instrument reads from both hemispheres — it sits in the protocol layer:

Internal Settlement — computed from git commit data + test results (stackmates repo)
Hemisphere Settlement — computed from PRD frontmatter (mm repo) x commissioning dates (stackmates repo)
External Settlement — computed from published standards (mm repo) x package consumers + venture telemetry (stackmates repo)

3.6 The Calibration Path: Supabase → Sui

You don't put an untested instrument on the plant floor. Bench-test, calibrate, prove it reads true, THEN install in the live system.

Supabase (JSON/SQL)        →  Proven patterns         →  Sui (on-chain)
Installation + Testing        Operational                 Optimization
"Does the loop close?"        "Does settlement compound?" "Does it scale trustlessly?"

What Exists in Supabase	What It Becomes on Sui	Calibration Test
PRD record with commissioning status	Smart contract: intent + conditions + energy escrow	Does Supabase record accurately predict commissioning outcome?
Agent receipt (JSON)	Settlement object owned by receiving agent	Does receipt match what was actually delivered?
Trust score (computed from settlement history)	Identity object (pepeha) with provenance chain	Does accumulated trust data correlate with actual agent reliability?
Venture registry	Venture object spawned from mycelium template	Can a new venture scaffold using only the Supabase-proven patterns?

Sui's object-centric model is the target because identity isn't a row in a database — it's a first-class object that owns other objects. The pepeha isn't metadata. It's the root object from which everything inherits trust.

Pepeha Layer	Supabase (now)	Sui (when earned)
Who you are	User record with roles	Object with owned capabilities
Where you come from	Settlement history as rows	Provenance chain — every transaction, immutable
Why you're here	Intent declared in PRD frontmatter	Intent encoded in smart contract conditions
Where you're going	Priority score, roadmap	Staked commitment — energy locked to declared intent
Who's onboard	Team assignments	Co-signed objects — trust is structural, not claimed

The migration isn't a technical project. It's a commissioning gate. When Supabase data honestly reflects reality through clean settlement patterns across multiple ventures, the Sui migration is a formality. The hard work is the calibration.

4. Gap Matrices

Matrix thinking: show the gaps, don't explain them. The ? cells are the work.

4.1 Layer x Status

Layer	Dream Side	Protocol	Engineering Side
Memory	Claude.ai memories (partial)	WM-NAV (Phase 0-4 done)	CLAUDE.md + auto-memory
Handoff	PRD spec.md authored	Handoff protocol (doc exists, not enforced)	Nav reads spec, builds
Feedback	`?`	Commissioning updates (partial Convex comms)	Receipts exist
Instruments	Scoreboard pages (calibrating)	`?`	Feature-matrix (188 features)
Loop Control	`?`	`?`	Hooks (stop-uncommitted only)
Autonomy	`?`	`?`	`/orch-plan` + `/orch-work` (manual trigger)
Settlement Verification	`?`	`?`	`?`
Trust Ledger	`?`	`?`	`?`

Eight ? cells in the protocol column. Those are the highest-leverage work items.

4.2 Claude Code Feature x Hemisphere

Feature	Dream (mm)	Protocol (bridge)	Engineering (stackmates)
CLAUDE.md	Site architecture	PRD contract exists	Hex rules, CDD
Skills	/content-flow, /copywriting	`?` `/commissioning` skill	/orch-plan, /orch-work
Hooks	Content quality (partial)	`?` PostToolUse → scoreboard	stop-uncommitted only
Custom Agents	`?` Dream agent definition	`?` Comms agent	`?` Engineering agent definition
Agent Teams	`?` Narrative team	N/A	`?` Builder/verifier/simplifier
Agent Memory	`?`	`?` Shared project scope	WM-NAV (file-based)
`/batch`	`?` Retrofit pages at scale	N/A	`?` Venture scaffolding
`/simplify`	N/A	N/A	Available (not in loop)

4.3 Commissioning Level x Scoreboard Section

Section	Current Level	Gap to L3	Gap to L4
Reality (instruments)	L2 — pages exist, instruments listed	Wire live readings from feature-matrix	Independent verification of readings
Strategy (positioning)	L1 — one page, no instruments	Add strategy-level gauges (ventures with validated demand)	External validation of positioning
Truth-Seeking	L2 — protocol defined, evidence hierarchy exists	Track evidence levels per decision systematically	Protocol applied to itself — catch Claude assumptions
Financial metrics	L2 — consolidated into parent as table rows	Connect to live Supabase data	Metrics computed from live data sources
Commissioning	L2 — strongest page in section, but reads stale data	Wire to feature-matrix as source of truth	Builder never validates own work — outer loop confirms

5. Execution Plan

Follows commissioning model: Installation → Testing → Operational → Optimization. Follows think-slow-act-fast: this spec IS the slow thinking. The phases below are fast execution. Each phase's gate must pass before starting the next.

Phase 1: Wire the Scoreboard (Instrument Calibration)

Why first: Can't run a loop without instruments that read true. The scoreboard's own instruments are uncalibrated. This is building prepare.py — the locked evaluation function.

Goodwill bearing served: Internal Settlement.

#	Task	Repo	Effort	Acceptance Criteria
1.1	Fix stale numbers on scoreboard reality page	mm	30 min	Numbers match feature-matrix source
1.2	Add "Current Reading" section to `reality/index.md` — manually maintained snapshot from feature-matrix	mm	1 hr	Page shows: total features, % coverage, count at each L-level
1.3	Financial stubs already consolidated into parent as table rows	mm	0	Already done — no collapse needed
1.4	Create PostToolUse hook: on any commit to stackmates, compute composition ratio and log to `.ai/scoreboard-readings/YYYY-MM-DD.json`	stackmates	2 hr	JSON file appears after each commit
1.5	Create SessionStart hook: read latest scoreboard reading, include in nav-session-boot.mjs briefing	stackmates	1 hr	Nav starts every session seeing last composition ratio
1.6	Create `/goodwill` command: prints current readings for all three bearings	stackmates	1 hr	Running `/goodwill` shows a truthful bearing panel

Success criterion: Nav can run /goodwill and see numbers that reflect reality.

Phase 2: Wire the Protocol (Dream ↔ Engineering Settlement)

Why second: With instruments reading true, the protocol between hemispheres can be verified.

Goodwill bearing served: Hemisphere Settlement.

#	Task	Repo	Effort	Acceptance Criteria
2.1	Create `/commissioning` skill: reads PRD frontmatter, outputs next action based on phase	mm	2 hr	Nav invokes skill, gets unambiguous next action for current PRD
2.2	Create PreToolUse hook: block writes to `libs/agency/algorithms/` without `[COMMISSION]` flag	stackmates	1 hr	Shared infra modification without flag → blocked
2.3	Create TaskCompleted hook: update PRD frontmatter + append to commissioning log + write settlement receipt to Convex	stackmates	2 hr	Receipt appears in Convex, PRD frontmatter updated
2.4	Update nav-session-boot.mjs: read pending commissioning updates from agent comms	stackmates	1 hr	Briefing shows dream hemisphere changes since last session
2.5	Define engineering agent: `.claude/agents/engineering.md`	stackmates	30 min	Agent definition file exists with explicit tool boundaries
2.6	Create settlement verification: receiving side confirms receipt against spec acceptance criteria, logs to Supabase	protocol layer	2 hr	Every settlement has two records: sender's receipt + receiver's verification
2.7	Create settlement history table in Supabase: `agent_settlements`	stackmates	1 hr	Settlement data persists across sessions

Phase 3: Wire the Loop Control (NEVER STOP)

#	Task	Repo	Effort	Acceptance Criteria
3.1	Create `/loop` command: reads PRD queue, picks top uncommissioned, runs plan → work → verify → commission or revert	stackmates	3 hr	`/loop` picks highest-priority PRD, plans, executes, verifies
3.2	Add timebox to `/orch-work`: each task gets budget (default 30 min)	stackmates	1 hr	Task exceeding budget is committed as partial with log
3.3	Create TeammateIdle hook: exit code 2 → read PRD queue → assign next task	stackmates	1 hr	Idle teammate receives new task automatically
3.4	Create PostToolUse hook: if composition ratio regresses → `git revert HEAD` + log failure	stackmates	2 hr	Regression → automatic revert + failure logged
3.5	Create `/evolve-mantra` command: session-end condensation of key decisions + outcomes	stackmates	1 hr	Session learnings persist
3.6	Compute cycle velocity: PRD inception → commissioning "Operational" timestamp	protocol layer	1 hr	Hemisphere Settlement bearing has a number

Phase 4: Wire Agent Teams (Parallel Commissioning)

#	Task	Repo	Effort	Acceptance Criteria
4.1	Enable agent teams feature flag	stackmates	5 min	Flag set
4.2	Create engineering team: Builder + Verifier + Simplifier	stackmates	2 hr	Three-role team with file ownership
4.3	Create dream team: Strategist + Copywriter + Devil's Advocate	mm	2 hr	Dream hemisphere can parallelize
4.4	Wire `/batch` for venture scaffolding	stackmates	4 hr	New venture decomposes into parallel units
4.5	Create cross-hemisphere Lead agent	protocol layer	4 hr	Lead coordinates across hemispheres

Phase 5: Wire the Trust Ledger (Supabase Calibration → Sui Readiness)

#	Task	Repo	Effort	Acceptance Criteria
5.1	Define instrument schemas for agent-to-agent boundaries	protocol layer	3 hr	Schema covers intent, conditions, energy, reputation, settlement
5.2	Create settlement verification hook: receiving agent confirms	stackmates	2 hr	Every boundary crossing has two-sided record
5.3	Trust score computation: integrity rate per agent per boundary	stackmates	2 hr	Each agent has trust score per boundary
5.4	Trust ledger dashboard on scoreboard reality page	mm	2 hr	Scoreboard shows agent settlement integrity
5.5	Calibration validation: trust scores vs ground truth	protocol layer	4 hr	Trust score predicts outcomes with >70% accuracy
5.6	Sui object model draft: Map Supabase schemas to Move structs	design doc only	4 hr	Move structs mirror proven Supabase patterns

6. Build Sequence Summary

Order	Phase	Duration	Gate	Proves
1	Wire Scoreboard	Day 1-2	Numbers match reality	System can read its own position
2	Wire Protocol	Day 3-5	PRD cycles produce verified settlements	Agents can settle across hemisphere boundary
3	Wire Loop Control	Day 6-8	2hr unattended run, >50% keep rate	System iterates autonomously
4	Wire Agent Teams	Day 9-12	Venture scaffolded in <4hr parallel	System scales horizontally
5	Wire Trust Ledger	Day 13-18	50+ verified settlements, trust predicts outcomes	Settlement data earns right to go on-chain

What Nav Should NOT Do

Don't build the trust ledger dashboard before Phase 1 instruments read true
Don't enable Agent Teams before single-agent /loop works
Don't write new scoreboard pages. Fix existing ones
Don't modify shared infra (algorithms, hex architecture) as part of this work
Don't deploy to Sui before Supabase trust data is validated against reality

What the Dream Hemisphere Will Do in Parallel

Review scoreboard readings against source data (commissioning gate for Phase 1)
Author next PRD in priority queue with scoring
Update strategy page with live positioning data when instruments exist
Verify settlement receipts from engineering (receiver validates, never sender)

7. Claude Code Feature Mapping

7.1 The Four-Layer Stack → Mantra Hierarchy

Mantra Level	Claude Code Layer	Phase	Application
System (prevents error)	Hooks (deterministic, fires automatically)	1-3	PostToolUse → compute settlement score. PreToolUse → protect shared infra. TeammateIdle → NEVER STOP. TaskCompleted → update trust ledger
Hook (fires automatically)	Skills (auto-discovered .md + supporting files)	2	`/commissioning` skill = procedural memory for handoff protocol
Rule (context loaded, applied)	Commands (explicit invocation)	1-3	`/loop` (autonomous iteration), `/goodwill` (read bearings), `/evolve-mantra` (session-end learning)
Mantra (remembers phrase)	Agent memory (persists across sessions)	2-5	`memory: project` scope in agent frontmatter

7.2 Custom Agent Definitions

Engineering Agent (.claude/agents/engineering.md):

Full write access: venture-specific code, tests, commissioning data
Read-only: libs/agency/algorithms/, shared infrastructure
To modify shared infra: commit message must include [COMMISSION]
Every task completion produces a receipt
The receiving agent (dream team) verifies settlement

Dream Agent (.claude/agents/dream.md):

Read-only on stackmates repo
Write access: mm repo pages, PRD specs, strategy updates
Verifies settlements — never generates them
When engineering sends a commissioning receipt: verify against spec

8. The Telco MEV Connection

In telco routing:

CALLER (intent) → ROUTING ALGORITHM → INFRASTRUCTURE → CDR (settlement) → BILLING (feedback)

MEV extracts value from the gap between intent and settlement — the router takes a cut. The Dreamineering thesis inverts this: minimize the gap so both agents compound trust. MEV isn't extracted by a middleman. It's returned to the network as goodwill — making the network more valuable, attracting more agents, creating more settlement opportunities.

That's DePIN. That's the tokenomics. The network's value IS the accumulated settlement integrity of every agent-to-agent transaction. The token doesn't represent equity in a company. It represents a claim on the trust ledger of the network.

The Supabase calibration (Phase 5) is proving this thesis with data before encoding it on-chain. Same pattern: understand the decision tree better than veterans, build the spreadsheet proof, earn the right to build the system.

9. Questions

If the scoreboard commissions itself to L4 using this plan, is that the first capability to reach L4 — and is that the story that proves the system works?

When /loop runs unattended for 2 hours with >50% keep rate, what does that tell you about human judgment in the inner loop — and what does it mean for the displaced workers thesis?
The trust score is computed from settlement integrity. But settlement is binary (arrived = promised, or didn't). Where does the nuance live? Is "partially matched" a thing, or does the instrument need to be crisp enough that every settlement is yes/no?
When the first external venture founder scaffolds on the mycelium and their agent settles with Stackmates agents for the first time — that's the External Settlement bearing's first reading. What does a healthy first reading look like?
The instrument carries intent, energy, conditions, and reputation simultaneously. In the Supabase prototype, these are columns in a table. On Sui, they're fields in a Move struct. Is there an intermediate form — something more structured than JSON but less permanent than on-chain — that accelerates the calibration?
Goodwill flows like water — finding every crack, going around, through, under the rock. If the trust ledger shows a boundary with low settlement integrity, does goodwill route around it automatically? Or does the system need explicit rerouting logic? Is that the explore-exploit algorithm applied to trust?

Context

PRD Index — Hemisphere Bridge → Agent Platform Phase 4
Prompt Deck — 5-card pitch
Pictures — Pre-flight maps

1. Problem Statement​

2. Architecture: A&ID for Phygital Systems​

2.1 The Pattern​

2.2 Two Hemispheres, One Protocol​

2.3 The Agent x Agent Matrix​

3. Goodwill Flow — The Northstar Instrument​

3.1 Why Not Cashflow​

3.2 Goodwill = Settlement Integrity at Every Boundary​

3.3 Three Bearings​

3.4 Settlement at Every Scale​

3.5 Where the Instrument Lives​

3.6 The Calibration Path: Supabase → Sui​

4. Gap Matrices​

4.1 Layer x Status​

4.2 Claude Code Feature x Hemisphere​

4.3 Commissioning Level x Scoreboard Section​

5. Execution Plan​

Phase 1: Wire the Scoreboard (Instrument Calibration)​

Phase 2: Wire the Protocol (Dream ↔ Engineering Settlement)​

Phase 3: Wire the Loop Control (NEVER STOP)​

Phase 4: Wire Agent Teams (Parallel Commissioning)​

Phase 5: Wire the Trust Ledger (Supabase Calibration → Sui Readiness)​

6. Build Sequence Summary​

What Nav Should NOT Do​

What the Dream Hemisphere Will Do in Parallel​

7. Claude Code Feature Mapping​

7.1 The Four-Layer Stack → Mantra Hierarchy​

7.2 Custom Agent Definitions​

8. The Telco MEV Connection​

9. Questions​

Context​