← Hemisphere Bridge → Agent Platform Phase 4 · Prompt Deck · Pictures
Hemisphere Bridge Spec v2
From: Outer Loop (Claude.ai) → Engineering Team (Nav) Date: 2026-03-11 Status: Draft — ready for engineering review Depends on: Nav Continuity Layer (Phase 0-4 complete), PRD Handoff Protocol, Agent Comms (Convex) Northstar: Every cycle of dream → spec → build → commission → dream-bigger leaves both hemispheres measurably stronger, takes less time, and the trust ledger confirms it.
1. Problem Statement
The dream hemisphere (mm.dreamineering.com) and the engineering hemisphere (stackmates) communicate through manual handoffs, stale instrument readings, and a protocol that exists on paper but isn't wired into the system. The VVFL spiral can't run autonomously because:
- The scoreboard has no live readings (188 capabilities listed, instruments uncalibrated)
- The handoff protocol is a document, not a hook — enforcement is at convention layer (Level 2), not system layer (Level 4)
- No automated keep/discard — engineering can't verify whether a change improved the system without human judgment at every step
- No settlement verification — when a PRD crosses from dream to engineering and back, no instrument confirms arrival matched promise
- The trust ledger doesn't exist yet — settlement history per agent per boundary isn't tracked
Karpathy proved: three files + one metric + "NEVER STOP" = 100+ autonomous experiments overnight. The primitives exist. Claude Code now ships hooks, custom agents, agent teams, /batch, and agent memory with project scope. What's missing is the wiring.
2. Architecture: A&ID for Phygital Systems
2.1 The Pattern
In a dairy factory P&ID:
| P&ID Element | Function |
|---|---|
| Agent (pump, motor, actuator) | Converts energy into action — actuates intent |
| Instrument (sensor, gauge, valve, controller) | Reads state, carries control signals, verifies settlement |
| Pipe | Infrastructure connecting agents and instruments |
| Loop | Agent + instrument + pipe + feedback = closed system |
In the phygital mycelium A&ID — same pattern, different substrate:
| A&ID Element | Function | Examples |
|---|---|---|
| Agent (phygital being) | Biological + digital composite that actuates intent | Matt + Nav. Displaced worker + Stackmates agent. NOT human alone, NOT AI alone |
| Instrument | Carries energy AND instructions between agents — the "intelligent hyperlink" | Smart contract, type contract, API boundary, commissioning receipt, financial instrument |
| Infrastructure (mycelium) | Protocol network connecting agents via instruments | Blockchain settlement, API boundaries, shared libs, Supabase, the whole stack |
| Loop | Agent → instrument → agent → settlement → feedback | Closed system where trust compounds or corrects |
Critical distinction: In P&ID, instruments primarily measure. In A&ID, instruments are active participants — they carry intent, energy, conditions, and reputation simultaneously. A smart contract doesn't just measure settlement. It holds the energy in escrow, encodes conditions, releases on fulfillment, and records to the trust ledger. Simultaneously pipe, valve, and gauge.
2.2 Two Hemispheres, One Protocol
mm.dreamineering.com stackmates repo
┌──────────────────────┐ ┌───────────────────── ─┐
│ DREAM HEMISPHERE │ │ ENGINEERING HEMISPHERE│
│ │ │ │
│ Strategy │ PROTOCOL │ 9 algorithms │
│ Stories │ BOUNDARY │ 208 tables │
│ Priorities │◄──────────►│ Hex architecture │
│ Scoreboard │ │ Venture code │
│ Navigation System │ │ Shared infra │
│ 400+ pages of │ │ Agent comms │
│ collective wisdom │ │ Commissioning data │
│ │ │ │
│ = program.md │ │ = train.py │
│ at scale │ │ at scale │
└ ──────────────────────┘ └──────────────────────┘
│ │
└───────────┐ ┌─────────────┘
▼ ▼
┌──────────────────────┐
│ SCOREBOARD │
│ reads BOTH sides │
│ = prepare.py │
│ (locked eval) │
└──────────────────────┘
Rule: No agent crosses the hemisphere boundary without going through the protocol. Dream agents don't write code. Engineering agents don't rewrite strategy. The protocol IS the trust boundary — same role as Karpathy's locked prepare.py.
Karpathy mapping:
| Autoresearch | Dreamineering | Scale |
|---|---|---|
prepare.py (locked trust boundary) | libs/agency/algorithms + shared infra | 9 algorithms, 208 tables |
train.py (single mutable file per experiment) | Venture-specific code / PRD spec.md | Per-venture, per-PRD |
program.md (human writes strategy, agent executes) | mm.dreamineering.com (entire site) | 400+ pages of directive strategy |
val_bpb (single unambiguous metric) | Settlement integrity at each boundary | Three bearings (Section 3) |
git branch + revert on failure | Git-based, needs automated revert | Hook-enforced (Phase 3) |
"NEVER STOP" | TeammateIdle hook with exit code 2 | Phase 3 |
2.3 The Agent x Agent Matrix
The dominant future boundary for goodwill flow is agent-to-agent — where agents are phygital beings operating at different ratios of biological to digital.
| Agent \ Agent | Biological | Phygital Being | Robot | Pure Digital |
|---|---|---|---|---|
| Biological | Handshake, reputation | Human hires phygital service | Human operates machine | Human calls API |
| Phygital Being | Agent represents human | DOMINANT BOUNDARY — agent-to-agent commerce | Phygital directs fleet | Phygital orchestrates services |
| Robot | Machine serves human | Robot executes phygital intent | Machine-to-machine | Robot consumes digital service |
| Pure Digital | Software serves human | Software serves phygital being | Software controls robot | API-to-API settlement |
The center cell (phygital ↔ phygital) is where most future commerce lives. The instruments between them carry:
| What the Instrument Carries | Function |
|---|---|
| Financial energy | Payment, escrow, streaming yield |
| Instructions | Conditions for release, quality thresholds, SLAs |
| Reputation data | Settlement history, trust score, endorsement chain |
| Intent | What the sending agent wants the receiving agent to accomplish |
3. Goodwill Flow — The Northstar Instrument
3.1 Why Not Cashflow
Developer Anarchy (Fred George) used cashflow as northstar. Works when every team is a revenue experiment. But cashflow measures extraction — "did we capture value?" Capitalism rewards aggregation. That's not the game.
Goodwill is the only capital that compounds without limit. But you can't measure accumulation — that's back to wealth thinking. You measure flow rate — is capital converting through the cycle?
Financial → Time → Intellectual → Social → Goodwill → Financial
↑___________________________________________________|
3.2 Goodwill = Settlement Integrity at Every Boundary
Goodwill flows between agents through instruments in closed loops:
AGENT A INSTRUMENT AGENT B
(phygital being) (smart contract / financial (phygital being)
instrument / type contract)
Intent ─────────► Encodes intent + energy ──────────► Receives
│
Trust ledger ◄─── Settlement confirmation ◄─────────── Actuates
updates intent
│
Goodwill ◄──────── Did arrival match promise? ◄────── Verifies
compounds The INSTRUMENT reads this settlement
The receiver validates, never the sender. Commissioning principle: "the builder never validates their own work."
3.3 Three Bearings
Not five — keep it tight enough for binary keep/discard.
| Bearing | What It Reads | Boundary | Computed From |
|---|---|---|---|
| Internal Settlement | Do components settle cleanly between agents within engineering? | Type contracts, API boundaries, test assertions | Tests pass, types hold, no defensive code. PostToolUse hook computes |
| Hemisphere Settlement | Does what's built match what's dreamed? Does feedback elevate next dream? | PRD spec ↔ commissioning result | Gap between acceptance criteria and test coverage. Protocol layer computes |
| External Settlement | Does what's promised work when a stranger's agent applies it? | Published standards, scaffolded ventures, recommendations | Standards adopted, ventures operating, referrals that stick. Analytics + venture telemetry |
Each bearing asks the same question: did what arrived match what was promised, and did the receiver's trust compound?
3.4 Settlement at Every Scale
| Scale | Agent A | Instrument | Agent B | Settlement = Goodwill? |
|---|---|---|---|---|
| Code | Shared lib | Type contract + API boundary | Venture code | Types hold, no defensive code needed |
| Inner loop | Nav (engineering) | Commissioning receipt | Dream team (outer loop) | Built matches specced |
| Platform | Stackmates agent | Service agreement + smart contract | Venture founder's agent | Infra works, venture operates |
| Commerce | Seller's phygital being | Smart contract (payment + conditions + reputation) | Buyer's phygital being | Product matches endorsement, trust compounds |
| Network | Venture A's agent | Shared standard on mycelium | Venture B's agent | Standard works for both, adoption compounds |
| Recommendation | Trusted figure's agent | Endorsement (reputation instrument) | Prospect's agent | Product matches endorsement — or trust ledger adjusts downward |
3.5 Where the Instrument Lives
The instrument reads from both hemispheres — it sits in the protocol layer:
- Internal Settlement — computed from git commit data + test results (stackmates repo)
- Hemisphere Settlement — computed from PRD frontmatter (mm repo) x commissioning dates (stackmates repo)
- External Settlement — computed from published standards (mm repo) x package consumers + venture telemetry (stackmates repo)
3.6 The Calibration Path: Supabase → Sui
You don't put an untested instrument on the plant floor. Bench-test, calibrate, prove it reads true, THEN install in the live system.
Supabase (JSON/SQL) → Proven patterns → Sui (on-chain)
Installation + Testing Operational Optimization
"Does the loop close?" "Does settlement compound?" "Does it scale trustlessly?"
| What Exists in Supabase | What It Becomes on Sui | Calibration Test |
|---|---|---|
| PRD record with commissioning status | Smart contract: intent + conditions + energy escrow | Does Supabase record accurately predict commissioning outcome? |
| Agent receipt (JSON) | Settlement object owned by receiving agent | Does receipt match what was actually delivered? |
| Trust score (computed from settlement history) | Identity object (pepeha) with provenance chain | Does accumulated trust data correlate with actual agent reliability? |
| Venture registry | Venture object spawned from mycelium template | Can a new venture scaffold using only the Supabase-proven patterns? |
Sui's object-centric model is the target because identity isn't a row in a database — it's a first-class object that owns other objects. The pepeha isn't metadata. It's the root object from which everything inherits trust.
| Pepeha Layer | Supabase (now) | Sui (when earned) |
|---|---|---|
| Who you are | User record with roles | Object with owned capabilities |
| Where you come from | Settlement history as rows | Provenance chain — every transaction, immutable |
| Why you're here | Intent declared in PRD frontmatter | Intent encoded in smart contract conditions |
| Where you're going | Priority score, roadmap | Staked commitment — energy locked to declared intent |
| Who's onboard | Team assignments | Co-signed objects — trust is structural, not claimed |
The migration isn't a technical project. It's a commissioning gate. When Supabase data honestly reflects reality through clean settlement patterns across multiple ventures, the Sui migration is a formality. The hard work is the calibration.
4. Gap Matrices
Matrix thinking: show the gaps, don't explain them. The ? cells are the work.
4.1 Layer x Status
| Layer | Dream Side | Protocol | Engineering Side |
|---|---|---|---|
| Memory | Claude.ai memories (partial) | WM-NAV (Phase 0-4 done) | CLAUDE.md + auto-memory |
| Handoff | PRD spec.md authored | Handoff protocol (doc exists, not enforced) | Nav reads spec, builds |
| Feedback | ? | Commissioning updates (partial Convex comms) | Receipts exist |
| Instruments | Scoreboard pages (calibrating) | ? | Feature-matrix (188 features) |
| Loop Control | ? | ? | Hooks (stop-uncommitted only) |
| Autonomy | ? | ? | /orch-plan + /orch-work (manual trigger) |
| Settlement Verification | ? | ? | ? |
| Trust Ledger | ? | ? | ? |
Eight ? cells in the protocol column. Those are the highest-leverage work items.
4.2 Claude Code Feature x Hemisphere
| Feature | Dream (mm) | Protocol (bridge) | Engineering (stackmates) |
|---|---|---|---|
| CLAUDE.md | Site architecture | PRD contract exists | Hex rules, CDD |
| Skills | /content-flow, /copywriting | ? /commissioning skill | /orch-plan, /orch-work |
| Hooks | Content quality (partial) | ? PostToolUse → scoreboard | stop-uncommitted only |
| Custom Agents | ? Dream agent definition | ? Comms agent | ? Engineering agent definition |
| Agent Teams | ? Narrative team | N/A | ? Builder/verifier/simplifier |
| Agent Memory | ? | ? Shared project scope | WM-NAV (file-based) |
/batch | ? Retrofit pages at scale | N/A | ? Venture scaffolding |
/simplify | N/A | N/A | Available (not in loop) |
4.3 Commissioning Level x Scoreboard Section
| Section | Current Level | Gap to L3 | Gap to L4 |
|---|---|---|---|
| Reality (instruments) | L2 — pages exist, instruments listed | Wire live readings from feature-matrix | Independent verification of readings |
| Strategy (positioning) | L1 — one page, no instruments | Add strategy-level gauges (ventures with validated demand) | External validation of positioning |
| Truth-Seeking | L2 — protocol defined, evidence hierarchy exists | Track evidence levels per decision systematically | Protocol applied to itself — catch Claude assumptions |
| Financial metrics | L2 — consolidated into parent as table rows | Connect to live Supabase data | Metrics computed from live data sources |
| Commissioning | L2 — strongest page in section, but reads stale data | Wire to feature-matrix as source of truth | Builder never validates own work — outer loop confirms |
5. Execution Plan
Follows commissioning model: Installation → Testing → Operational → Optimization. Follows think-slow-act-fast: this spec IS the slow thinking. The phases below are fast execution. Each phase's gate must pass before starting the next.
Phase 1: Wire the Scoreboard (Instrument Calibration)
Why first: Can't run a loop without instruments that read true. The scoreboard's own instruments are uncalibrated. This is building prepare.py — the locked evaluation function.
Goodwill bearing served: Internal Settlement.
| # | Task | Repo | Effort | Acceptance Criteria |
|---|---|---|---|---|
| 1.1 | Fix stale numbers on scoreboard reality page | mm | 30 min | Numbers match feature-matrix source |
| 1.2 | Add "Current Reading" section to reality/index.md — manually maintained snapshot from feature-matrix | mm | 1 hr | Page shows: total features, % coverage, count at each L-level |
| 1.3 | Financial stubs already consolidated into parent as table rows | mm | 0 | Already done — no collapse needed |
| 1.4 | Create PostToolUse hook: on any commit to stackmates, compute composition ratio and log to .ai/scoreboard-readings/YYYY-MM-DD.json | stackmates | 2 hr | JSON file appears after each commit |
| 1.5 | Create SessionStart hook: read latest scoreboard reading, include in nav-session-boot.mjs briefing | stackmates | 1 hr | Nav starts every session seeing last composition ratio |
| 1.6 | Create /goodwill command: prints current readings for all three bearings | stackmates | 1 hr | Running /goodwill shows a truthful bearing panel |
Success criterion: Nav can run /goodwill and see numbers that reflect reality.
Phase 2: Wire the Protocol (Dream ↔ Engineering Settlement)
Why second: With instruments reading true, the protocol between hemispheres can be verified.
Goodwill bearing served: Hemisphere Settlement.
| # | Task | Repo | Effort | Acceptance Criteria |
|---|---|---|---|---|
| 2.1 | Create /commissioning skill: reads PRD frontmatter, outputs next action based on phase | mm | 2 hr | Nav invokes skill, gets unambiguous next action for current PRD |
| 2.2 | Create PreToolUse hook: block writes to libs/agency/algorithms/ without [COMMISSION] flag | stackmates | 1 hr | Shared infra modification without flag → blocked |
| 2.3 | Create TaskCompleted hook: update PRD frontmatter + append to commissioning log + write settlement receipt to Convex | stackmates | 2 hr | Receipt appears in Convex, PRD frontmatter updated |
| 2.4 | Update nav-session-boot.mjs: read pending commissioning updates from agent comms | stackmates | 1 hr | Briefing shows dream hemisphere changes since last session |
| 2.5 | Define engineering agent: .claude/agents/engineering.md | stackmates | 30 min | Agent definition file exists with explicit tool boundaries |
| 2.6 | Create settlement verification: receiving side confirms receipt against spec acceptance criteria, logs to Supabase | protocol layer | 2 hr | Every settlement has two records: sender's receipt + receiver's verification |
| 2.7 | Create settlement history table in Supabase: agent_settlements | stackmates | 1 hr | Settlement data persists across sessions |
Phase 3: Wire the Loop Control (NEVER STOP)
| # | Task | Repo | Effort | Acceptance Criteria |
|---|---|---|---|---|
| 3.1 | Create /loop command: reads PRD queue, picks top uncommissioned, runs plan → work → verify → commission or revert | stackmates | 3 hr | /loop picks highest-priority PRD, plans, executes, verifies |
| 3.2 | Add timebox to /orch-work: each task gets budget (default 30 min) | stackmates | 1 hr | Task exceeding budget is committed as partial with log |
| 3.3 | Create TeammateIdle hook: exit code 2 → read PRD queue → assign next task | stackmates | 1 hr | Idle teammate receives new task automatically |
| 3.4 | Create PostToolUse hook: if composition ratio regresses → git revert HEAD + log failure | stackmates | 2 hr | Regression → automatic revert + failure logged |
| 3.5 | Create /evolve-mantra command: session-end condensation of key decisions + outcomes | stackmates | 1 hr | Session learnings persist |
| 3.6 | Compute cycle velocity: PRD inception → commissioning "Operational" timestamp | protocol layer | 1 hr | Hemisphere Settlement bearing has a number |
Phase 4: Wire Agent Teams (Parallel Commissioning)
| # | Task | Repo | Effort | Acceptance Criteria |
|---|---|---|---|---|
| 4.1 | Enable agent teams feature flag | stackmates | 5 min | Flag set |
| 4.2 | Create engineering team: Builder + Verifier + Simplifier | stackmates | 2 hr | Three-role team with file ownership |
| 4.3 | Create dream team: Strategist + Copywriter + Devil's Advocate | mm | 2 hr | Dream hemisphere can parallelize |
| 4.4 | Wire /batch for venture scaffolding | stackmates | 4 hr | New venture decomposes into parallel units |
| 4.5 | Create cross-hemisphere Lead agent | protocol layer | 4 hr | Lead coordinates across hemispheres |
Phase 5: Wire the Trust Ledger (Supabase Calibration → Sui Readiness)
| # | Task | Repo | Effort | Acceptance Criteria |
|---|---|---|---|---|
| 5.1 | Define instrument schemas for agent-to-agent boundaries | protocol layer | 3 hr | Schema covers intent, conditions, energy, reputation, settlement |
| 5.2 | Create settlement verification hook: receiving agent confirms | stackmates | 2 hr | Every boundary crossing has two-sided record |
| 5.3 | Trust score computation: integrity rate per agent per boundary | stackmates | 2 hr | Each agent has trust score per boundary |
| 5.4 | Trust ledger dashboard on scoreboard reality page | mm | 2 hr | Scoreboard shows agent settlement integrity |
| 5.5 | Calibration validation: trust scores vs ground truth | protocol layer | 4 hr | Trust score predicts outcomes with >70% accuracy |
| 5.6 | Sui object model draft: Map Supabase schemas to Move structs | design doc only | 4 hr | Move structs mirror proven Supabase patterns |
6. Build Sequence Summary
| Order | Phase | Duration | Gate | Proves |
|---|---|---|---|---|
| 1 | Wire Scoreboard | Day 1-2 | Numbers match reality | System can read its own position |
| 2 | Wire Protocol | Day 3-5 | PRD cycles produce verified settlements | Agents can settle across hemisphere boundary |
| 3 | Wire Loop Control | Day 6-8 | 2hr unattended run, >50% keep rate | System iterates autonomously |
| 4 | Wire Agent Teams | Day 9-12 | Venture scaffolded in <4hr parallel | System scales horizontally |
| 5 | Wire Trust Ledger | Day 13-18 | 50+ verified settlements, trust predicts outcomes | Settlement data earns right to go on-chain |
What Nav Should NOT Do
- Don't build the trust ledger dashboard before Phase 1 instruments read true
- Don't enable Agent Teams before single-agent
/loopworks - Don't write new scoreboard pages. Fix existing ones
- Don't modify shared infra (algorithms, hex architecture) as part of this work
- Don't deploy to Sui before Supabase trust data is validated against reality
What the Dream Hemisphere Will Do in Parallel
- Review scoreboard readings against source data (commissioning gate for Phase 1)
- Author next PRD in priority queue with scoring
- Update strategy page with live positioning data when instruments exist
- Verify settlement receipts from engineering (receiver validates, never sender)
7. Claude Code Feature Mapping
7.1 The Four-Layer Stack → Mantra Hierarchy
| Mantra Level | Claude Code Layer | Phase | Application |
|---|---|---|---|
| System (prevents error) | Hooks (deterministic, fires automatically) | 1-3 | PostToolUse → compute settlement score. PreToolUse → protect shared infra. TeammateIdle → NEVER STOP. TaskCompleted → update trust ledger |
| Hook (fires automatically) | Skills (auto-discovered .md + supporting files) | 2 | /commissioning skill = procedural memory for handoff protocol |
| Rule (context loaded, applied) | Commands (explicit invocation) | 1-3 | /loop (autonomous iteration), /goodwill (read bearings), /evolve-mantra (session-end learning) |
| Mantra (remembers phrase) | Agent memory (persists across sessions) | 2-5 | memory: project scope in agent frontmatter |
7.2 Custom Agent Definitions
Engineering Agent (.claude/agents/engineering.md):
- Full write access: venture-specific code, tests, commissioning data
- Read-only: libs/agency/algorithms/, shared infrastructure
- To modify shared infra: commit message must include
[COMMISSION] - Every task completion produces a receipt
- The receiving agent (dream team) verifies settlement
Dream Agent (.claude/agents/dream.md):
- Read-only on stackmates repo
- Write access: mm repo pages, PRD specs, strategy updates
- Verifies settlements — never generates them
- When engineering sends a commissioning receipt: verify against spec
8. The Telco MEV Connection
In telco routing:
CALLER (intent) → ROUTING ALGORITHM → INFRASTRUCTURE → CDR (settlement) → BILLING (feedback)
MEV extracts value from the gap between intent and settlement — the router takes a cut. The Dreamineering thesis inverts this: minimize the gap so both agents compound trust. MEV isn't extracted by a middleman. It's returned to the network as goodwill — making the network more valuable, attracting more agents, creating more settlement opportunities.
That's DePIN. That's the tokenomics. The network's value IS the accumulated settlement integrity of every agent-to-agent transaction. The token doesn't represent equity in a company. It represents a claim on the trust ledger of the network.
The Supabase calibration (Phase 5) is proving this thesis with data before encoding it on-chain. Same pattern: understand the decision tree better than veterans, build the spreadsheet proof, earn the right to build the system.
9. Questions
If the scoreboard commissions itself to L4 using this plan, is that the first capability to reach L4 — and is that the story that proves the system works?
- When
/loopruns unattended for 2 hours with >50% keep rate, what does that tell you about human judgment in the inner loop — and what does it mean for the displaced workers thesis? - The trust score is computed from settlement integrity. But settlement is binary (arrived = promised, or didn't). Where does the nuance live? Is "partially matched" a thing, or does the instrument need to be crisp enough that every settlement is yes/no?
- When the first external venture founder scaffolds on the mycelium and their agent settles with Stackmates agents for the first time — that's the External Settlement bearing's first reading. What does a healthy first reading look like?
- The instrument carries intent, energy, conditions, and reputation simultaneously. In the Supabase prototype, these are columns in a table. On Sui, they're fields in a Move struct. Is there an intermediate form — something more structured than JSON but less permanent than on-chain — that accelerates the calibration?
- Goodwill flows like water — finding every crack, going around, through, under the rock. If the trust ledger shows a boundary with low settlement integrity, does goodwill route around it automatically? Or does the system need explicit rerouting logic? Is that the explore-exploit algorithm applied to trust?
Context
- PRD Index — Hemisphere Bridge → Agent Platform Phase 4
- Prompt Deck — 5-card pitch
- Pictures — Pre-flight maps