Inner Loop

We built a plan CLI with 40+ commands to track 114 plans. The CLI says 74% complete. The UI said 1%. The boss still asks "are we on track?" and we can't answer without a terminal.

These six projects close that gap. They are not the product. They are the machine that builds the product.

Pain

Agent sessions scatter results. The 'why' dies in Slack. North Stars exist as prose — none queryable. Plans dashboard math is wrong.

Friction

Seven skills span two repos, and none of them chain. Every step needs a human trigger. Memory exists in the DB, but nothing reads it back into context.

Desire

One trigger runs the loop. Nav profile primes from DB. Decision traces persist automatically. Feature states compute from tests.

Potential

Morning diff in 30 seconds. Overnight loops produce proven demand. Commissioners trust the matrix. Factory measures its own output.

P&ID Role Map

The Control System

Six roles. One loop. Each feeds the next.

| Role | Project | Function | Level |
| --- | --- | --- | --- |
| CONDUCTOR | Autoresearch Loop | Chains skills into overnight autonomous loops | L0 |
| PUMP | Agent Platform | Identity, memory, comms, dispatch: moves the work | L2 |
| INSTRUMENT | CLI Platform | drmg binary: one control surface, 60+ commands | L4 |
| CONVEYOR | WorkChart Bridge | Blueprints become running plans (dream to eng pipeline) | L0 |
| GAUGE | Decision Tracer | Durable traces of the 'why' behind every change | L0 |
| CONTROLLER | Automated Commissioning | Test results compute feature states; closes the loop | L0 |
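
The CONTROLLER idea, where test results rather than hand-edited status fields decide a feature's level, can be pictured with a small sketch. Everything here is an assumption made for illustration and not the drmg implementation: the `CheckResult` shape, the rule that a level is earned only when every test gating that level passes, and the definition of a false claim as any declared level above the computed one.

```python
from __future__ import annotations

from dataclasses import dataclass


@dataclass(frozen=True)
class CheckResult:
    """One test outcome. `level` is the (hypothetical) L-level the test gates."""
    feature: str
    level: int
    passed: bool


def computed_level(results: list[CheckResult], feature: str, max_level: int = 4) -> int:
    """Highest N such that every level 1..N has tests and all of them pass."""
    level = 0
    for candidate in range(1, max_level + 1):
        gate = [r for r in results if r.feature == feature and r.level == candidate]
        if not gate or not all(r.passed for r in gate):
            break  # missing evidence or a failing test stops the climb
        level = candidate
    return level


def false_claims(
    results: list[CheckResult], claimed: dict[str, int]
) -> dict[str, tuple[int, int]]:
    """Map feature -> (claimed, computed) wherever the claim exceeds the evidence."""
    out: dict[str, tuple[int, int]] = {}
    for feature, claim in claimed.items():
        actual = computed_level(results, feature)
        if claim > actual:
            out[feature] = (claim, actual)
    return out
```

Under these assumed rules, a feature declared at L3 whose level-3 test fails would be reported as `(3, 2)`: claimed 3, supported 2.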

What We Need

Six CLI commands. Each one closes a gap in the loop. Build in order — earlier commands unblock the later ones.

| PRD | Purpose | Command |
| --- | --- | --- |
| Agent Platform (PUMP-005) | Register nav in AGENT_PATHS + create semantic/procedural memory JSON files | drmg etl load --agent=nav |
| Agent Platform (PUMP-005) | Read profile + memories from DB, write nav-profile.md for context-aggregator | drmg agents context --agent=nav |
| Autoresearch Loop (LOOP-002) | Return North Star metric as scalar (e.g. won_deals/total = 0.28) | drmg measure --prd=<slug> |
| WorkChart Bridge (CONVEYOR) | Move dream repo work chart stage to plan CLI task automatically | drmg workchart advance --prd=<slug> --stage=<n> |
| Auto Commissioning (CONTROLLER) | Compute L-level from test results, detect false-L3s, write receipt | drmg commission --prd=<slug> |
| Decision Tracer (GAUGE) | Write durable decision trace to IDecisionTracePort | drmg trace log --context='<why>' --ref=<prd> |
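
To make "North Star metric as scalar" concrete, here is a minimal sketch of the won_deals/total example. It is not how `drmg measure` is implemented: the record shape, the `won` key, and the choice to return `None` for an empty dataset are assumptions, and the data is made up to reproduce the 0.28 figure.

```python
from __future__ import annotations


def north_star_ratio(deals: list[dict], won_key: str = "won") -> float | None:
    """Return won_deals / total as one float, or None when there are no deals."""
    total = len(deals)
    if total == 0:
        return None  # "no data" is kept distinct from a real 0.0 score
    won = sum(1 for d in deals if d.get(won_key))
    return won / total


# Illustrative data only: 7 won deals out of 25.
deals = [{"won": True}] * 7 + [{"won": False}] * 18
print(round(north_star_ratio(deals), 2))  # 0.28
```

A single number like this is what lets a loop compare one overnight run against the next without a human reading prose.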

Build Sequence

Sprint order = pipe sequence. Earlier sprints lay tracks the later ones run on.

| Sprint | What Ships |
| --- | --- |
| N1 | Plans UI: math, clickable rows, project grouping |
| N2 | Plan detail page: phases, tasks, progress, evidence |
| N3 | Project dashboard: things by state across all projects |
| N4 | Cadence views: daily/weekly/monthly planning surfaces |
| N5 | Computed states: test results update feature matrix |
| N6 | Decision traces: 'why' persisted alongside 'what' |
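
Sprint N6 persists the 'why' next to the 'what'. The sketch below shows one possible shape for that, under stated assumptions: `IDecisionTracePort` is named in this page, but its methods are not, so the `write` signature, the `DecisionTrace` fields, and the append-only JSONL adapter are all hypothetical.

```python
from __future__ import annotations

import json
from dataclasses import asdict, dataclass
from datetime import datetime, timezone
from pathlib import Path
from typing import Protocol


@dataclass(frozen=True)
class DecisionTrace:
    context: str      # the 'why'
    ref: str          # the PRD slug the decision relates to
    recorded_at: str  # ISO 8601 timestamp, UTC


class IDecisionTracePort(Protocol):
    """Port name comes from the page; this method signature is assumed."""

    def write(self, trace: DecisionTrace) -> None: ...


class JsonlTraceStore:
    """Hypothetical adapter: one JSON object per line, append-only."""

    def __init__(self, path: Path) -> None:
        self.path = path

    def write(self, trace: DecisionTrace) -> None:
        with self.path.open("a", encoding="utf-8") as fh:
            fh.write(json.dumps(asdict(trace)) + "\n")


def log_decision(port: IDecisionTracePort, context: str, ref: str) -> DecisionTrace:
    """Stamp the decision with the current UTC time and hand it to the port."""
    trace = DecisionTrace(context, ref, datetime.now(timezone.utc).isoformat())
    port.write(trace)
    return trace
```

Keeping the port as a protocol means the file-backed store could be swapped for a database-backed one without touching the callers.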

Context

Questions

If the factory can't measure its own output, how does it know what to build next?

  • When does the Plans UI replace the priorities markdown page as the primary decision surface?
  • Which inner loop gap costs the most engineering time each week — measured in repeated manual steps?
  • If the CONDUCTOR ran overnight, what would you find by morning — and what would be missing?
  • What's the minimum loop that closes before you can trust the factory to run unsupervised?