Inner Loop
We built a plan CLI with 40+ commands and 114 plans. The CLI says 74% complete. The UI said 1%. The boss still asks "are we on track?" — and we can't answer without a terminal.
These six projects close that gap. They are not the product. They are the machine that builds the product.
Agent sessions scatter results. The 'why' dies in Slack. North Stars exist as prose — none queryable. Plans dashboard math is wrong.
Seven skills span two repos, none chain. Every step needs a human trigger. Memory exists in DB — nothing reads it back into context.
One trigger runs the loop. Nav profile primes from DB. Decision traces persist automatically. Feature states compute from tests.
Morning diff in 30 seconds. Overnight loops produce proven demand. Commissioners trust the matrix. Factory measures its own output.
The Control System
Six roles. One loop. Each feeds the next.
Ship Now
≥ 3.02 PRDsProject Management System
SYSTEMTrack state of things across cadences so we can plan days, weeks, months and know if we're on track.
CLI Platform
INSTRUMENTOne binary (drmg) with shared platform lib, repository-only DB access, agent-grade contracts.
Prepare
2.0–2.94 PRDsWorkChart Bridge
CONVEYORBridge dream repo work charts to engineering execution engine — blueprints become running code.
Agent Platform
PUMPIdentity, memory, scaffold generators, boundary enforcement — the PUMP that powers the factory.
Decision Tracer
GAUGEDurable traces of the 'why' behind every priority change.
Autoresearch Loop
CONDUCTORChain skills into overnight autonomous loops.
What We Need
Six CLI commands. Each one closes a gap in the loop. Build in order — earlier commands unblock the later ones.
| PRD | Command |
|---|---|
| Agent Platform (PUMP-005)Register nav in AGENT_PATHS + create semantic/procedural memory JSON files | drmg etl load --agent=nav |
| Agent Platform (PUMP-005)Read profile + memories from DB, write nav-profile.md for context-aggregator | drmg agents context --agent=nav |
| Autoresearch Loop (LOOP-002)Return North Star metric as scalar (e.g. won_deals/total = 0.28) | drmg measure --prd=<slug> |
| WorkChart Bridge (CONVEYOR)Move dream repo work chart stage to plan CLI task automatically | drmg workchart advance --prd=<slug> --stage=<n> |
| Auto Commissioning (CONTROLLER)Compute L-level from test results, detect false-L3s, write receipt | drmg commission --prd=<slug> |
| Decision Tracer (GAUGE)Write durable decision trace to IDecisionTracePort | drmg trace log --context='<why>' --ref=<prd> |
Build Sequence
Sprint order = pipe sequence. Earlier sprints lay tracks the later ones run on.
| Sprint | What Ships |
|---|---|
| N1 | Plans UI — math, clickable rows, project grouping |
| N2 | Plan detail page — phases, tasks, progress, evidence |
| N3 | Project dashboard — things by state across all projects |
| N4 | Cadence views — daily/weekly/monthly planning surfaces |
| N5 | Computed states — test results update feature matrix |
| N6 | Decision traces — 'why' persisted alongside 'what' |
Context
- Navigation System — Value, Belief, Control
- Control System — PID mechanics, pit of success
- Progress — Things by state, not tasks by done
- Planning — Cadences: daily, weekly, monthly, quarterly
- Business Platform Requirements — The external-facing channel
- Commissioning Dashboard — L0→L4 states across all features
Questions
If the factory can't measure its own output, how does it know what to build next?
- When does the Plans UI replace the priorities markdown page as the primary decision surface?
- Which inner loop gap costs the most engineering time each week — measured in repeated manual steps?
- If the CONDUCTOR ran overnight, what would you find by morning — and what would be missing?
- What's the minimum loop that closes before you can trust the factory to run unsupervised?