Agent Platform
When agents need identity, memory, scaffold generators, and boundary enforcement to operate safely and improve autonomously — the PUMP that powers the factory.
Why should I care?
Five cards that sell the dream
Same five positions. Different seat. The operator asks "can I trust the dashboard?" The agent asks "what am I allowed to touch?"
How do we build this?
Five cards that sell the process
67+ CLI commands, 8 auditor dimensions, 542+ tests. But agents create files in wrong directories, repeat mistakes across sessions, and can't recall what worked before. Plans dashboard exists but math is wrong. Scaffold generators exist as functions but have no CLI surface.
One platform where agents have identity (who am I), memory (what do I know), scaffolds (how do I create), and boundaries (what can't I touch). The PUMP that powers every other inner-loop PRD.
Agent capability is scattered across 4 repos, 7 skill files, and 3 database schemas. No unified surface. The boundary between 'agent can do this' and 'agent must not do this' is implicit, not declared.
Agent boundaries that are too tight prevent useful work. Too loose and agents break things. The boundary must be declared per-agent, enforced by hooks, and learnable from patterns.
Priority (5P)
Readiness (5R)
What Exists
| Component | State | Gap |
|---|---|---|
| Plans dashboard UI | Stub | Page exists at /plans. Math wrong (5 issues). No drill-down. No project grouping. |
| Scaffold generator functions | Working | Functions exist in scaffold-generators.ts. Not wired to drmg CLI. No content-type registry. |
| Agent boundary hooks | Stub | One proof-of-concept (src-post-edit.sh). No per-agent scope declarations. |
| Virtue auditor (pattern tracking) | Working | 8 dimensions track trends. No cross-run extraction. No prevention proposals. |
| Agent memory DB schema | Working | agent_memory_stores table with vector column exists. No write pipeline. No recall query. |
| DRMG CLI (67+ commands) | Working | Unified binary works. Scaffold namespace not yet added. |
| Agent config (.claude/agents/) | Working | Agent definitions exist. No scope declarations per agent. |
Kill Signal
Boundary hooks block >30% of legitimate agent actions after 30 days. Agent task completion rate drops below current baseline.
Questions
Where does automation end and human judgment begin for agent boundaries?
- If agents can extract their own patterns, will they converge on the same rules humans would write?
- Should memory be per-agent or shared across all agents in a session?
- At what point does scaffold templating become over-engineering — when does an agent just write the file directly?
- Can boundary violations be the training signal for better scope declarations?