Skip to main content

← ETL Data Tool · Prompt Deck · Spec

ETL Data Tool — Pictures

Pre-flight maps for the data acquisition pipeline.

Maps

MapQuestionStatus
Outcome MapWhat does success look like?Complete
Value Stream MapWhere does time die?Complete
Dependency MapWhat must happen first?Complete
Capability MapWhat can we actually do?Complete
A&IDHow do agents orchestrate?Complete

Key Finding

The three-layer architecture (Structured APIs → AI Enrichment → Trust Scoring) emerged from the research. NZBN API is the authoritative foundation. Crawl4AI enriches. Trust scoring validates. Each layer has a known cost ($0) and known trust level.

Pictures Bridge

MapKey InsightFeeds
Outcome100 NZ businesses, trust-scored, queryable in < 2sSpec: Quality Targets
Value StreamNZBN (bulk) → CO (directors) → Crawl4AI (enrich) → Score → LoadSpec: Build Order
DependencyNZBN API key is the only external gate. Everything else is open-source.Spec: Sprint 0
CapabilityRepos exist (73 types), ETL CLI exists, connectors exist. Gap = acquisition layer.Spec: Build Ratio
A&IDETL Agent (extracts) + Trust Instrument (scores) + Schedule Controller (triggers)Spec: Agent-Facing Spec

Context

Questions

What is the most important visual missing from the etl data pipelines picture set — and why does it matter?

  • Which relationship between elements in this diagram is most underspecified — and what would happen if it were wrong?
  • If this picture were shown to a new engineer on day one, what would they misunderstand — and how should the picture be changed?
  • What assumption does this visual make that should be made explicit in the spec?