
ETL Data Tool — Agent & Instrument Diagram

How do agents orchestrate data acquisition?

System Diagram

┌─────────────────────────────────────────────────────────┐
│                   SCHEDULE CONTROLLER                   │
│           (Cron: monthly bulk, weekly delta)            │
└──────────────────────┬──────────────────────────────────┘
                       │ triggers
                       ▼
┌─────────────────────────────────────────────────────────┐
│                        ETL AGENT                        │
│                                                         │
│  ┌──────────┐   ┌──────────┐   ┌──────────────┐         │
│  │ NZBN     │   │ CO       │   │ Crawl4AI     │         │
│  │ Extractor│──▶│ Enricher │──▶│ Enricher     │         │
│  │ (API)    │   │ (API)    │   │ (Docker)     │         │
│  └──────────┘   └──────────┘   └──────────────┘         │
│       │              │               │                  │
│       └──────────────┼───────────────┘                  │
│                      ▼                                  │
│              ┌───────────────┐                          │
│              │     TRUST     │                          │
│              │  INSTRUMENT   │◄── Scores every record   │
│              │    (0-100)    │    before load           │
│              └───────┬───────┘                          │
│                      │                                  │
│                      ▼                                  │
│              ┌───────────────┐                          │
│              │   TYPE-SAFE   │                          │
│              │    LOADER     │◄── Drizzle + rollback    │
│              │ (PostgreSQL)  │                          │
│              └───────────────┘                          │
└─────────────────────────────────────────────────────────┘

                       ▼ FEEDBACK LOOP
┌─────────────────────────────────────────────────────────┐
│                  DOWNSTREAM CONSUMERS                   │
│                                                         │
│  Sales CRM ◄── business profiles + contacts             │
│  Sales Dev ◄── qualified leads (trust > 70)             │
│  Nowcast   ◄── business change signals (weekly delta)   │
│  BIG       ◄── market data for idea validation          │
│                                                         │
│  Query rate per entity = kill signal instrument         │
└─────────────────────────────────────────────────────────┘
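
The flow in the diagram (extractor, two enrichers, trust scoring, then load) can be sketched as a chain of stages. This is a minimal TypeScript sketch under stated assumptions, not the tool's actual code: the `BusinessRecord` shape, the three stage functions, `scoreTrust`, and `runPipeline` are hypothetical names, the scoring rule is a placeholder, and the `load` callback merely stands in for the Drizzle-based loader.

```typescript
// Hypothetical record shape; the real schema is not given in this document.
interface BusinessRecord {
  nzbn: string;
  name?: string;
  website?: string;
  sources: string[]; // which stages contributed data
}

type Stage = (record: BusinessRecord) => BusinessRecord;

// Placeholder stages standing in for the NZBN extractor and the two enrichers.
const extractFromNzbn: Stage = (r) => ({ ...r, sources: [...r.sources, "nzbn"] });
const enrichFromCo: Stage = (r) => ({
  ...r,
  name: r.name ?? "Example Ltd",
  sources: [...r.sources, "co"],
});
const enrichFromCrawl: Stage = (r) => ({
  ...r,
  website: r.website ?? "https://example.com",
  sources: [...r.sources, "crawl4ai"],
});

// Placeholder scoring rule (assumption): 30 points per distinct source, capped at 100.
function scoreTrust(record: BusinessRecord): number {
  return Math.min(100, new Set(record.sources).size * 30);
}

interface ScoredRecord {
  record: BusinessRecord;
  trust: number; // 0-100
}

// Run the stages in order, score the result, and only then hand it to the loader.
function runPipeline(
  seed: BusinessRecord,
  stages: Stage[],
  load: (row: ScoredRecord) => void,
): ScoredRecord {
  const record = stages.reduce((acc, stage) => stage(acc), seed);
  const scored: ScoredRecord = { record, trust: scoreTrust(record) };
  load(scored);
  return scored;
}

// Usage: an in-memory array stands in for the PostgreSQL table.
const loadedRows: ScoredRecord[] = [];
const pipelineResult = runPipeline(
  { nzbn: "0000000000000", sources: [] },
  [extractFromNzbn, enrichFromCo, enrichFromCrawl],
  (row) => {
    loadedRows.push(row);
  },
);
```

Keeping each stage a pure function from record to record means a stage can be added, removed, or reordered without touching the scoring or load steps.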

P&ID Translation

| P&ID Element | A&ID Equivalent | Instance |
|---|---|---|
| Process | ETL Agent | Extracts, transforms, loads |
| Instrument | Trust Scoring Engine | Measures data quality (0-100) |
| Pipeline | Three-layer architecture | APIs → Enrichment → Scoring |
| Control Loop | Schedule Controller + Kill Signal | Cron triggers + query rate feedback |
| Valve | Trust threshold (score ≥ 40) | Blocks low-trust data from downstream |
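
The valve can be read as a small routing function over the trust score. The sketch below is illustrative only: the thresholds (score ≥ 40 to pass the valve, trust > 70 for Sales Dev leads) come from this page, while the constant names, the `Disposition` type, and `applyValve` are assumptions introduced for the example.

```typescript
// Thresholds taken from this page; the identifiers are hypothetical.
const SERVE_THRESHOLD = 40; // valve: score >= 40 may flow downstream
const QUALIFIED_LEAD_MIN = 70; // Sales Dev receives leads with trust > 70

type Disposition = "reverify" | "serve" | "serve_as_qualified_lead";

// Decide what happens to a record based on its trust score (0-100).
function applyValve(trust: number): Disposition {
  if (!Number.isFinite(trust) || trust < 0 || trust > 100) {
    throw new RangeError(`trust score out of range: ${trust}`);
  }
  if (trust < SERVE_THRESHOLD) {
    return "reverify"; // blocked: not served to consumers
  }
  return trust > QUALIFIED_LEAD_MIN ? "serve_as_qualified_lead" : "serve";
}

// Boundary behaviour: 40 passes the valve, 70 is not yet a qualified lead.
const valveExamples = [39, 40, 70, 71].map((score) => applyValve(score));
```

Note the two boundaries differ in kind: the valve is inclusive (≥ 40), while the qualified-lead cut-off is strict (> 70).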

Feedback Loops

| Loop | Signal | Response |
|---|---|---|
| Trust threshold | Score < 40 on incoming record | Flag for re-verification, do not serve to consumers |
| Query rate | Zero queries after 14 days | Kill signal: stop extraction, investigate demand |
| Freshness decay | Last refresh > 7 days | Schedule controller triggers delta run |
| Cross-reference | New source confirms existing record | Trust score increases (cross_reference dimension) |
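
The four loops can be sketched as checks over per-entity state. This is a hedged illustration rather than the actual implementation: `EntityState`, `evaluateLoops`, and `applyCrossReference` are hypothetical names, "zero queries after 14 days" is interpreted here as no queries once 14 or more days have passed since first load, and the size of the cross-reference increase (10 points) is an assumption because the page does not state it.

```typescript
type LoopAction = "flag_for_reverification" | "kill_extraction" | "trigger_delta_run";

// Hypothetical per-entity state tracked by the feedback loops.
interface EntityState {
  trust: number; // 0-100
  daysSinceFirstLoad: number;
  queryCount: number; // downstream queries since first load
  daysSinceRefresh: number;
}

// Map the signals in the table to their responses.
function evaluateLoops(state: EntityState): LoopAction[] {
  const actions: LoopAction[] = [];
  if (state.trust < 40) {
    actions.push("flag_for_reverification"); // trust threshold loop
  }
  if (state.daysSinceFirstLoad >= 14 && state.queryCount === 0) {
    actions.push("kill_extraction"); // query rate loop (kill signal)
  }
  if (state.daysSinceRefresh > 7) {
    actions.push("trigger_delta_run"); // freshness decay loop
  }
  return actions;
}

// Cross-reference loop: a confirming source raises trust, capped at 100.
// The default bonus of 10 is an assumed value for illustration.
function applyCrossReference(trust: number, bonus: number = 10): number {
  return Math.min(100, trust + bonus);
}

// Usage: a stale, low-trust entity that nobody has queried trips all three checks.
const loopActions = evaluateLoops({
  trust: 35,
  daysSinceFirstLoad: 20,
  queryCount: 0,
  daysSinceRefresh: 8,
});
```

Returning a list of actions rather than acting inline keeps the loop logic testable and leaves the response (re-verification queue, scheduler call) to the caller.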

Context