Engineering Insights8 min read

From DWH to AI Agents

The data warehouse was built for humans. The semantic layer made it accessible. AI agents are what happens when you close the loop — from data to decision, without human intervention.

AI AgentsDWHSemantic LayerAutonomous Analytics

The Evolution

The data warehouse solved the problem of fragmented operational data. Before it, every business question required exporting CSVs from five systems and reconciling them in a spreadsheet. The DWH centralised data, standardised definitions, and made consistent reporting possible.

Analytics engineering extended that foundation. dbt brought software engineering discipline to data transformation — version control, testing, documentation, and modular design. Metric definitions became code. Transformation logic became auditable. The warehouse became a governed asset, not just a database.

The semantic layer made that governed asset accessible. It translated physical table structures into business concepts that non-engineers could query. Dashboards could be built against governed metrics rather than raw tables. Self-service analytics became possible without sacrificing consistency.

AI agents are the next evolution. They use the governed semantic layer as their operating context — reading metric definitions, applying business rules, generating queries, and surfacing insights without requiring a human to initiate the request.

How Agents Use the Stack

Context from the semantic layer — before answering any question, the agent reads the relevant metric definitions, business rules, and known caveats from the semantic layer. This grounds every response in governed data.
Query generation over governed assets — the agent generates SQL against mart tables and semantic layer metrics, not raw tables. Business logic is inherited from the governance layer, not re-implemented in prompts.
Scheduled monitoring — agents run on a schedule without being asked. They check KPI deviations, compare current values to historical baselines, and surface anomalies before a human would notice them.
Escalation logic — when a deviation is significant, the agent generates a structured explanation and escalates to human review. When it is within expected variance, it logs the observation and moves on. Humans see signal, not noise.
Closed feedback loop — from data change to insight to action, no human is required to carry the message. The agent detects, interprets, and initiates.

What This Requires

AI agents over a DWH are not a shortcut. They require the full data engineering stack beneath them to be built correctly. An agent operating on ungoverneded raw data will produce confident-sounding answers that are wrong in systematic ways.

The prerequisite stack is: reliable ingestion pipelines that deliver complete, timely data; a transformation layer that enforces data quality with automated tests; a semantic layer that codifies business logic as a programmatic API; and monitoring infrastructure that catches pipeline failures before agents query stale data.

Every shortcut in that foundation shows up in the quality of agent output. The agent is the proof that the data platform is trustworthy — not a substitute for building the platform correctly.

Where We Are Now

LLMs are capable of sophisticated reasoning over structured data — the capability gap is not the bottleneck.
Most organisations have the data but lack the governance layer that makes AI reasoning trustworthy.
The highest-leverage investment is the semantic layer — it directly enables AI agents and simultaneously improves every dashboard and self-service query.
Autonomous analytics is not a future concept — it is an engineering challenge with a known solution stack: governed DWH + semantic layer + AI agent with escalation logic.

The Takeaway

The agent is only as intelligent as the platform beneath it.

The path from DWH to AI agents is not a technology swap. It is a maturity progression. Each step — data engineering, analytics engineering, semantic layer design — is a prerequisite for the next. Organisations that invest in the full stack get AI agents that reason correctly. Those that skip the foundation get confident AI that hallucinates in ways nobody can detect.