Data Quality Framework in 2026: Tests, Monitoring, and Observability
Data quality has matured into operational discipline. Where the framework actually sits in 2026.
Data quality stopped being a buzzword the moment finance teams started building dashboards on top of dbt models and discovered that “revenue” had three different definitions across four marts. By 2026, the conversation has moved from awareness to operational discipline: who owns the test, what the SLA looks like, how an incident gets paged, and how the executive scorecard rolls up. The tools are mature, the patterns are well known, and the differentiator is no longer “do you run tests” but “do your tests reflect a contract that someone has signed up to enforce.”
This post walks through what a working data quality framework looks like in 2026 — the six dimensions, the tool stack, the rollout sequence, and the metrics that actually surface to the C-suite.

The six dimensions that frame everything#
Most mature programs anchor their framework on six classical dimensions, and the language is worth pinning down because it shows up in every contract and SLA.
Accuracy — values reflect the real-world thing they represent. An order total matches what the customer was charged.
Completeness — required fields are populated and required rows exist. No null customer_id on transactions, no missing days in the daily-active-users table.
Consistency — the same entity has the same value across systems. The customer record in Salesforce, the warehouse, and the billing system agree.
Timeliness — data lands within its freshness SLA. The marketing dashboard reflects yesterday’s spend by 9 a.m., not 11.
Uniqueness — primary keys actually are primary. No duplicate orders, no double-counted leads.
Validity — values conform to the expected format and domain. ISO country codes, valid email patterns, currencies within an expected enum.
Some teams add a seventh — integrity, which covers referential constraints — but the six above cover roughly 95% of what tests get written against.
The 2026 tool stack#
The tooling has converged on a layered model: cheap inline tests near the transformation, richer validation in CI, and observability on top of production data.
dbt tests remain the most-deployed layer. Generic tests (unique, not_null, accepted_values, relationships) plus dbt_expectations and elementary cover the bulk of model-level coverage. They run in CI and on every production build, which is exactly where most quality regressions get caught.
Great Expectations is the heavyweight validation framework — expectation suites, data docs, profiling. It tends to live upstream of the warehouse: raw landing zones, Spark jobs, ML feature pipelines.
Soda Core and Soda Cloud sit between dbt and a full observability platform. SodaCL is a readable YAML check language; Soda Cloud adds collaboration, incident routing, and dashboards.
Monte Carlo is the category leader in data observability. It ships ML-driven anomaly detection on row counts, freshness, schema, and distribution without requiring you to author the check by hand. Pricing is enterprise-tier, and the value lands when you have more than 200 critical tables to watch.
Bigeye, Anomalo, Acceldata, and Datafold round out the observability and diffing layer. Datafold’s value prop — diff-based PR checks that show how a dbt change moves real data — has become a default in pull-request workflows at serious teams.
The right mix is usually dbt tests for everything, Soda or Great Expectations for landing-zone contracts, and Monte Carlo or Bigeye for production observability on the top 100 tables that matter.
How to actually roll it out#
Programs that fail tend to start with a tool selection. Programs that work start with a small list of critical tables and an owner.
The rollout sequence we see work most often: pick the 20 most consequential tables (the ones the CFO, the head of growth, and the head of product look at every day), publish a written contract for each — schema, freshness SLA, owner, escalation path — then bolt on dbt tests for the cheap dimensions, Soda for landing-zone validation, and Monte Carlo on the same 20 tables for anomaly coverage. Once that is live and on-call rotations actually fire, you expand the perimeter to the next 80 tables and standardize the pattern.
Trying to instrument 2,000 tables on day one is how programs die. The 100-table beachhead pattern is boring, repeatable, and works.
Data contracts as the connective tissue#
Tests without contracts produce noise. By 2026, the mature approach pairs every quality check with a written contract between producer and consumer. The contract names the schema, the freshness SLA, the allowed null rate, the owner, and the breaking-change policy. Tools like Gable, Schemata, and increasingly first-party support in Confluent Schema Registry and the warehouse-native catalogs (Unity Catalog, Snowflake Horizon, Polaris) have made this less artisanal than it was in 2024.
The relationship is straightforward: the contract specifies the expectation, the test enforces it, and the observability platform watches for the drift the test does not catch.
Executive metrics that actually matter#
The CFO does not want to read a list of failed dbt tests. The metrics that travel upward are simpler and more honest:
- Critical-table SLA hit rate — of the top 100 tables, what percentage met their freshness and quality SLA last month.
- Mean time to detection and mean time to resolution for data incidents, tracked the same way SRE tracks site incidents.
- Open data incidents by severity with an owner against each one.
- Coverage — percentage of critical tables under contract, percentage with monitoring, percentage with a named owner.
When the executive scorecard reads like an SRE scorecard, the program gets funded. When it reads like a tool inventory, it gets cut.
Where pdpspectra fits#
Our data engineering practice builds and operates the quality layer — dbt test suites, Soda or Monte Carlo deployment, contract authoring, on-call playbooks, and the executive scorecard that makes the program legible to finance. We do the boring 100-table beachhead, not the buzzword tour.
Related reading: the data contracts post, the dbt advanced patterns post, and the data stack as operational engine post.
Data quality is operational discipline, not a tool purchase. Talk to our team about your data platform.