Blog PETER.PITKIN

Open in Google Translate
Technology & StrategyPost 13 of 13 · Series Finale

Data Problems Are Never About Data

Over the years I have been involved in numerous initiatives described as data problems — quality issues, reporting inconsistencies, integration challenges. In almost every case, the assumption that this was a technical matter was wrong. Data problems are almost never about data.

Over the years I have been involved in numerous initiatives that were described, at their outset, as data problems. Data quality issues. Reporting inconsistencies. Integration challenges where different systems produced different answers to the same question. In each case, the initial assumption was that this was a technical matter — a problem with data models, data cleanliness, or tooling. And in almost every case, that assumption was wrong.

Data problems are almost never about data. They are about the organisation that produces it.

01

The Pattern That Repeats

The symptoms are always recognisable: inconsistent reports, missing information, conflicting figures for the same business metric across different parts of the organisation. The initial response is also consistent: improve the data model, introduce better tooling, clean up the datasets. All logical steps — and all, in isolation, insufficient.

In one organisation, multiple reports showed different figures for the same business metric. The obvious assumption was data inconsistency somewhere in the pipeline. We investigated the systems, the interfaces, the transformation logic. The data itself was not fundamentally incorrect. What we eventually found was that different teams were defining the same metric differently. Same name, different calculation — each defensible in isolation, none aligned with the others. No amount of data cleansing would have resolved it, because the problem was not in the data. It was in the organisation's failure to agree a shared definition.

The data was accurately reflecting the organisation. The organisation just did not like what it saw.

02

The Real Causes

The same underlying causes appear across sectors, organisation types, and data maturity levels. None of them are data problems. All of them produce data problems.

Cause 01
No Clear Ownership
Who actually owns this data? In most organisations, the honest answer is: multiple teams contribute to it, but no one is clearly accountable for its quality. Ownership that is shared across multiple functions without a single named accountable party is ownership that exists in documentation but not in practice.
Cause 02
Inconsistent Definitions
The same term means different things in different parts of the organisation. "Customer," "active account," "revenue" — each may carry assumptions that vary between finance, operations, and commercial teams. Without a shared, enforced definition, consistency is structurally impossible regardless of how well the data is managed technically.
Cause 03
Fragmented Processes
Data reflects the processes that generate it. If the processes are inconsistent — if different teams record the same type of activity in different ways, at different times, with different levels of completeness — the data will be inconsistent. Improving the data infrastructure does not improve the underlying process fragmentation.
Cause 04
Organisational Silos
Different parts of the business optimise their own data management for their own purposes, producing multiple local versions of the truth. Each is internally consistent. None is aligned with the others. The result is not a data integration problem — it is an organisational alignment problem that manifests in the data.
03

The Technology Trap

The most consistent mistake I have seen in response to data quality problems is the assumption that technology will resolve them. New data platforms. New governance tooling. New architectures. Data lakes, data warehouses, data meshes — each generation of technology promises to solve the problem that the previous generation did not.

Technology can make good data governance easier to implement and easier to sustain. It cannot create the ownership, definitions, and organisational alignment that good data governance requires. A data catalogue populated with assets whose ownership is unclear does not become useful because the catalogue is well-designed. A data mesh with poorly defined domain boundaries does not produce coherent data products because the platform is well-built. The technology exposes the organisational problem — it does not solve it.

The organisations that invest heavily in data technology before resolving the underlying ownership and definition questions consistently find that they have built sophisticated infrastructure for managing data that is still inconsistent at its source.

04

What Actually Makes the Difference

The improvements I have seen that were both significant and durable shared a common characteristic: they addressed the organisational questions before, or alongside, the technical ones. Not instead of them — the technology matters — but in the right sequence.

1
Assign clear, non-shared accountability for data quality
Not a governance committee, not a shared responsibility between functions — a named individual who owns the outcome and has the authority to set and enforce standards for the data in their domain. This is the single highest-leverage intervention in any data quality programme.
2
Define key metrics precisely and enforce the definitions
Not aspirationally — with the specificity required to produce the same figure from two different systems. This work is slower and more contentious than it appears, because the differences in definition usually reflect real differences in how parts of the business think about their work. Surfacing and resolving those differences is a business conversation, not a data conversation.
3
Address the process fragmentation at source
Data quality problems that originate in inconsistent processes cannot be fully resolved downstream. The investment in harmonising how data is created — not just how it is stored, governed, and reported — is frequently underestimated and frequently the thing that makes the difference.
4
Then apply the technology
Once the ownership is clear, the definitions are agreed, and the source processes are aligned, technology does what it is actually good at: automating enforcement, improving visibility, making it easier to sustain standards at scale. In that sequence, the technology investment delivers its promise.
Final Thought — and a Final Reflection

Data is not the problem. It is a reflection of the organisation that produces it. If the organisation is unclear about ownership, inconsistent in its definitions, fragmented in its processes, and siloed in its decision-making — the data will be too. With great fidelity.

Fixing the data without fixing the organisation is cleaning a mirror rather than improving what it reflects.

That observation — along with the twelve that preceded it in this series — comes from fifty years of watching organisations struggle with the same underlying challenges: complexity, alignment, decision-making, and the ability to execute consistently over time. The technology changes. The challenges persist. I hope these reflections have been useful to those navigating them.

← PreviousThe Most Dangerous Words in IT: "We'll Fix It Later"Series CompleteReflections from 50 Years in ITAll 13 posts now published →

That's the complete series

All 13 posts from Reflections from 50 Years in IT are now published.

View the Full Series →