Eighty-twenting data

almeidajm

80/20 rule, critical data elements, data quality, pareto principle

Eighty-twenting data

Discover the power of the Pareto principle in data quality management. Learn how to identify the crucial 20% of data that drives business value and efficiency. Explore insights from Joseph M. Juran’s quality management and optimize your data assets for maximum impact.

I entered the world of data, starting with data quality, making quality one of the foundational themes for all the work I’ve been producing since.

“If I have seen further it is by standing on the shoulders of Giants” is one of my favourite quotes, belonging to Isaac Newton, working as a reminder that everything we know and do is a compound of work done before us.

If I have seen further it is by standing on the shoulders of Giants
Isaac Newton

One of these giants is Joseph M. Juran, whose work in the field of quality management is still a reference.

Introducing the Pareto principle

So why am I bringing Juran here? Mainly because he introduced the Pareto principle to quality issues, verifying that a small percentage of root causes contributes to a high percentage of defects.

The Pareto principle or 80/20 rule, follows the observations of economist Vilfredo Pareto, whose studies showed that 80% of the land in Italy was owned by 20% of the population.

Although I’ve frequently used this principle while dealing with data quality issues, this is a principle – even though there is little scientific analysis that either proves or refutes its validity – that is frequently used in many different fields.

Prioritising Critical Data

This is also true when reflecting on some of the issues faced by those who have responsibilities in the data management area, and if correctly applied can bring a better understanding of the issues and possibly, additional benefits, cutting costs, and increasing some efficiencies – Or at least to be used as a tool to identify priorities.

Putting it in a different way – considering data as a corporate asset – the rule allows an organization to identify its best assets and use them efficiently to create maximum value.

Keep in mind that 80-20 is only a guideline, it’s in fact almost a branding name. What those two numbers measure are outputs and inputs, not even necessarily using the same units. So, it can easily be 70-30, 90-10 or whatever combination.

Questions to Prioritise Data Issues

What I’m proposing here is a questioning exercise, that will allow in certain situations to do a more efficient allocation of resources, or even help to define future investments.

Asking questions like:

Which 20% of data produces most business valuable insights?
Which 20% of data is more critical for business continuity?
Which 20% of data is more liable to security risks?
Which 20% of data is more frequently accessed?
Which 20% of data is less frequently updated?
Which 20% of data is more critical for regulatory purposes?
Which 20% of data is taking more processing time in loading and transformation processes?
Which 20% of data is the cause for most of the data quality problems? *

These are just a few examples of questions that can be put and that can in some situations lead to a change in perspective followed by some specific actions, especially when we start crossing the answers from different questions.

As an example, trying to identify the 20% of data that is most valuable to the organization, will allow us to better prioritize and define any future data initiative, review the current ones or even adapt ongoing initiatives to maximize the efficiency of the data architecture overall.

_____

Jose Almeida has over 20 year’s experience in the delivery of data management solutions in Europe, the Middle East and Africa. This article was first published on his blog and is reprinted with permission

Tags:

80/20 rule, critical data elements, data quality, pareto principle

Eighty-twenting data

Leave a comment Cancel reply

Related posts

Eighty-twenting data

Introducing the Pareto principle

Prioritising Critical Data

Questions to Prioritise Data Issues

Share this:

Leave a comment Cancel reply

Related posts

Discover more from Data Quality Matters