The Impact of Poor Data Quality on Machine Learning

We are surrounded by a huge amount of data. Data is everywhere and is gaining importance and relevance in today’s world. Many firms gather, retrieve and manage data, which requires systems that can handle data at that volume. Machine Learning has helped us in gathering…

gender bias

Is “Bias” the 7th big data quality metric?

A few weeks back I wrote about the 6 dimensions of big data quality. These are: Coverage – how well does the data source meet (or fail to meet) the business need? Continuity – how well does the data set cover all expected or needed intervals? Triangulation – how consistent is data when measured from…

The 6 dimensions of big data quality

Historically, data quality has been measured in terms of dimensions including: Accuracy – the degree to which data reflects the real world; Completeness – data that is adequately populated; Timeliness – that data is available when expected and needed; Consistency – that data across all systems reflects the same reality; Conformity – that data has…