Machine learning depends on quality data

Machine learning and artificial intelligence are the new hot topics in data analytics. These topics define subsets of data science that are primarily characterized by mathematical and statistical processes applied to data. In machine learning, algorithms replace humans in interpreting data. The expectation is that the machine will make purely data-driven ( better ) decisions…

Monkeys, bananas and machine learning

There is an old story about an experiment involving nine monkeys. Four monkeys are placed in a cage. Every day, a plate of fresh fruit is placed into the cage. As the monkeys reach for the food the keepers come in and beat them. Over the course of a few weeks the monkeys learn not…

The Impact of Poor Data Quality on Machine Learning

We are surrounded by huge amount of data. Data is everywhere and is gaining huge importance and relevance in today’s world. There are many firms that are performing tasks of gathering, retrieving and managing data. This requires systems that can help us handle that much amount of data. Machine Learning has helped us in gathering…

gender bias

Is “Bias” the 7th big data quality metric

A few weeks back I wrote about the The 6 dimensions of big data quality. These are: Coverage – how well does the data source meet (or fail to meet)  the business need? Continuity – How well does the data set cover all expected or needed intervals? Triangulation – How consistent is data when measured form…