Data scientists must see the story behind the data.

“Some people look at data and see integers, booleans and strings. When I look at data, I always wonder what the story is behind it.” – Stefan Groschupf, data scientist and CEO of Datameer. Like Stefan, I see the stories behind the data. This is what makes data interesting, what makes big data relevant and…

Cosmos in the Free State

Data profiling lessons from the Boer War

South Africa is a beautiful country. While international tourists may fly in to well-known destinations, such as Cape Town or the Kruger National Park, locals tend to use the roads. A common sight at this time of the year is the swathes of wild Cosmos lining the roadsides throughout much of the interior.…

Garbage In

Clean data versus more data

A SearchCIO article asks, “When does more data trump clean data?” The article highlights a common misconception in the “dirty data” argument, starting with the line “The days of scrubbing data until it’s squeaky clean are quickly becoming a luxury.” Data quality is not, and never has been, about data being squeaky clean…

Poor data quality: The fourth path to financial ruin?

My father worked in agriculture for his entire career. One of his old jokes describes the three paths to financial ruin: gambling, women and farming. Gambling is the quickest, women are the most fun, but farming is the surest. Is poor data quality the fourth path to financial ruin? Poor data quality has…