What is the role of the data engineer?

Data engineering is the term that has emerged to describe the tasks related to delivering useful data for analytics – particularly in relation to data science. With between 60% and 90% of the effort of most big data projects allocated to data engineering tasks, the role has matured as organisations found that traditional data scientists…

The role of a data steward

Not so long ago I received an email which, after I gave my personal details, allowed me to download a Gartner report. The report talks about the difficulty organisations have understanding and explaining the role of a data steward. The author starts by mentioning some misconceptions around the role of a data steward, such as: It’s a full-time…

What is a business term?

Last week, we looked at why we need a business glossary. This week we will discuss what we capture in a business glossary. First, and most obviously, we capture business terms. Business terms: A business term is a word or phrase that describes a concept that is used in a particular branch of business. Examples…

12 blogs of Christmas

Today is Christmas Day – I thought I would share a collection of Christmas-related posts from around the Internet. Happy Christmas everyone! The 12 days of data – some interesting stats on the well-known Christmas song; Christmas in numbers – 10 of the best uses of Christmas visualization; The relativity of Einstein, Elephants,…

Four steps to transforming data

Another brief post this week on an area that we do not focus on very often: data transformation. Data transformation is a relatively mundane yet fundamental data management capability – particularly when dealing with similar data from multiple sources. Three simple examples: System A represents Male and Female as 0 and 1, while System B…

Machine learning depends on quality data

Machine learning and artificial intelligence are the new hot topics in data analytics. These topics define subsets of data science that are primarily characterized by mathematical and statistical processes applied to data. In machine learning, algorithms replace humans in interpreting data. The expectation is that the machine will make purely data-driven (better) decisions…