Four steps to transforming data

Another brief post this week on an area that we do not focus on very often: data transformation. Data transformation is a relatively mundane yet fundamental data management capability – particularly when dealing with similar data from multiple sources. Three simple examples: System A represents Male and Female as 0 and 1, while System B…

Why Hadoop

Hadoop: Quick Facts

Hadoop is a highly scalable framework for distributed storage and processing, often grouped with NoSQL technologies, used to perform high-speed analytics against large volumes of data. Hadoop works on the principle of schema on read, not schema on write. Any data (structured or unstructured) can be stored in Hadoop without first developing a schema. This cuts development timescales, reduces risk complexity…

Data Migrations: a good opportunity to improve data quality

Many large system implementations are driven by a perception (fueled by the sales team) that the new system will miraculously address data quality issues (in the existing system) that inhibit the business’s ability to function effectively. We have, for example, seen a number of clients replacing existing billing systems with large ERP packages. Similarly, a lot of…