Tag Archives: data lake

Is your DbFit?

Wednesday December 7th, 2016

In my previous blog I talked about modular design enabling continuous integration and automated testing through to delivery. Many customers I have worked with find the immediate challenge is how to do the testing, especially as it applies to a data warehouse or lake. In this blog I will look at how to set up simple,… Read More »

Schrödinger’s Cat and Big Data

Monday June 6th, 2016

Schrödinger’s Cat is a thought experiment developed by Erwin Schrödinger (1887-1961) to illustrate that micro-scale quantum effects can be made to produce quite bizarre effects in the real world. Being Slightly Dead: In this case, Schrödinger uses superposition on the atomic scale to affect the lifespan of a cat. The cat is placed in… Read More »

Deploy Hadoop Multiple Ways

Tuesday February 23rd, 2016

The current Hadoop market is dominated by two players, Cloudera and Hortonworks. Both are built on top of open source Hadoop and, from a software perspective, are very similar in their packaging, apart from a few differences in applications (Impala, Ambari, Ranger, Sentry, etc.) and in their support structures. Standing on the sidelines reminds… Read More »

Drowning in the Data Lake

Wednesday October 15th, 2014

IT people always try to use soothing names for complex propositions (don’t we all love the fluffy Cloud, being Service-Oriented [I sometimes wish that the restaurant sector would adopt this] or promising our customers that we are, above all, Agile?). The new buzzword is the data lake, which immediately brings to mind visions of calm… Read More »

Metadata Management in the Data Lake

Wednesday September 24th, 2014

I have spoken to many organisations recently and they are considering Hadoop platforms for a variety of reasons, not least the concept of a “data lake”. In my last blog I introduced the concept of a data lake, the benefits, its fit within an enterprise and why it’s not a data warehouse. However, whilst many are… Read More »