Tag Archives: data science

The Joy of Data Viz: The Data You Weren’t Looking For

December 8, 2015

“A good sketch is better than a long speech” – Napoléon Bonaparte I recently came across this quote on the opening page of Phil Simons’ book,  “The Visual Organization, Data Visualization, Big Data, and the Quest for Better Decisions.” It is available online. Data visualization, or Data Viz as it is often referred to, is… Read More »

Data Modeling Requires Detailed Mapping — Learn Why

November 30, 2015

Mapping is an important step to understanding your data and where the data resides in your ecosystem. Mapping takes us from the known to the unknown and is effectively accomplished by using mapping tools, adopting best practices, and having a common understanding of how the mappings will be used. But mapping does take a considerable amount… Read More »

Pluralism and Secularity In a Big Data Ecosystem

August 25, 2015

Solutions around today’s analytic ecosystem are too technically driven without focusing on business values. The buzzwords seem to over-compensate the reality of implementation and cost of ownership. I challenge you to view your analytic architecture using pluralism and secularity. Without such a view of this world your resume will fill out nicely but your business… Read More »

Optimization in Data Modeling 1 – Primary Index Selection

July 14, 2015

In my last blog I spoke about the decisions that must be made when transforming an Industry Data Model (iDM) from Logical Data Model (LDM) to an implementable Physical Data Model (PDM). However, being able to generate DDL (Data Definition Language) that will run on a Teradata platform is not enough – you also want… Read More »

Why We Love Presto

June 24, 2015

Concurrent with acquiring Hadoop companies Hadapt and Revelytix last year, Teradata opened the Teradata Center for Hadoop in Boston. Teradata recently announced that a major new initiative of this Hadoop development center will include open-source contributions to a distributed SQL query engine called Presto. Presto was originally developed at Facebook, and is designed to run… Read More »

PARTNERS DataDive: I Put Agile on Steroids & Almost Went Back in Time!

June 17, 2015

I recently participated in a business analytics project for non-profits that, as the planning progressed, seemed like a perfect opportunity to implement an agile approach, except that the work was to be completed in two days! But all the developers would be co-located. We had three objectives that fit the profile of user stories. We… Read More »

Making SAP data relevant in the world of big data

May 4, 2015

Part one of series about an old “SAP”dog who learns a new trick Reflecting back on the key messages from Teradata Universe 2015 in April it was impossible to escape the theme of deriving differentiated business value leveraging the latest data sources and analytic techniques. I heard from several customers how they improved their business by… Read More »

Harness Cross-Functional Centrality of Data Analytics for Competitive Advantage – Part 1 of 2

April 7, 2015

High Level Data Analytics Graph (Healthcare Example)  <—- Click on image to view GRAPH ANIMATION Michael Porter, in an excellent article in the November 2014 issue of the Harvard Business Review[1], points out that smart connected products are broadening competitive boundaries to encompass related products that meet a broader underlying need. Porter elaborates that the boundary shift… Read More »

Hybrid Row-Column Stores: A General and Flexible Approach

March 10, 2015

During a recent meeting with a post-doc in my lab at Yale, he reminded me that this summer will mark the 10-year anniversary of the publication of C-Store in VLDB 2005. C-Store was by no means the first ever column-store database system (the column-store idea has been around since the 70s — nearly as long… Read More »

Data-Driven Design: Smart Modeling in the Fast Lane

February 24, 2015

In this blog, I would like to discuss a different way of modeling data regardless of the method such as Third Normal Form or Dimensional or Analytical datasets. This new way of data modeling will cut down the development cycles by avoiding rework, be agile, and produce higher quality solutions. It’s a discipline that looks… Read More »