Cloudera and Tableau Enable Real-Time Business Intelligence with Apache Hadoop

Tableau allows you to connect to CDH, Cloudera’s 100% open-source Hadoop distribution, in minutes. Once connected, you can do ad-hoc visualization of the data so you can see patterns and outliers. You can’t get value from your data unless you can see what’s inside of it.

Date: Wednesday, Sep 18 2013

Description

If you’re using Hadoop, it’s probably because you’ve got data that is huge, unstructured, nested or all three. Hadoop’s distributed file system (HDFS) and the MapReduce framework support parallel processing across massive data or disparate data. Together, they let you work with data that traditional databases find extremely difficult to process. But storing and accessing big and messy data is only part of the problem. You’ve still got to make sense of it.

Tableau allows you to connect to CDH, Cloudera’s 100% open-source Hadoop distribution, in minutes. Once connected, you can do ad-hoc visualization of the data so you can see patterns and outliers. You can’t get value from your data unless you can see what’s inside of it.

“Tableau’s solution for Hadoop is one of the most elegant solutions I've seen.” said Ravi Bandaru, Product Manager of Advanced Analytics & Data Visualization at Nokia. “This obviates any need for us to move huge log data into Relational store before analyzing it with Tableau.”

Connecting to Hadoop with Cloudera’s ODBC driver is as easy as connecting to any other data source from Tableau, with no additional setup or programming required. This puts the powerful capabilities of Hadoop into the hands of anyone who wants to analyze data. And now, with Cloudera ImpalaTM, the industry’s first real-time query framework for Hadoop, users can also get interactive analytics directly on the massive volumes and variety of data stored in HDFS or HBase.

Next Steps