Superset in Trucking IoT
Overview
NOTICE
As of January 31, 2021, this tutorial references legacy products that no longer represent Cloudera’s current product offerings.
Please visit recommended tutorials:
- How to Create a CDP Private Cloud Base Development Cluster
- All Cloudera Data Platform (CDP) related tutorials
Introduction
Superset is a Business Intelligence tool with many features for designing and maintaining meaningful data visualizations and for telling stories with data. The trucking company you work for has a Trucking IoT Application that processes the truck and traffic data it receives from sensors, but the business leaders are not able to make sense of the data. They hired you as a Data Visualization Analyst to tell a story by visualizing this application's data, such as how traffic congestion levels impact truck driver performance, which ultimately affects the company. Communicating your insights clearly will help persuade business leaders to act on your recommendations.
Objective
- Learn Data Visualization Concepts
- Become familiar with Apache Superset
- Learn to Design Visualizations with Superset
Prerequisites
- Downloaded and deployed the Hortonworks Data Platform (HDP) Sandbox
- Must have 32GB of dedicated RAM for HDP Sandbox
- Enabled Connected Data Architecture:
Outline
- Superset Concepts - Covers the fundamental concepts of Data Visualization and Superset.
- Setting up the Development Environment - Set up the hostname-to-IP-address mapping, set the Ambari admin password, turn on the services Superset needs, and turn on Superset.
- Visualizing Trucking Data - Shows how to visualize data using Superset.
Tutorial Reference Application
This tutorial series uses our Trucking IoT Application, which is composed of multiple sub-projects. You will build the Superset visualization sub-project.