Your browser is out of date

Update your browser to view this website correctly. Update my browser now



This course provides a technical overview of Apache Hadoop. It includes high-level information about concepts, architecture, operation, and uses of the Hortonworks Data Platform (HDP) and the Hadoop ecosystem. The course provides an optional primer for those who plan to attend a hands-on, instructor-led course.

This course is offered in both Live Instructor-Led format, or get started now with our FREE self-paced Apache Hadoop Essentials course.

Register for free Self-Paced Course

  • Prerequisites
    No previous Hadoop or programming knowledge is required.Students will need browser access to the Internet.
  • Target Audience
    Data architects, data integration architects, managers, C-level executives, decision makers, technical infrastructure team, and Hadoop administrators or developers who want to understand the fundamentals of Big Data and the Hadoop ecosystem.

Book the course

How would you like to train?
Day 1

HDP Overview: Apache Hadoop Essentials

  • The Case for Hadoop
  • The Hadoop Ecosystem
  • HDFS Architecture
  • Ingesting Data
  • Parallel Processing
  • Apache Hive Overview
  • Apache Pig Overview
  • Apache Spark Overview
  • YARN Architecture
  • Hadoop Security
  • Operational Overview with Ambari
  • Loading Data into HDFS
  • Streaming Data into HDFS
  • Processing with MapReduce
  • Data Manipulation with Hive
  • Risk Analysis with Pig
  • Risk Analysis with Spark
  • Securing Ranger with Hive

Your form submission has failed.

This may have been caused by one of the following:

  • Your request timed out
  • A plugin/browser extension blocked the submission. If you have an ad blocking plugin please disable it and close this message to reload the page.