Your browser is out of date

Update your browser to view this website correctly. Update my browser now


This four-day instructor-led training course provides students with the foundational knowledge required to plan, deploy, configure, and manage a cluster running the Hortonworks Data Platform (HDP).

What You Will Learn

Students who successfully complete this course will learn how to administer Apache Hadoop and the Hortonworks Data Platform (HDP). You will be able to:

  • Install the Hortonworks Data Platform
  • Manage Hadoop services
  • Use and manage Hadoop Distributed File System (HDFS) Storage
  • Configure rack awareness
  • Manage cluster nodes and cluster node storage
  • Use HDFS snapshots and Distributed Copy (DistCp)
  • Configure heterogeneous storage and HDFS centralized cache
  • Configure an HDFS NFS gateway and NameNode high availability
  • Describe the View File System (ViewFS)
  • Manage YARN resources and run YARN applications
  • Configure the YARN capacity scheduler, containers, and queues to manage computing resources
  • Configure YARN node labels and YARN ResourceManager high availability
  • Manage Ambari alerts
  • Deploy an HDP cluster using Ambari blueprints
  • Upgrade a cluster to a newer version of HDP

What to Expect

This course is designed primarily for system administrators and system operators responsible for installing, configuring, and managing an HDP cluster.

Students must have experience working in a Linux environment with standard Linux system commands. Students should be able to read and execute basic Linux shell scripts. In addition, we recommend that students have some operational experience in data center practices.

Book the course

How would you like to train?

Course Contents

Hortonworks Data Platform 3.1.0

  • Apache Hadoop
  • Hortonworks Data Platform Frameworks
  • Hadoop Cluster Management
  • Apache Ambari

Installing the Hortonworks Data Platform

  • Hadoop Deployment Options
  • Planning a Hadoop Cluster Deployment
  • HDP Installation Using Apache Ambari
  • Installing Ambari

Managing Ambari Users and Groups

  • Ambari Users vs. Hadoop Users
  • Managing Users, Groups, and Permissions

Managing Hadoop Services

  • Core Hadoop Configuration Files
  • Ambari Web UI
  • Managing Hadoop Configuration Properties with Ambari
  • Client Configuration Files

Using HDFS Storage

  • Hadoop Distributed File System (HDFS)
  • HDFS Shell Operations
  • Ambari Files View
  • Using Web HDFS
  • Using HDFS Access Control Lists

Managing HDFS Storage

  • HDFS Architecture and Operation
  • Managing HDFS Using UIs
  • Managing HDFS Using Command-Line Tools
  • Managing HDFS Quotas

Configuring Rack Awareness

  • Rack Awareness
  • Configuring Rack Awareness

Managing Cluster Nodes

  • Working with Cluster Nodes
  • Adding a Worker Node
  • Decommissioning and Recommissioning a Worker Node
  • Deleting a Worker Node
  • Moving a Master Component

Managing Cluster Nodes Storage

  • Rebalance HDFS
  • The HDFS Disk Balancer

HDFS Snapshots

  • Hadoop Backups
  • Using HDFS Snapshots
  • Using DistCp

Configuring Heterogeneous HDFS Storage

  • HDFS Storage
  • HDFS Storage Types and Policies
  • Configuring Storage Types and Policies

Configuring HDFS Centralized Cache

  • HDFS Centralized Cache

Managing the HDFS NFS Gateway

  • The HDFS NFS Gateway

High Availability for NameNode

  • NameNode HA
  • Configuring NameNode HA using Ambari

Configuring View File System

  • View File System
  • Configuring View File System

YARN Resource Management

  • YARN Resource Management
  • YARN Architecture and Operation
  • YARN Management Options
  • YARN Component Failure Management

YARN Applications

  • Understanding YARN Application Basics

The YARN Capacity Scheduler

  • Capacity Scheduler Operation
  • Configuring and Managing YARN Queues
  • Queue Access Control

YARN Node Labels

  • YARN Node Labels

High Availability for YARN Resource Manager

  • Resource Manager HA
  • Configuring Resource Manager HA using Ambari

Monitoring a Cluster

  • Ambari Metrics
  • Ambari Dashboard
  • Ambari Alerts
  • Configuring Ambari Alerts

Automating Cluster Provisioning Using Ambari Blueprints

  • Blueprint Scenarios
  • Blueprint Usage Overview
  • Logical Cluster Definition File
  • Cluster Creation Template File
  • Host Creation Template File
  • Configuration Property Best Practices

Upgrading HDP

  • The HDP Stack
  • HDP Upgrade Types
  • Preparing Databases and HDFS
  • Registering a New HDP Version
  • Installing the New HDP Version
  • Express Upgrade
  • Example of a Rolling Upgrade

Cloudera’s instructor was excellent, offering clear and concise training that was easy to understand. His wide-ranging peripheral knowledge helped apply the course materials to real-world situations. I look forward to attending another course.


Your form submission has failed.

This may have been caused by one of the following:

  • Your request timed out
  • A plugin/browser extension blocked the submission. If you have an ad blocking plugin please disable it and close this message to reload the page.