This four-day instructor-led training course provides students with the foundational knowledge required to plan, deploy, configure, and manage a cluster running the Hortonworks Data Platform (HDP).
What You Will Learn
Students who successfully complete this course will learn how to administer Apache Hadoop and the Hortonworks Data Platform (HDP). You will be able to:
- Install the Hortonworks Data Platform
- Manage Hadoop services
- Use and manage Hadoop Distributed File System (HDFS) Storage
- Configure rack awareness
- Manage cluster nodes and cluster node storage
- Use HDFS snapshots and Distributed Copy (DistCp)
- Configure heterogeneous storage and HDFS centralized cache
- Configure an HDFS NFS gateway and NameNode high availability
- Describe the View File System (ViewFS)
- Manage YARN resources and run YARN applications
- Configure the YARN capacity scheduler, containers, and queues to manage computing resources
- Configure YARN node labels and YARN ResourceManager high availability
- Manage Ambari alerts
- Deploy an HDP cluster using Ambari blueprints
- Upgrade a cluster to a newer version of HDP
What to Expect
This course is designed primarily for system administrators and system operators responsible for installing, configuring, and managing an HDP cluster.
Students must have experience working in a Linux environment with standard Linux system commands. Students should be able to read and execute basic Linux shell scripts. In addition, we recommend that students have some operational experience in data center practices.
Book the course
- How would you like to train?
- Instructor-Led
- Virtual Classroom
- Private
Course Contents
Hortonworks Data Platform 3.1.0
- Apache Hadoop
- Hortonworks Data Platform Frameworks
- Hadoop Cluster Management
- Apache Ambari
Installing the Hortonworks Data Platform
- Hadoop Deployment Options
- Planning a Hadoop Cluster Deployment
- HDP Installation Using Apache Ambari
- Installing Ambari
Managing Ambari Users and Groups
- Ambari Users vs. Hadoop Users
- Managing Users, Groups, and Permissions
Managing Hadoop Services
- Core Hadoop Configuration Files
- Ambari Web UI
- Managing Hadoop Configuration Properties with Ambari
- Client Configuration Files
Using HDFS Storage
- Hadoop Distributed File System (HDFS)
- HDFS Shell Operations
- Ambari Files View
- Using Web HDFS
- Using HDFS Access Control Lists
Managing HDFS Storage
- HDFS Architecture and Operation
- Managing HDFS Using UIs
- Managing HDFS Using Command-Line Tools
- Managing HDFS Quotas
Configuring Rack Awareness
- Rack Awareness
- Configuring Rack Awareness
Managing Cluster Nodes
- Working with Cluster Nodes
- Adding a Worker Node
- Decommissioning and Recommissioning a Worker Node
- Deleting a Worker Node
- Moving a Master Component
Managing Cluster Nodes Storage
- Rebalance HDFS
- The HDFS Disk Balancer
HDFS Snapshots
- Hadoop Backups
- Using HDFS Snapshots
- Using DistCp
Configuring Heterogeneous HDFS Storage
- HDFS Storage
- HDFS Storage Types and Policies
- Configuring Storage Types and Policies
Configuring HDFS Centralized Cache
- HDFS Centralized Cache
Managing the HDFS NFS Gateway
- The HDFS NFS Gateway
High Availability for NameNode
- NameNode HA
- Configuring NameNode HA using Ambari
Configuring View File System
- View File System
- Configuring View File System
YARN Resource Management
- YARN Resource Management
- YARN Architecture and Operation
- YARN Management Options
- YARN Component Failure Management
YARN Applications
- Understanding YARN Application Basics
The YARN Capacity Scheduler
- Capacity Scheduler Operation
- Configuring and Managing YARN Queues
- Queue Access Control
YARN Node Labels
- YARN Node Labels
High Availability for YARN Resource Manager
- Resource Manager HA
- Configuring Resource Manager HA using Ambari
Monitoring a Cluster
- Ambari Metrics
- Ambari Dashboard
- Ambari Alerts
- Configuring Ambari Alerts
Automating Cluster Provisioning Using Ambari Blueprints
- Blueprint Scenarios
- Blueprint Usage Overview
- Logical Cluster Definition File
- Cluster Creation Template File
- Host Creation Template File
- Configuration Property Best Practices
Upgrading HDP
- The HDP Stack
- HDP Upgrade Types
- Preparing Databases and HDFS
- Registering a New HDP Version
- Installing the New HDP Version
- Express Upgrade
- Example of a Rolling Upgrade