Your browser is out of date

Update your browser to view this website correctly. Update my browser now

×

Cloudera's OnDemand training course for CDP Public Cloud provides the fundamental knowledge necessary to carry out the planning, provisioning, configuration, monitoring, and management tasks required of an administrator for the Cloudera Data Platform (CDP) Public Cloud deployment. This course uses the CDP web interface extensively and also provides information about the use of the CDP Command Line Interface (CLI). 

What You Will Learn

Through instructor-led discussion and demonstrations, you will learn how to:

  • Describe the architecture and key components of CDP Public Cloud
  • Access and navigate the CDP Public Cloud web interface
  • Understand and use important features of the CDP Management Console
  • Configure an Identity Provider (IdP) for identity federation
  • Set up users and groups with appropriate access and roles
  • Register an environment to connect CDP with a public cloud provider account
  • Configure and manage Data Hub clusters
  • Provision and configure data warehouses and machine learning workspaces
  • Describe the role of SDX services, such as Apache Ranger and Apache Atlas, in data security and governance
  • Use Cloudera Manager and Workload Manager for monitoring and management
  • Troubleshoot clusters, services, and jobs
  • Set up and use Replication Manager for backup, recovery, disaster recovery, and data migration
  • Install, configure, and use the CDP Command Line Interface (CLI)

 

Book the course

How would you like to train?

What to Expect

This course is best suited to systems administrators. Students should have experience working in a Linux environment with standard Linux system commands. Students should be able to read and execute basic Linux shell scripts and have some experience with the JSON data format. In addition, it is recommended for students to have some operational experience with cloud computing practices and exposure to big data concepts and applications. 

Course Topics

The Enterprise Data Cloud

  • Industry Trends for Big Data
  • The Challenge to Become Data-Driven
  • The Enterprise Data Cloud

The Cloudera Data Platform

  • CDP Overview
  • CDP Form Factors

CDP Public Cloud Architecture

  • CDP Public Cloud Architecture Overview
  • Environments
  • Data Lake
  • Storage
  • Compute

Getting Started with CDP Administration

  • The Role of a CDP Administrator
  • Tasks for Getting Started as a CDP Administrator

Accessing and Navigating CDP

  • Accessing CDP
  • Navigating the CDP Console

CDP Tour

  • Accessing CDP
  • The Dashboard
  • Overview of CDP Services and Applications

User and Group Management

  • User and Group Management Overview
  • Onboarding Users
  • CDP User Accounts
  • CDP Groups
  • Roles and Resource Roles
  • API Access Keys
  • SSH Keys

Environments

  • Environment Overview
  • Environment Prerequisites
  • Registering an Environment
  • Accessing and Managing an Environment

Data Hub Clusters

  • Data Hub Cluster Overview
  • Data Hub Cluster Key Concepts
  • Data Hub Deployment Planning
  • Basic Data Hub Creation
  • Accessing Data Hub Clusters
  • Managing Data Hub Clusters
  • Advanced Cluster Configuration
  • Troubleshooting Clusters

Setting Up a Data Warehouse

  • Data Warehouse Service Overview
  • Activating an Environment
  • Adding and Managing a Database Catalog
  • Adding and Tuning a Virtual Warehouse
  • Querying a Data Warehouse
  • Event Stream Analytics
  • Monitoring with Grafana
  • Troubleshooting Cloudera Data Warehouse

Provisioning a Machine Learning Workspace

  • Cloudera Machine Learning Overview
  • CML Engines
  • AWS Requirements for CML Workspaces
  • Provisioning a CML Workspace
  • CML Workspace Administration
  • CML Auto-Scaling
  • Monitoring with Grafana

Monitoring and Management

  • Monitoring and Management in CDP Public Cloud
  • Getting Started with Monitoring in CDP
  • Data Lake Monitoring and CDP Auditing
  • Monitoring with Cloudera Manager and Workload Manager
  • Monitoring Clusters, Hosts, Services, and Activities
  • Troubleshooting Cluster Configuration and Operation

Classic Clusters

  • Understanding Classic Clusters
  • Prerequisites for Adding Classic Clusters
  • Registering Classic Clusters
  • Managing Classic Clusters in CDP

Command Line Interface

  • CDP CLI Overview
  • Setting up the CDP CLI
  • Configuring and Working with the CDP CLI
  • CLI Modules

Replication Manager

  • Replication Manager Overview
  • Replication Use Cases
  • The Replication Manager Service
  • Preparing to Set up Replication
  • Replication Policies and Policy Operations

Security and Governance

  • Security and Governance in CDP
  • Security Overview
  • Data Lake Security: SDX 
  •  Authorization and Auditing with Apache Ranger
  • Data Governance Overview
  • Lineage and Governance with Apache Atlas
  • Data Catalog: Understand, Secure, and Govern Your Data

Migration to CDP Public Cloud

  • Overview of the Migration Process
  • CDH and HDP Migration to CDP Public Cloud
  • Using the Migration Guides
  • Migration of Data, Components, and Services

Learn more

CCA Spark and Hadoop Developer Certification

This course is excellent preparation for the CCA Spark and Hadoop Developer exam. Although we recommend further training and hands-on experience before attempting the exam, this course covers many of the subjects tested. 

Certification is a great differentiator. It helps establish you as a leader in the field, providing employers and customers with tangible evidence of your skills and expertise.

Advance your career

Big data developers are among the world's most in-demand and highly-compensated technical roles. Check out some of the job opportunities currently listed that match the professional profile, many of which seek CCA qualifications.

Private training

We also provide private training at your site, at your pace, and tailored to your needs.

Your form submission has failed.

This may have been caused by one of the following:

  • Your request timed out
  • A plugin/browser extension blocked the submission. If you have an ad blocking plugin please disable it and close this message to reload the page.