The Data Readiness Index 2026: Understanding the Foundations for Successful AI

See the results

Overview

This 3-day course by Cloudera Education introduces participants to implementing enterprise data governance and lineage using Cloudera Data Governance and Cloudera Data Lineage. Learners will gain practical knowledge of how to manage, control, classify, and trace data across the enterprise while supporting regulatory and compliance requirements such as GDPR and HIPAA.

The course covers key governance concepts including data classification, metadata management, data protection, lineage tracking, impact analysis, and policy enforcement to improve data transparency, regulatory compliance, and overall trust in enterprise data ecosystems.

Download full course description 

What Skills You Will Gain?

Participants Will Develop the Ability To:

  • Understand the role of Cloudera Data Lineage within a modern data governance framework

  • Navigate the Cloudera governance and lineage interfaces with confidence

  • Connect and analyze data sources for lineage and metadata visibility

  • Perform end-to-end lineage analysis for impact assessment and troubleshooting

  • Organize and catalog data using classifications and business glossary terms

  • Use profiling capabilities to better understand and manage data assets

  • Create and apply resource- and tag-based access control policies.

  • Implement data masking and row-level filtering policies for compliance

  • Leverage metadata intelligence and AI-driven insights for governance optimization

Who Should Take This Course?

This course is designed for data professionals responsible for managing, governing, and understanding enterprise data using the Cloudera platform and Cloudera Data Lineage. It is ideal for data stewards, data governance professionals, data analysts, data engineers, BI developers, and solution architects who are involved in regulatory compliance, metadata management, lineage analysis, and data access control.

Book the course

Course Details

Course Overview

  • Course Type: Instructor-led training

  • Level: Intermediate

  • Duration: 3 Days

  • Platform: Cloudera on Premises - Cloudera Data Lineage

Topics Covered:

  • Introduction to Cloudera Data Lineage

  • Auditing and Access Control

  • Connectivity and Data Source Integration

  • Cloudera Connector Factory.

  • Platform Navigation and Data Discovery

  • Deep Dive into Lineage and Impact Analysis

  • Knowledge Hub and Data Catalog Management

  • AI-Driven Insights and Optimization(Cloudera Lineage AI Assistant)

Introduction

  • Team activity: Team Introductions

Lineage

  • Inspecting Lineage

  • Propagation and Lineage in Atlas

  • Inspecting Lineage in Atlas

Data Governance Overview

  • What Is Data Governance?

  • Basic Concepts

  • SDX: Data Governance in Cloudera

Access Controls

  • Apache Ranger Basics

  • Creating Users and Roles

  • Resource-Based Policies

  • Tag-Based Policies

  • Securing Metadata Objects

  • Providing Partial Access

Organizing Data Objects

  • Searching for Objects by Type

  • Classifications

  • Glossary Terms

Auditing

  • Auditing Overview

  • Viewing Audit Information

Managing the Data Lifecycle

  • Governing the Data Lifecycle

Working with Data Catalog

  • Data Catalog Overview

  • Sensitive Data Profiler

  • Defining and Monitoring Data Quality

  • Preparing for Audits Using Data Catalog

  • Collaborating

Introduction to Cloudera Data Lineage & Core Concepts

  • What is Cloudera Data Lineage?

  • Understanding Data Lineage and its Importance

  • Overview of Cloudera data lineage's Key "Spaces"

  • Lab: Access to Cloudera Data Lineage Environment

Deep Dive into Lineage Space

  • Understanding Data Lineage Types in Cloudera Data Lineage 

  • Cross system Lineage in Cloudera Data Lineage 

  • Inner System Lineage in Cloudera Data Lineage 

  • End-to-End Column Lineage for Impact and Root Cause Analysis

  • Lab: Navigating to Cloudera Data Lineage Lineage Space

Cloudera Data lineage Connectivity & Data Source Integration

  • Overview of Cloudera Data lineage Connectivity

  • Cloudera Data Lineage Agent Installation

  • Types of Connectors

  • Demo: Cloudera Data Lineage Agent Installation

  • Demo: Cloudera Data Lineage Metadata Extraction

Live Lineage & Cloudera Lineage AI Assistant

  • Introduction to Live Lineage

  • Key features of Live Lineage

  • Introduction to Cloudera Lineage AI Assistant

  • Key Capabilities of Cloudera Lineage AI Assistant

  • Lab: Optimize & Validate Using Live Lineage & Cloudera Lineage AI Assistant

Cloudera Connector Factory

  • Overview of Cloudera Connector Factory

  • Cloudera Connector Factory Instructions

  • Types of Templates

  • Log Troubleshooting & Extraction Validation

  • Demo: Cloudera Connector Factory

Managing Data Assets with the Knowledge Hub

  • Introduction to the Knowledge Hub and Insight Dashboard

  • Key Capabilities for Data Asset Management

  • Collaboration and Governance Features

  • Lab: Populating and Managing Assets in the Knowledge Hub

Navigating the Cloudera Data Lineage 

Platform & Discovery Space

  • Cloudera Data Lineage User Interface Orientation

  • Introduction to the Automated Discovery Space

  • Lab: Navigating to Cloudera Data Lineage  User Interface h Automated Discovery Space

Ready to Get Started?

Your form submission has failed.

This may have been caused by one of the following:

  • Your request timed out
  • A plugin/browser extension blocked the submission. If you have an ad blocking plugin please disable it and close this message to reload the page.