When data from disparate sources and in different formats is dumped into a data lake, you end up with a “data swamp”: unmanageable, un-navigable, and overwhelming for users.
In this highly requested session, we will dive into the “data swamp” problem and introduce the modern lakehouse paradigm. You will learn:
What core components make up a metadata catalog, and how to populate it with automated data lineage and metadata harvesting
How to use end-to-end search and discovery workflows to find new datasets, understand schema evolution, and assess data quality
Best practices for integrating metadata into your CI/CD pipelines to keep your catalog fresh
Ways to optimize resource usage while increasing operational efficiency
Keep your data from getting bogged down in a data swamp—explore how a robust metadata catalog can turn an ungoverned data lake into a trusted lakehouse.