How to Use Governance-Based Data Discovery

As of Cloudera Enterprise 5.11, Hue can use the metadata tagging, indexing, and search features available with Cloudera Navigator data management. After integrating Hue with Cloudera Navigator, existing Cloudera Navigator tags and indexed entities can be accessed and viewed in Hue, and entities can be tagged using Hue interfaces. Managed metadata and custom metadata tags created or applied using Hue are then stored in the Cloudera Navigator instance. This How To shows administrators how to enable this capability and SQL users how to use the feature.

SQL Users Get Started

This short tutorial shows you how to organize your data by creating new tags or reusing existing ones.

Applying Metadata Tags Using Hue

Prepare Hue Tables

As the Hue superuser, install sample tables and then refresh Impala metadata.
  1. Log in to Hue (as superuser) by selecting Hue Web UI in Cloudera Manager.
  2. Go to About > Step 2: Examples and install sample tables for Hive and Impala.
  3. Go to Query Editors > Impala and click the refresh icon.
  4. Select Perform incremental metadata update to display sample tables.
  5. Go to Metastore Tables Manager and click the refresh icon.

Tag Database, Table, and Field

  1. In Metastore Tables Manager, click the default database (or any other database).
  2. Add database tag: Hover over TAGS, click the edit icon, enter a tag of your choice, and save.
  3. Add table tag: Click a table name (such as "customers"), hover over TAGS, click the edit icon, enter a tag, and save.
  4. Add field tag: Click the "show details" icon by a column name, then add a tag the same way in the context popup.
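
Tags applied in Hue are stored in the Cloudera Navigator instance, so the same kind of update could in principle be scripted. The following is a minimal sketch only. It assumes a Navigator-style REST endpoint that accepts a PUT with a JSON "tags" array for a single entity. The host, port, API version, entity id, and the helper name build_tag_request are all placeholders and not taken from this document, so check them against the API documentation for your Navigator release. The sketch builds the request without sending it.

```python
import json
from urllib.request import Request


def build_tag_request(base_url: str, entity_id: str, tags: list[str]) -> Request:
    """Build (but do not send) a PUT request that sets tags on one entity.

    The endpoint layout <base_url>/entities/<entity_id> is an assumption.
    """
    url = f"{base_url.rstrip('/')}/entities/{entity_id}"
    # Drop duplicate tags while keeping the order in which they were given.
    unique_tags = list(dict.fromkeys(tags))
    body = json.dumps({"tags": unique_tags}).encode("utf-8")
    return Request(
        url,
        data=body,
        method="PUT",
        headers={"Content-Type": "application/json"},
    )


if __name__ == "__main__":
    # Placeholder host, API version, and entity id.
    req = build_tag_request(
        "http://navigator.example.com:7187/api/v9", "abc123", ["pii", "sales", "pii"]
    )
    print(req.get_method(), req.full_url)
    print(req.data.decode("utf-8"))
```

Sending the request would additionally require credentials and a call such as urllib.request.urlopen, which is left out here on purpose.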

Search Cloudera Navigator Metadata with Hue

After integrating Hue with Cloudera Navigator, the Hue SQL Editor provides a Search bar that includes a list of filters and an auto-complete listing pre-filled with top values. By default, search returns only tables and views. Use the type filter to search for columns, partitions, and databases. The Cloudera Navigator search field is available in the Metastore Tables Manager as well as in the Hive and Impala editors.
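
The default search scope and the effect of the type filter can be illustrated with a small in-memory model. This is a sketch for intuition only, not how Hue or Cloudera Navigator implements search. The Entity class, the search function, the type labels, and the sample catalog are all hypothetical.

```python
from dataclasses import dataclass, field

# Entity types searched when no type filter is given (tables and views only).
DEFAULT_TYPES = frozenset({"TABLE", "VIEW"})


@dataclass
class Entity:
    name: str
    type: str  # illustrative labels: "DATABASE", "TABLE", "VIEW", "FIELD", "PARTITION"
    tags: set = field(default_factory=set)


def search(entities, text, types=None):
    """Return names of entities whose name or tags contain `text`.

    With no `types` argument, only tables and views are considered.
    """
    wanted = DEFAULT_TYPES if types is None else {t.upper() for t in types}
    needle = text.lower()
    return [
        e.name
        for e in entities
        if e.type in wanted
        and (needle in e.name.lower() or any(needle in t.lower() for t in e.tags))
    ]


# Hypothetical sample catalog.
CATALOG = [
    Entity("default", "DATABASE", {"customer_data"}),
    Entity("customers", "TABLE", {"customer_data"}),
    Entity("customer_id", "FIELD"),
    Entity("customer_summary", "VIEW"),
]

if __name__ == "__main__":
    print(search(CATALOG, "customer"))                   # tables and views only
    print(search(CATALOG, "customer", types=["field"]))  # explicit type filter
```

In this model, the tagged database and the matching column are left out of the first search and appear only when their type is requested explicitly.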