Exercise 5: Cloudera Navigator
As you click on the nodes in this graph, more detail will appear. If you click on the tokenized_access_logs table and the intermediate_access_logs table, you'll see arrows for each individual field running through that query. You can see how quickly you could trace the origin of datasets even in a much busier and more complicated environment!
Now you've shown where the data came from, but we still need to show what's been done with it. Go to the 'Audits' tab, using the link in the top-right corner.
If you would like to automatically apply this tag to data as it changes, return to the policies tab and edit the policy's schedule to be "On Data Change". Then the tag will be applied to any file that is assigned these permissions in the future.
You've now experienced how to use Cloudera Navigator for discovery of data and metadata. This powerful tool makes it easy to audit access, trace data lineage, and enforce policies.
With more data, and more data formats available in a multi-tenant environment, data lineage and governance are getting challenging. Cloudera Navigator provides enterprise-grade governance that's built into the foundation of Apache Hadoop.
You can learn more about the various management features provided by Cloudera Manager in the Cloudera Administrator Training for Apache Hadoop.