What's New in Cloudera Documentation

What's New in Cloudera Documentation in May, 2018

This section describes new topics added and major changes made to the Cloudera documentation library in May, 2018:

Product What's New Link
Cloudera Upgrade Added a new interactive topic that walks you through the steps to upgrade Cloudera Manager. You can select your operating system, upgrade version, and database type, and a customized page displays the steps for your upgrade. Upgrading Cloudera Manager Using Packages
Added a new interactive topic that walks you through the steps to upgrade CDH using Cloudera Manager. You can select your Cloudera Manager version, CDH upgrade version, and other information, and a customized page displays the steps for your upgrade. Upgrading CDH
HDFS Transparent Encryption Extensively revised the KMS ACL topic, which now describes all operations for each ACL class and includes a diagram and explanation of how the KMS evaluates the various ACL classes. Configuring KMS Access Control Lists (ACLs)
Key Trustee KMS HA Added new documentation for a feature that detects and warns users when the GPG private keys have not been properly synchronized across all Key Trustee KMS HA hosts.
Cloudera Navigator HSM KMS Added a new topic to guide users through the steps to upgrade an HSM KMS. Upgrading Cloudera Navigator HSM KMS

What's New in Cloudera Documentation in March, 2018

This section describes new topics added and major changes made to the Cloudera documentation library in March, 2018:

Product What's New Link
Cloudera Data Science Workbench Added a new video that demonstrates how to get started with a Cloudera Data Science Workbench built-in template project. CDSW Quickstart Demo [YouTube]
Added new known issues for Cloudera Manager and CDH integration. Known Issues
Added a new topic on migrating a CDSW deployment to another host. Migrating a CDSW Deployment
Revamped the Backup topic with detailed instructions. Creating a Backup
Added a new topic on how to uninstall Cloudera Data Science Workbench. Uninstalling CDSW
JDK Requirements Added a new section on Java Cryptography Extension (JCE) Unlimited Strength Jurisdiction requirements. Java Cryptography Extension (JCE) Unlimited Strength Jurisdiction
Navigator Navigator Audit Server documentation includes a new example of how to use audit events to determine what caused a schema change to a table. Use audit reports to identify the user or process that may be causing unwanted changes. Who ran which operation against a table?
Cloudera Director Added a new topic on using custom DNS names and DNS servers with auto-TLS. Using Custom DNS with Auto-TLS in AWS

What's New in Cloudera Documentation in February, 2018

This section describes new topics added and major changes made to the Cloudera documentation library in February, 2018:

Product What's New Link
Flume The Apache Flume content is moved to a new Flume Guide. Information for configuring, using, and managing Flume is consolidated in the Flume Guide. Flume Guide
HBase The Apache HBase content is moved to a new HBase Guide. All the information for configuring, managing, and troubleshooting HBase is in one central location. HBase Guide
Key HSM There is a new section describing the file naming convention used for encryption zone keys. Key Naming Convention
HDFS (Encryption) There is a new section describing how to resolve an error that can occur when the KMS jute buffer size is insufficient to hold all the tokens. KMS server jute buffer exception
Sentry The Apache Sentry content is moved to a new Sentry Guide. The Sentry Guide contains information on configuring, using, and troubleshooting Sentry, as well as how-to guides. Sentry Guide

What's New in Cloudera Documentation in January, 2018

This section describes new topics added and major changes made to the Cloudera documentation library in January, 2018:

Product What's New Link
Cloudera Data Science Workbench Released Cloudera Data Science Workbench 1.3.0.
Impala Added a tip about using the Kudu Java API, instead of the JDBC interface, for rapid insert operations. Configuring Impala to Work with JDBC
Added the DATE_TRUNC() function. Impala Date and Time Functions
The BATCH_SIZE query option now has an upper limit. BATCH_SIZE Query Option
A new kind of runtime filter, the "min-max" filter, applies to join queries involving Kudu tables. Using Impala to Query Kudu Tables
Added new conditional operators: IS [NOT] TRUE, IS [NOT] FALSE, and IS [NOT] UNKNOWN. SQL Operators
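The following is a minimal sketch of the three-valued logic behind these operators, not Impala code. It assumes SQL NULL is modeled as Python None, and the function names are hypothetical. The point it illustrates is that each operator always yields true or false, never NULL.

```python
from typing import Optional

# Assumption of this sketch: SQL NULL is represented by Python None.

def is_true(x: Optional[bool]) -> bool:
    """Models `x IS TRUE`: true only when x is true."""
    return x is True

def is_false(x: Optional[bool]) -> bool:
    """Models `x IS FALSE`: true only when x is false."""
    return x is False

def is_unknown(x: Optional[bool]) -> bool:
    """Models `x IS UNKNOWN`: true only when x is NULL."""
    return x is None

def is_not_true(x: Optional[bool]) -> bool:
    """Models `x IS NOT TRUE`: true when x is false or NULL."""
    return not is_true(x)

if __name__ == "__main__":
    # Print the result of each operator for the three possible inputs.
    for value in (True, False, None):
        print(value, is_true(value), is_false(value),
              is_unknown(value), is_not_true(value))
```

Note the difference from plain negation: in SQL, NOT applied to NULL stays NULL, whereas IS NOT TRUE applied to NULL evaluates to true.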
Added information about changes to the output of the SET statement, which now divides the options into multiple groups and hides some groups by default. The new SET ALL syntax shows all the option groups. SET Statement
Added a new impala-shell option --query_option and configuration file section [impala.query_options]. These features both allow specifying values for query options when starting impala-shell. impala-shell Configuration Options
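As a rough illustration of how the two features relate, the sketch below parses an [impala.query_options] section with Python's standard configparser and prints the equivalent --query_option arguments. The sample option values are illustrative only, the helper names are hypothetical, and the name=value form of the flag is an assumption; check the impala-shell documentation for the exact syntax.

```python
import configparser

# Illustrative configuration text; the values are assumptions for this sketch.
SAMPLE_CONFIG = """
[impala.query_options]
mem_limit = 2g
batch_size = 1024
"""

def read_query_options(text: str) -> dict:
    """Return the options in the [impala.query_options] section as a dict."""
    parser = configparser.ConfigParser()
    parser.read_string(text)
    if not parser.has_section("impala.query_options"):
        return {}
    return dict(parser.items("impala.query_options"))

def as_command_line(options: dict) -> list:
    """Build --query_option arguments (assumed name=value form)."""
    return [f"--query_option={name}={value}" for name, value in options.items()]

if __name__ == "__main__":
    opts = read_query_options(SAMPLE_CONFIG)
    print(" ".join(["impala-shell"] + as_command_line(opts)))
```

Either route is meant to set the same query options at startup; the configuration file suits defaults you always want, and the command-line option suits one-off sessions.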
Kafka Updated examples and removed deprecated properties for how to use Kafka with Flume. Using Kafka with Flume
Kafka Updated Kafka upgrade topic to include versions. Rolling Upgrade to Kafka 3.0.x
Key Trustee KMS There is a new procedure for migrating from a Key Trustee KMS (KT KMS) to a Hardware Security Module KMS (HSM KMS). Migrating from a Key Trustee KMS to an HSM KMS
Cloudera Manager

ADLS Connectivity

You can now use Cloudera Manager to configure credentials for cluster access to Microsoft ADLS. This access lets you run Hive and Impala queries on tables backed by data stored in ADLS and browse ADLS data using Hue.

Configuring ADLS Access Using Cloudera Manager

Performing Host Maintenance

To enable performing minor maintenance on cluster hosts, Cloudera Manager now fully manages the host decommission and recommission process. You can specify whether or not to replicate under-replicated data blocks to other DataNodes to maintain the cluster's replication factor during a maintenance window.
Tuning and Troubleshooting Host Decommissioning

BDR

Added examples for how to use the API to manage BDR.

How To Automate BDR Replication with the Cloudera Manager API

Video

Added a video walkthrough for how to add a cluster to Cloudera Manager.

View the video on YouTube.

View the video within the documentation.

Cloudera Director

LDAP and Active Directory

Cloudera Director 2.7 can be configured to point to an organization’s LDAP server so that users can log in to Cloudera Director with their common credentials. When LDAP support is enabled, Cloudera Director’s built-in user management is disabled.

Configuring Cloudera Director Server for LDAP and Active Directory

Director-managed Java Installation

Cloudera Director can now handle all aspects of Java installation on the instances that it allocates and configures for Cloudera Manager and CDH clusters, offering more flexibility while simplifying the process for users.

Deploying Java on Cluster Instances
A configuration option has been added to Cloudera Director's AWS plugin to accommodate regions like GovCloud and China, where EC2 cannot tag instances upon creation. The documentation now includes the procedure for configuring the plugin to use this option. Configuring Tag-on-create for AWS GovCloud (US) and China (Beijing) Regions
Cloudera Navigator

Group by for search results

Metadata searches in Navigator now include the ability to group search results by common properties. Group by lets you use technical, managed, and custom metadata to quickly identify small files, active SQL users, table-creation trends, and other data aggregation trends revealed by metadata properties.

The documentation includes some examples of how grouping search results can help you understand trends in your data and find specific data assets.

Grouping Search Results Using Metadata
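To make the idea of grouping by a metadata property concrete, here is a small sketch that counts small files per owner. It does not use the Navigator API; the entity records, field names, and size threshold are all hypothetical and exist only for illustration.

```python
from collections import Counter

# Hypothetical metadata records; the field names are assumptions for this sketch.
ENTITIES = [
    {"name": "part-0001", "type": "file", "owner": "etl", "size_bytes": 1_200},
    {"name": "part-0002", "type": "file", "owner": "etl", "size_bytes": 900},
    {"name": "events.parquet", "type": "file", "owner": "analytics", "size_bytes": 512_000_000},
    {"name": "tmp.csv", "type": "file", "owner": "analytics", "size_bytes": 4_000},
]

SMALL_FILE_LIMIT = 1_000_000  # hypothetical threshold, in bytes

def group_count(entities, prop):
    """Count entities by the value of one metadata property."""
    return Counter(entity.get(prop) for entity in entities)

def small_files_by_owner(entities, limit=SMALL_FILE_LIMIT):
    """Group files below the size limit by owner."""
    small = [e for e in entities if e["type"] == "file" and e["size_bytes"] < limit]
    return group_count(small, "owner")

if __name__ == "__main__":
    print(small_files_by_owner(ENTITIES))
```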

The Navigator role names have been updated to more clearly reflect the privileges they provide. One specific change is that the privilege for editing the name and description metadata for Navigator entities is now part of the Managed & Custom Metadata Editor role. Users with that role or the Full Administrator role can add and update entity names and descriptions in the Navigator console.

Cloudera Navigator User Roles

Audit filtering now allows a "not like" operator.

Filtering Audit Events
Cloudera Manager provides options to allow you to filter content from the Navigator audit logs. The documentation now includes a page of information for handling sensitive data that links to the Cloudera Manager log redaction details. Sensitive Data
The documentation now includes the specific metadata removed during Navigator Metadata Server purge tasks. What Metadata is Purged?
Kudu

New features and updates to Kudu administration

  • You can now add data directories to an existing master or tablet server.
  • Kudu tablet servers are resilient to disk failures that occur on a disk storing data blocks.
  • The description of the workflow to migrate to a multi-master cluster was improved with details and examples.
  • The description of the workflow to recover from a dead Kudu master was improved with details.
Kudu Administration
Specified how client applications connect to Kerberized Kudu servers. Client Authentication to Secure Kudu Clusters