This is the documentation for Cloudera Manager 4.8.2.
Documentation for other versions is available at Cloudera Documentation.

Sending Usage and Diagnostic Data to Cloudera

Cloudera Manager collects anonymous usage information and takes regularly-scheduled snapshots of the state of your cluster and automatically sends them anonymously to Cloudera. This helps Cloudera improve and optimize Cloudera Manager.

If you are a Cloudera Enterprise user, you can also trigger the collection of diagnostic data and send it to Cloudera Support to aid in resolving a problem you may be having.

Continue reading:

Anonymous Usage Data Collection

Cloudera Manager sends anonymous usage information using Google Analytics to Cloudera. The information helps Cloudera improve Cloudera Manager. By default anonymous usage data collection is enabled.

  1. From the Administration tab, select Settings.
  2. Under the Other category, set the Allow Usage Data Collection property.
  3. Click Save Changes.

Managing Hue Analytics Data Collection

Hue tracks anonymised pages and application versions in order to gather information to help compare each application's usage levels. The data collected does not include any hostnames or IDs. For example, the data is of the form: /2.3.0/pig, /2.5.0/beeswax/execute. You can restrict data collection as follows:
  1. Go to the Hue service.
  2. Select Configuration > View and Edit.
  3. Expand the Service-Wide category.
  4. Uncheck the Enable Usage Data Collection checkbox.
  5. Click Save Changes.
  6. Restart the Hue service.

Diagnostic Data Collection

To help with solving problems when using Cloudera Manager on your cluster, Cloudera Manager collects diagnostic data on a regular schedule, and automatically sends it to Cloudera. By default Cloudera Manager is configured to collect data weekly and to send it automatically. You can schedule the frequency of data collection on a daily, weekly, or monthly schedule, or disable the scheduled collection of data entirely. You can also send a collected data set manually.

What Data Does Cloudera Manager Collect?

Cloudera Manager collects and returns a significant amount of information about the health and performance of the cluster. It includes the following:

  • Up to 1000 Cloudera Manager audit events: Configuration changes, add/remove of users, roles, services, etc.
  • One day's worth of Cloudera Manager events: This includes critical errors Cloudera Manager watches for and more.
  • Data about the cluster structure which includes a list of all hosts, roles, and services along with the configurations that are set through Cloudera Manager. Where passwords are set in Cloudera Manager, the passwords are not returned.
  • Cloudera Manager license and version number.
  • Current health information for hosts, service, and roles. Includes results of health tests run by Cloudera Manager.
  • Heartbeat information from each host, service, and role. These include status and some information about memory, disk, and processor usage.
  • The results of running Host Inspector.
  • One day's worth of Cloudera Manager metrics.
      Note: If you are using Cloudera Standard, Host metrics are not included.
  • A download of the debug pages for Cloudera Manager roles.
  • For each machine in the cluster, the result of running a number of system-level commands on that machine.
  • Logs from each role on the cluster, as well as the Cloudera Manager server and agent logs.

Configuring the Frequency of Diagnostic Data Collection

By default, Cloudera Manager collects diagnostic data on a weekly basis. You can change the frequency to daily, weekly, monthly, or never. If you are a Cloudera Enterprise customer and you set the schedule to never you can still collect and send data to Cloudera on demand. If you are a Cloudera Express customer and you set the schedule to never, data is not collected or sent to Cloudera.

  1. Select Administration > Settings.
  2. Under the Support category, click the Value column for the property Scheduled Diagnostic Data Collection Frequency and select the frequency you want.
  3. To set the day and time of day that the collection will be performed, click the Value column for the property Scheduled Diagnostic Data Collection Time and specify the date and time in the pop-up control.
  4. Click Save Changes.

You can see the current setting of the data collection frequency by viewing Support > Scheduled Diagnostics: in the main navigation bar.

Specifying the Diagnostic Data Directory

You can configure the directory where collected data is stored.

  1. Select Administration > Settings.
  2. Under the Support category, set the Diagnostic Data Bundle Directory to a directory on the host running Cloudera Manager Server. The directory must exist and be enabled for writing by the user cloudera-scm.
  3. Click Save Changes.

Collecting and Sending Diagnostic Data to Cloudera

  Note: This feature is available only with a Cloudera Enterprise license.
For other products, the following applies:
  • Cloudera Standard - the feature is not available.
  • Cloudera Enterprise Trial Edition - the feature will not be available after your trial license expires.
To obtain a license for Cloudera Enterprise, please fill in this form or call 866-843-7207. After you install your Cloudera Enterprise license, the feature will be available.
  Note:
  • Automatically sending diagnostic data requires the Cloudera Manager Server host to have Internet access, and be configured for sending data automatically. If your Cloudera Manager server does not have Internet access, you can manually send the diagnostic data as described in Sending Diagnostic Data from a Manually Triggered Collection to Cloudera.
  • Automatically sending diagnostic data may fail sometimes and return an error message of "Could not send data to Cloudera." To work around this issue, you can manually send the data to Cloudera Support.

Disabling the Automatic Sending of Diagnostic Data from a Manually Triggered Collection

If you do not want data sent to Cloudera automatically after triggering data collection, you can disable this feature. The data you collect will be saved and can be downloaded for sending to Cloudera Support at a later time.

  1. From the Administration tab, select Settings.
  2. Under the Support category, uncheck the box for Send Diagnostic Data to Cloudera Automatically.
  3. Click Save Changes.
  Note: The Send Diagnostic Data form displayed when you collect data in one of the following procedures indicates whether the data will be sent automatically.

Manually Triggering Collection and Transfer of Diagnostic Data to Cloudera

  1. Click the Support menu link.
  2. Choose Send Diagnostic Data. This opens the Send Diagnostic Data form.
  3. Fill in or change the information here as appropriate.
    • To change the System Identifier, from the Administration tab select Settings and go to theOther category.
    • Cloudera Manager populates the End Time based on the setting of the Time Range Selector. You should change this to be a few minutes after you observed the problem or condition that you are trying to capture. Note that the time range is based on the timezone of the host where Cloudera Manager server is running.
    • If you have a support ticket open with Cloudera support, please include the support ticket number in the field provided.
  4. Click Collect Diagnostic Data. A Running Commands window shows you the progress of the data collection steps. When these steps are complete, the collected data is sent to Cloudera.

Sending Diagnostic Data from a Manually Triggered Collection to Cloudera

  1. Click the Support menu link.
  2. Choose Send Diagnostic Data. This opens the Send Diagnostic Data form.
  3. Fill in or change the information here as appropriate.
    • To change the System Identifier, from the Administration tab select Settings and go to theOther category.
    • Cloudera Manager populates the End Time based on the setting of the Time Range Selector. You should change this to be a few minutes after you observed the problem or condition that you are trying to capture. Note that the time range is based on the timezone of the host where Cloudera Manager server is running.
    • If you have a support ticket open with Cloudera support, please include the support ticket number in the field provided.
  4. Click Collect Diagnostic Data. A Command Details window shows you the progress of the data collection steps.
  5. In the Command Details window, click Download Result Data to download and save a zip file of the information collected, on a host that has Internet access.
  6. Send the data to Cloudera Support by doing one of the following:
      1. Download the phone_home script.
      2. Copy the script and the downloaded data file to a host that has Internet access.
      3. Run the following command on that host:
        python phone_home.py --file <downloaded data file>
    • Contact Cloudera Support and arrange to send the data file.