Step 6: Install CDH and Other Software

Step 1: Configure a Repository Step 2: Install Java Step 3: Install Cloudera Manager Server Step 4: Install Databases Step 5: Set Up the Cloudera Manager Database Step 6: Install CDH and Other Software Step 7: Set Up a Cluster

After setting up the Cloudera Manager database, start Cloudera Manager Server, and log in to the Cloudera Manager Admin Console:

  1. Start Cloudera Manager Server:
    • RHEL 7 compatible:
      sudo systemctl start cloudera-scm-server
    • RHEL 6 compatible, Ubuntu, SLES:
      sudo service cloudera-scm-server start
  2. Wait several minutes for the Cloudera Manager Server to start. To observe the startup process, run the following on the Cloudera Manager Server host:
    sudo tail -f /var/log/cloudera-scm-server/cloudera-scm-server.log
    When you see this log entry, the Cloudera Manager Admin Console is ready:
    INFO WebServerImpl:com.cloudera.server.cmf.WebServerImpl: Started Jetty server.

    If the Cloudera Manager Server does not start, see Troubleshooting Installation Problems.

  3. In a web browser, go to http://<server_host>:7180, where <server_host> is the FQDN or IP address of the host where the Cloudera Manager Server is running.
  4. Log into Cloudera Manager Admin Console. The default credentials are:

    Username: admin

    Password: admin

  5. After you log in, the End User License Terms and Conditions page displays. Read the terms and conditions and then check the box labeled Yes, I accept the End User License Terms and Conditions to accept them. Click Continue, and the installation wizard launches.

Welcome to Cloudera Manager

From the Welcome to Cloudera Manager page, you can select the edition of Cloudera Manager to install and, optionally, install a license:

  1. Choose which edition to install:
    • Cloudera Express, which does not require a license, but provides a limited set of features.
    • Cloudera Enterprise Cloudera Enterprise Trial, which does not require a license, but expires after 60 days and cannot be renewed.
    • Cloudera Enterprise with one of the following license types:
      • Basic Edition
      • Flex Edition
      • Cloudera Enterprise
    If you choose Cloudera Express or Cloudera Enterprise Cloudera Enterprise Trial, you can upgrade the license at a later time. For more information, see Managing Licenses.
  2. If you select Cloudera Enterprise, install a license:
    1. Click the Select License File text field.
    2. Browse to the location of your license file, select the file, and then click Open.
    3. Click the Upload button.
  3. Information is displayed indicating what the CDH installation includes. At this point, you can click the Support drop-down menu to access online Help or the Support Portal.
  4. Click Continue to proceed with the installation.

Thank you for choosing Cloudera Manager and CDH

The Thank you for choosing Cloudera Manager and CDH page lists the software that is available to be installed. Click Continue to proceed with the installation.

Specify hosts for your CDH cluster installation

Choose which hosts will run CDH and managed services

  1. Do one of the following depending on whether you are using Cloudera Manager to install software:
    • If you are using Cloudera Manager to install software, search for and choose hosts:
      1. To enable Cloudera Manager to automatically discover hosts on which to install CDH and managed services, enter the cluster hostnames or IP addresses. You can also specify hostname and IP address ranges. For example:
        Range Definition Matching Hosts
        10.1.1.[1-4] 10.1.1.1, 10.1.1.2, 10.1.1.3, 10.1.1.4
        host[1-3].example.com host1.example.com, host2.example.com, host3.example.com
        host[07-10].example.com host07.example.com, host08.example.com, host09.example.com, host10.example.com

        You can specify multiple addresses and address ranges by separating them with commas, semicolons, tabs, or blank spaces, or by placing them on separate lines. Use this technique to make more specific searches instead of searching overly wide ranges. The scan results will include all addresses scanned, but only scans that reach hosts running SSH will be selected for inclusion in your cluster by default. If you do not know the IP addresses of all of the hosts, you can enter an address range that spans over unused addresses and then clear the hosts that do not exist (and are not discovered) later in this procedure. However, keep in mind that wider ranges will require more time to scan.

      2. Click Search. Cloudera Manager identifies the hosts on your cluster to allow you to configure them for services. If there are a large number of hosts on your cluster, wait a few moments to allow them to be discovered and shown in the wizard. If the search is taking too long, you can stop the scan by clicking Abort Scan. To find additional hosts, click New Search, add the host names or IP addresses and click Search again. Cloudera Manager scans hosts by checking for network connectivity. If there are some hosts where you want to install services that are not shown in the list, make sure you have network connectivity between the Cloudera Manager Server host and those hosts. Common causes of loss of connectivity are firewalls and interference from SELinux.
      3. Verify that the number of hosts shown matches the number of hosts where you want to install services. Clear host entries that do not exist and clear the hosts where you do not want to install services.
  2. Click Continue.

The Select Repository screen displays.

Select Repository

  1. Select the repository type to use for the installation. In the Choose Method section select one of the following:
    • Use Parcels (Recommended)

      A parcel is a binary distribution format containing the program files, along with additional metadata used by Cloudera Manager. Parcels are required for rolling upgrades. For more information, see Parcels.

    • Use Packages

      A package is a standard binary distribution format that contains compiled code and meta-information such as a package description, version, and dependencies. Packages are installed using your operating system package manager.

  2. Select the version of CDH to install.
    1. If you selected Use Parcels (Recommended) and you do not see the version you want to install, click the More Options button to add the repository URL for your version. Repository URLs for CDH 5 are documented in CDH Download Information. After adding the repository, click Save Changes and wait a few seconds for the version to appear. If your Cloudera Manager host uses an HTTP proxy, click the Proxy Settings button to configure your proxy.
    2. If you selected Use Packages, and the version you want to install is not listed, you can select Custom Repository to specify a repository that contains the desired version. Repository URLs for CDH 5 version are documented in CDH Download Information.
  3. If you selected Use Parcels (Recommended), specify any Additional Parcels you want to install.
  4. Select the specific release of the Cloudera Manager Agent you want to install on your hosts. You can choose either the version that matches the Cloudera Manager Server you are currently using (recommended) or specify a different version in a custom repository. If you opted to use custom repositories for installation files, you can provide a GPG key URL that applies for all repositories. Repository and GPG key URLs are documented in CDH Download Information.
  5. Click Continue.

The Accept JDK License screen displays.

Accept JDK License

To allow Cloudera Manager to automatically install the Oracle JDK on cluster hosts, read the JDK license and check the box labeled Install Oracle Java SE Development Kit (JDK 7) if you accept the terms. If you installed your own Oracle JDK version in Step 2: Install Java Development Kit, leave the box unchecked.

If you allow Cloudera Manager to install the JDK, a second checkbox appears, labeled Install Java Unlimited Strength Encryption Policy Files. These policy files are required to enable AES-256 encryption in JDK versions lower than 1.8u161. JDK 1.8u161 and higher enable unlimited strength encryption by default, and do not require policy files.

After reading the license terms and checking the applicable boxes, click Continue.

The Single User Mode page displays.

Single User Mode

The Single User Mode page allows you to enable single user mode. Single user mode is not recommended for most environments. If you need to enable single user mode, check the box labeled Enable Single User Mode. Otherwise, leave it unchecked, and click Continue. The Enter Login Credentials page displays.

Enter Login Credentials

  1. Select root for the root account, or select Another user and enter the username for an account that has password-less sudo privileges.
  2. Select an authentication method:
    • If you choose password authentication, enter and confirm the password.
    • If you choose public-key authentication, provide a passphrase and path to the required key files.

    You can modify the default SSH port if necessary.

  3. Specify the maximum number of host installations to run at once. The default and recommended value is 10. You can adjust this based on your network capacity.
  4. Click Continue.

The Install Agents page displays.

Install Agents

The Install Agents page displays the progress of the installation. You can click on the Details link for any host to view the installation log. If the installation is stalled, you can click the Abort Installation button to cancel the installation and then view the installation logs to troubleshoot the problem.

If the installation fails on any hosts, you can click the Retry Failed Hosts to retry all failed hosts, or you can click the Retry link on a specific host.

If you selected the option to manually install agents, see Manually Install Cloudera Software Packages for the procedure and then continue with the next steps on this page.

After installing the Cloudera Manager Agent on all hosts, click Continue.

If you are using parcels, the Install Parcels page displays. If you chose to install using packages, the Inspect Hosts page displays.

Install Parcels

If you selected parcels for the installation method, the Install Parcels page reports the installation progress of the parcels you selected earlier. After the parcels are downloaded, progress bars appear representing each cluster host. You can click on an individual progress bar for details about that host.

After the installation is complete, click Continue.

The Inspect hosts for correctness page displays.

Inspect hosts for correctness

The Inspect Hosts page runs the Host Inspector to search for common configuration problems. View the results and address any identified problems. Click the Run Again button to update the results after making any changes.

After addressing identified problems, click Finish.

This completes the Cluster Installation wizard and launches the Cluster Setup wizard.

Continue to Step 7: Set Up a Cluster Using the Wizard.