Step 6: Install CDH and Other Software
After setting up the Cloudera Manager database, start Cloudera Manager Server, and log in to the Cloudera Manager Admin Console:
- Start Cloudera Manager Server:
- RHEL 7 compatible, Ubuntu, SLES:
sudo systemctl start cloudera-scm-server
- RHEL 6 compatible:
sudo service cloudera-scm-server start
- RHEL 7 compatible, Ubuntu, SLES:
- Wait several minutes for the Cloudera Manager Server to start. To observe the startup process, run the following on the Cloudera Manager Server host:
sudo tail -f /var/log/cloudera-scm-server/cloudera-scm-server.logWhen you see this log entry, the Cloudera Manager Admin Console is ready:
INFO WebServerImpl:com.cloudera.server.cmf.WebServerImpl: Started Jetty server.
If the Cloudera Manager Server does not start, see Troubleshooting Installation Problems.
- In a web browser, go to http://<server_host>:7180, where <server_host> is the FQDN or IP address of the host where the Cloudera Manager Server is running.
- Log into Cloudera Manager Admin Console. The default credentials are:
- After you log in, the End User License Terms and Conditions page displays. Read the terms and conditions and then check the box labeled Yes, I accept the End User License Terms and Conditions to accept them. Click Continue, and the installation wizard launches.
The Welcome page provides a brief overview of Cloudera Manager, and links to the release notes for the version you are installing. Click Continue to proceed with the installation.
The Accept License page provides the End User License Terms and Conditions. Read the license agreement and click the checkbox labeled Yes, I accept the End User License Terms and Conditions if you accept the terms and conditions of the license agreement.
On the Select Edition page, you can select the edition of Cloudera Manager to install and, optionally, install a license:
- Choose which edition to install:
- Cloudera Express, which does not require a license, but provides a limited set of features.
- Cloudera Enterprise Cloudera Enterprise Trial, which does not require a license, but expires after 60 days and cannot be renewed.
- Cloudera Enterprise with one of the following license types:
- Essentials Edition
- Data Science and Engineering Edition
- Operational Database Edition
- Data Warehouse Edition
- Enterprise Data Hub Edition
- If you select Cloudera Enterprise, install a license:
- Click the Select License File field.
- Browse to the location of your license file, click the file, and click Open.
- Click Upload.
- Information is displayed indicating what the CDH installation includes. At this point, you can click the Support drop-down menu to access online Help or the Support Portal.
- Click Continue to proceed with the installation.
Welcome (Add Cluster - Installation)
The Welcome page of the Add Cluster - Installation wizard provides a brief overview of the installation and configuration procedure, as well as some links to relevant documentation. Click Continue to proceed with the installation.
The Setup Auto-TLS page provides instructions for initializing the certificate manager for auto-TLS if you have not done so already. If you already initialized the certificate manager in Step 3: Install Cloudera Manager Server, the wizard displays a message indicating that auto-TLS has been initialized. Click Continue to proceed with the installation.
If you have not already initialized the certificate manager, and you want to enable auto-TLS, follow the instructions provided on the page before continuing. For more information, see Configuring TLS Encryption for Cloudera Manager and CDH Using Auto-TLS.
Choose which hosts will run CDH and other managed services
- To enable Cloudera Manager to automatically discover hosts on which to install CDH and managed services, enter the cluster hostnames or IP addresses. You can also specify hostname and
IP address ranges. For example:
Range Definition Matching Hosts 10.1.1.[1-4] 10.1.1.1, 10.1.1.2, 10.1.1.3, 10.1.1.4 host[1-3].example.com host1.example.com, host2.example.com, host3.example.com host[07-10].company.com host07.example.com, host08.example.com, host09.example.com, host10.example.com
You can specify multiple addresses and address ranges by separating them with commas, semicolons, tabs, or blank spaces, or by placing them on separate lines. Use this technique to make more specific searches instead of searching overly wide ranges. Only scans that reach hosts running SSH will be selected for inclusion in your cluster by default. You can enter an address range that spans over unused addresses and then clear the nonexistent hosts later in the procedure, but wider ranges require more time to scan.
- Click Search. If there are a large number of hosts on your cluster, wait a few moments to allow them to be discovered and shown in the wizard. If the search is taking too long, you can stop the scan by clicking Abort Scan. You can modify the search pattern and repeat the search as many times as you need until you see all of the expected hosts.
- Verify that the number of hosts shown matches the number of hosts where you want to install services. Clear host entries that do not exist or where you do not want to install services.
- Click Continue.
The Select Repository screen displays.
The Select Repository page allows you to specify repositories for Cloudera Manager Agent and CDH and other software.
In the Cloudera Manager Agent section:
- Select either Public Cloudera Repository or Custom Repository for the Cloudera Manager Agent software.
- If you select Custom Repository, do not include the operating system-specific paths in the URL. For instructions on setting up a custom repository, see Using an Internal Package Repository.
In the CDH and other software section:
- Select the repository type to use for the installation. In the Install Method section select one of the following:
- Use Parcels (Recommended)
A parcel is a binary distribution format containing the program files, along with additional metadata used by Cloudera Manager. Parcels are required for rolling upgrades. For more information, see Parcels.
- Use Packages
A package is a standard binary distribution format that contains compiled code and meta-information such as a package description, version, and dependencies. Packages are installed using your operating system package manager.
- Use Parcels (Recommended)
- Select the version of CDH to install.
- If you selected Use Parcels and you do not see the version you want to install, click the More Options button to add the repository URL for your version. Repository URLs for CDH 6 version are documented in CDH 6 Download Information. After adding the repository, click Save Changes and wait a few seconds for the version to appear. If your Cloudera Manager host uses an HTTP proxy, click the Proxy Settings button to configure your proxy.
- If you selected Use Packages, and the version you want to install is not listed, you can select Custom Repository to specify a repository that contains the desired version. Repository URLs for CDH 6 version are documented in CDH 6 Download Information.
- If you selected Use Parcels, specify any Additional Parcels you want to install. If you are installing CDH 6, do not select the KAFKA, KUDU, or SPARK parcels, because they are included in CDH 6.
- Click Continue.
The Accept JDK License page displays.
Accept JDK License
To allow Cloudera Manager to automatically install the Oracle JDK on cluster hosts, read the JDK license and check the box labeled Install Oracle Java SE Development Kit (JDK) if you accept the terms. If you installed your own Oracle JDK version in Step 2: Install Java Development Kit, leave the box unchecked.
If you allow Cloudera Manager to install the JDK, a second checkbox appears, labeled Install Java Unlimited Strength Encryption Policy Files. These policy files are required to enable AES-256 encryption in JDK versions lower than 1.8u161. JDK 1.8u161 and higher enable unlimited strength encryption by default, and do not require policy files.
After reading the license terms and checking the applicable boxes, click Continue.
Enter Login Credentials
- Select root for the root account, or select Another user and enter the username for an account that has password-less sudo privileges.
- Select an authentication method:
- If you choose password authentication, enter and confirm the password.
- If you choose public-key authentication, provide a passphrase and path to the required key files.
You can modify the default SSH port if necessary.
- Specify the maximum number of host installations to run at once. The default and recommended value is 10. You can adjust this based on your network capacity.
- Click Continue.
The Install Agents page displays.
The Install Agents page displays the progress of the installation. You can click on the Details link for any host to view the installation log. If the installation is stalled, you can click the Abort Installation button to cancel the installation and then view the installation logs to troubleshoot the problem.
If the installation fails on any hosts, you can click the Retry Failed Hosts to retry all failed hosts, or you can click the Retry link on a specific host.
If you selected the option to manually install agents, see Install Cloudera Manager Packages for the procedure and then continue with the next steps on this page.
After installing the Cloudera Manager Agent on all hosts, click Continue.
If you are using parcels, the Install Parcels page displays. If you chose to install using packages, the Inspect Hosts page displays.
If you selected parcels for the installation method, the Install Parcels page reports the installation progress of the parcels you selected earlier. After the parcels are downloaded, progress bars appear representing each cluster host. You can click on an individual progress bar for details about that host.
After the installation is complete, click Continue.
The Inspect Hosts page displays.
The Inspect Hosts page runs the Host Inspector to search for common configuration problems. View the results and address any identified problems. Click the Run Again button to update the results after making any changes.
After addressing identified problems, click Finish.
This completes the Cluster Installation wizard and launches the Cluster Setup wizard.
Continue to Step 7: Set Up a Cluster Using the Wizard.