Proof of Concept Installation Using Embedded Database on Centos
These are the steps to install a demonstration cluster. This configuration is not for production use. It uses the embedded Postgresql database and does not scale to meet the needs of a production cluster. The proof of concept installation lets you try out CDH and services to familiarize yourself with the system.
Step 1: Prepare Hosts
- 3 Primary instances: n1-standard-2 (2 CPUs, 8GB RAM).
- 1 Secondary (master) instance: n1-standard-4 (4 CPUs, 15GB RAM).
Step 2: Prepare Databases
For this proof of concept example, use the embedded PostgreSQL database. The production examples use external databases. That is the primary difference between proof of concept versus production deployments: production deployments must use an external database.
Step 3: Install Cloudera Manager
Download and install Cloudera Manager to your Secondary (master) instance.
- Connect to your primary host as the root user with ssh:
$ ssh firstname.lastname@example.org
- Download the Cloudera Manager installer.
- Change permissions to make the binary file executable.
chmod u+x cloudera-manager-installer.bin
- Run the installer.
- Follow the prompts in the installer wizard to install Cloudera Manager.
- Click Next to detect the Cloudera Manager host and install the package repository for Cloudera and the Java Runtime Environment.
- Review and accept the Cloudera Express license.
- Review and accept the Oracle JRE license.
- When the installation is complete, copy the HTTP address of your Cloudera Manager Server (http://your.server.com:7180).
- Click OK twice to complete the installation.
Step 4: Start the Cloudera Manager Server
- In a web browser, go to http://your.server.com:7180.
- Accept the End-User License Terms and Conditions.
- Choose the Cloudera Express (free) license.
- Click Continue.
- Review the products that will be installed. Click Continue.
- Search for the cluster host names using a pattern. For example, the following pattern:
poc-install-[1-4].vpc.acmecornproducts.comlocates the cluster hosts
poc-install-1.vpc.acmecornproducts.com poc-install-2.vpc.acmecornproducts.com poc-install-3.vpc.acmecornproducts.com poc-install-4.vpc.acmecornproducts.com
- Click Continue.
Step 5: Select and Start Services
- Accept all defaults on Cluster Installation page 1. Click Continue.
- Check the box to Install Oracle Java SE Development Kit (JDK). Click Continue.
- Do not enable Single User Mode. Click Continue.
- Enter the password and confirmation (for example, cloudera) SSH Login Credentials. Click Continue. Wait for the cluster installation to complete. Click Continue.
- Wait for installation of your selected parcels to complete. Click Continue.
- Wait for the validation of your cluster to complete. Click Finish.
Step 6: Set Up Your Cluster
- On Cluster Setup page 1, choose Core Hadoop. Click Continue.
- On page 2, accept the default role assignments. Click Continue.
- On the Database Setup page, choose Use Embedded Database. Copy and paste the passwords for Hive, Hue, and the Oozie Server for future reference.
- Click Test Connection.
- After the connections are validated, click Continue.
- Review your changes and click Continue.
- Wait for the first run command to complete. Click Continue.
- Click Finish, and enjoy your new CDH Cluster.