QuickStart VMs for CDH 5.4.x

A Single-Node Hadoop Cluster and Examples for Easy Learning!

Start testing Hadoop with Cloudera's QuickStart VMs. The QuickStart VMs contain a single-node Apache Hadoop cluster, complete with example data, queries, scripts, and Cloudera Manager to manage your cluster.

The VMs run CentOS 6.4 and are available for VMware, VirtualBox, and KVM.

All require a 64-bit host OS.

 

Please Note: Cloudera QuickStart VMs are for demo purposes only and are not to be used as a starting point for clusters.  

Assistance with the Cloudera Quickstart VM is available through the Cloudera Community.  

For the best download experience, use of a Download Manager is highly recommended!

 

To make it easy for you to get started with CDH, Cloudera Manager, Cloudera Impala, and Cloudera Search, these virtual machines include everything you need.

  Important:
  • These are a 64-bit VMs. They requires a 64-bit host OS and a virtualization product that can support a 64-bit guest OS.
  • To use a VMware VM, you must use a player compatible with WorkStation 8.x or higher: Player 4.x or higher, ESXi 5.x or higher, or Fusion 4.x or higher. Older versions of WorkStation can be used to create a new VM using the same virtual disk (VMDK file), but some features in VMware Tools won't be available.
  • The VM and file size vary according to the CDH version as follows:
    CDH and Cloudera Manager Version RAM Required by VM File Size
    CDH 5 and Cloudera Manager 5 4 GB 3 GB
    CDH 4, Cloudera Impala, Cloudera Search, and Cloudera Manager 4 4 GB 2 GB
VM Version Documentation
CDH 5 and Cloudera Manager 5
  • To learn more about CDH 5 and Cloudera Manager 5, see the Cloudera 5 documentation.
  • For the latest important information about new features, incompatible changes, and known issues, see the Release Guide.
  • For information on the versions of the components in the latest release, and links to each project's changes files and release notes, see the packaging section of Version and Download Information.
  • Cloudera Manager is installed in the VM but is turned off by default. If you would like to use Cloudera Manager, click on the Launch Cloudera Manager icon on the desktop. It is strongly recommended that before you do so, you configure the VM with 8 GB of RAM and 2 virtual CPU cores (by default it will use 4 GB of RAM and 1 virtual CPU core). Cloudera Manager and all of the CDH services may not launch properly with less RAM. After launching Cloudera Manager, all of the services in CDH will be started, although it may take several minutes for Cloudera Manager to start all of the services in order. To conserve resources and improve performance, it is recommended that you stop services you do not plan to use. Changes made to configuration files before launching Cloudera Manager will not be preserved.
    You can start or reconfigure any installed services using the web interface that is automatically displayed when the VM starts.
      Warning: If Cloudera Manager is running, do not use command-line utilities to start, stop, or configure CDH components.
CDH 4, Cloudera Impala, Cloudera Search, and Cloudera Manager 4
  • CDH 4
    • To learn more about CDH 4, see the CDH 4 documentation.
    • For the latest important information about new features, incompatible changes, and known issues in CDH 4, see the CDH 4 Release Notes.
    • For information on the versions of the components in the latest release of CDH 4, and links to each project's changes files and release notes, see the packaging section of CDH Version and Packaging Information.
    • To learn more about Hadoop, see the Cloudera Glossary and the Hadoop Tutorial.
      Note: The hadoop-hdfs-zkfc and hadoop-hdfs-journalnode components will not start in the QuickStart VM because they are HDFS high availability features which are not designed to run on single host QuickStart VM. The failure to start these two components is harmless.
  • Cloudera Manager 4

    As part of the boot process, the VM automatically launches Cloudera Manager and configures HDFS, Hive, Hue, MapReduce, Oozie, ZooKeeper, Flume, HBase, Cloudera Impala, Cloudera Search, and YARN. Only the ZooKeeper, HDFS, MapReduce, Hive, and Hue services are started automatically. Flume, HBase, Oozie, Sqoop, Impala, Solr, and YARN services are not started because they are not used in all cases and not starting them conserves RAM.

    You can start or reconfigure any installed services using the web interface that is automatically displayed when the VM starts.
      Warning: If Cloudera Manager is running, do not use command-line utilities to start, stop, or configure CDH components.
  • Cloudera Impala
  • Cloudera Search

In most cases, the QuickStart VM requires no administration beyond managing the installed products and services. In the event that additional administration is required or that problems occur, this page provides information on accounts and possible explanations and solutions to some common problems.


Accounts

Once you launch the VM, you are automatically logged in as the cloudera user. The account details are:

  • username: cloudera
  • password: cloudera

The cloudera account has sudo privileges in the VM. The root account password is cloudera.

The root MySQL password (and the password for other MySQL user accounts) is also cloudera.

Hue and Cloudera Manager use the same credentials.


QuickStart VMware Image

To launch the VMware image, you will either need VMware Player for Windows and Linux, or VMware Fusion for Mac. Note that VMware Fusion only works on Intel architectures, so older Macs with PowerPC processors cannot run the QuickStart VM.


QuickStart VirtualBox Image

Some users have reported problems running CentOS 6.2 in VirtualBox. If a kernel panic occurs while the VirtualBox VM is booting, you can try working around this problem by opening the Settings > System > Motherboard tab, and selecting ICH9 instead of PIIX3 for the chip set. If you have not already done so, you must also enable I/O APIC on the same tab.


QuickStart KVM Image

The KVM image provides a raw disk image that can be used by many hypervisors. Configure machines that use this image with sufficient RAM. See Cloudera QuickStart VM for the VM size requirements.