CDH 5.1.2

Cloudera’s 100% Open Source Hadoop Platform

CDH is Cloudera's open source software distribution and consists of Apache Hadoop and additional key open source projects to ensure you get the most out of Hadoop and your data.

It is the only Hadoop solution to offer unified querying options (including batch processing, interactive SQL, text search, and machine learning) and necessary enterprise security features (such as role-based access controls).

Please note: CDH requires manual installation from the command line.
For a faster, automated installation, download Cloudera Manager.

CDH Packaging and Tarball Information

Each CDH release series is made up of a collection of CDH project packages that are known to work together. The package version numbers of the CDH projects in each CDH release are listed in the following table.
  Important: When installing CDH from Cloudera tarballs, note that certain features of Hadoop have not been included in the tarballs and will only be available if you have the native libraries present.

To see the details of all the changes and bug-fixes for a given component in a given release, make sure you read the Changes file as well as the Release Notes, following the links in the tables below.

CDH 5.1.2 Packaging and Tarballs

  Note:

There is no CDH 5.1.1 release. This skip in the CDH 5.x sequence allows the CDH and CM components of Cloudera Enterprise 5.1.2 to have consistent numbering.

This section contains packaging information for the current release only. For packaging and tarball information for older releases, refer to CDH Packaging Information for Previous Releases. For the overall release notes for CDH 5.x.x, see CDH 5 Release Notes.

Component                   Package Version                   Tarball  Release Notes  Changes File
--------------------------  --------------------------------  -------  -------------  ------------
Apache Avro                 avro-1.7.5+cdh5.1.2+32            Tarball  Release notes  Changes
Apache Crunch               crunch-0.10.0+cdh5.1.2+16         Tarball  Release notes  Changes
DataFu                      pig-udf-datafu-1.1.0+cdh5.1.2+11  Tarball  Release notes  Changes
Apache Flume                flume-ng-1.5.0+cdh5.1.2+13        Tarball  Release notes  Changes
Apache Hadoop               hadoop-2.3.0+cdh5.1.2+816         Tarball  Release notes  Changes
Apache HBase                hbase-0.98.1+cdh5.1.2+70          Tarball  Release notes  Changes
HBase-Solr                  hbase-solr-1.5+cdh5.1.2+14        Tarball  Release notes  Changes
Apache Hive                 hive-0.12.0+cdh5.1.2+375          Tarball  Release notes  Changes
Hue                         hue-3.6.0+cdh5.1.2+97             Tarball  Release notes  Changes
Cloudera Impala             impala-1.4+cdh5.1.2+0             (none)   Release notes  Changes
Kite SDK                    kite-0.10.0+cdh5.1.2+122          Tarball  Release notes  Changes
Llama                       llama-1.0.0+cdh5.1.2+0            Tarball  Release notes  Changes
Apache Mahout               mahout-0.9+cdh5.1.2+13            Tarball  Release notes  Changes
Apache Oozie                oozie-4.0.0+cdh5.1.2+252          Tarball  Release notes  Changes
Parquet                     parquet-1.2.5+cdh5.1.2+136        Tarball  Release notes  Changes
Parquet-format              parquet-format-1.0.0+cdh5.1.2+8   Tarball  Release notes  Changes
Apache Pig                  pig-0.12.0+cdh5.1.2+35            Tarball  Release notes  Changes
Cloudera Search             search-1.0.0+cdh5.1.2+0           Tarball  Release notes  Changes
Apache Sentry (incubating)  sentry-1.3.0+cdh5.1.2+160         Tarball  Release notes  Changes
Apache Solr                 solr-4.4.0+cdh5.1.2+233           Tarball  Release notes  Changes
Apache Spark                spark-1.0.0+cdh5.1.2+43           Tarball  Release notes  Changes
Apache Sqoop                sqoop-1.4.4+cdh5.1.2+57           Tarball  Release notes  Changes
Apache Sqoop2               sqoop2-1.99.3+cdh5.1.2+28         Tarball  Release notes  Changes
Apache Whirr                whirr-0.9.0+cdh5.1.2+11           Tarball  Release notes  Changes
Apache ZooKeeper            zookeeper-3.4.5+cdh5.1.2+31       Tarball  Release notes  Changes

CDH Package Manifests

Both the CDH patched source and packages contain explicit information about Cloudera modifications. For example, in the patched source there is a top-level cloudera directory with:

  • A CHANGES.cloudera.txt file that lists all the changes to the pristine source
  • A patches directory that contains every patch Cloudera has applied to the pristine source. All Cloudera patches are released with an Apache 2.0 license.
  • A files directory for files Cloudera created from scratch, such as man pages and configuration files. All Cloudera files are released with an Apache 2.0 license.
  • A README.cloudera file that explains how to recreate the patched source from the pristine source.

Build and Release Numbering

If you are installing CDH 5 with a package manager, you will also see build and release information as part of the file name. The build and package release fields follow the patch level: for example, hbase-0.98.1+cdh5.1.0+64-1.cdh5.1.0.p0.34.el6.x86_64.rpm. The suffix -1.cdh5.1.0.p0.34.el6.x86_64 represents:

  • the base of the release field (1)
  • the CDH release (cdh5.1.0)
  • the customer patch identifier (p0 — which will be 0 for all regular CDH releases but will increment for customer patches)
  • the build number (34)
  • the distribution (el6 = RHEL/CentOS 6, el5 = RHEL/CentOS 5, sles11 = SLES 11)
  • the processor architecture (x86_64, noarch, i386, amd_64). noarch means the packages are not architecture-specific.
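
The fields listed above can be pulled out of a package file name with a regular expression. The following is a minimal sketch, not a Cloudera tool: the pattern is inferred only from the single example file name in this section, and the function name parse_cdh_rpm_name and the field labels (component, upstream, patch_level, and so on) are illustrative assumptions. File names for other distributions or package formats may not match.

```python
import re

# Pattern inferred from the example file name in this section.
# Group names are illustrative labels, not official Cloudera terminology.
RPM_NAME = re.compile(
    r"^(?P<component>.+?)-(?P<upstream>[^-+]+)"            # hbase-0.98.1
    r"\+(?P<cdh_version>cdh[\d.]+)\+(?P<patch_level>\d+)"  # +cdh5.1.0+64
    r"-(?P<release_base>\d+)\.(?P<cdh_release>cdh[\d.]+?)" # -1.cdh5.1.0
    r"\.p(?P<customer_patch>\d+)\.(?P<build>\d+)"          # .p0.34
    r"\.(?P<distribution>[^.]+)\.(?P<arch>[^.]+)\.rpm$"    # .el6.x86_64.rpm
)


def parse_cdh_rpm_name(file_name):
    """Split a CDH 5 RPM file name into its named fields (all strings)."""
    match = RPM_NAME.match(file_name)
    if match is None:
        raise ValueError("unrecognized package file name: " + file_name)
    return match.groupdict()


if __name__ == "__main__":
    example = "hbase-0.98.1+cdh5.1.0+64-1.cdh5.1.0.p0.34.el6.x86_64.rpm"
    for field, value in parse_cdh_rpm_name(example).items():
        print(field, "=", value)
```

Because the component name is matched lazily up to the last hyphen before the first plus sign, hyphenated names such as hbase-solr or pig-udf-datafu stay in one field.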

Fixed Issues in Cloudera Manager 5.1.2

Could not find a healthy host with CDH 5 on it to create HiveServer2 error during upgrade

When upgrading from CDH 4 to CDH 5, if no parcel is active then the error message "Could not find a healthy host with CDH5 on it to create HiveServer2" displays. This can happen when transitioning from packages to parcels, or if you explicitly deactivate the CDH 4 parcel (which is not necessary) before upgrade.

Workaround: Wait 30 seconds and retry the upgrade.

AWS installation wizard requires Java 7u45 to be installed on Cloudera Manager Server host

Cloudera Manager 5.1 installs Java 7u55 by default. However, the AWS installation wizard does not work with Java 7u55 due to a bug in the jClouds version packaged with Cloudera Manager.

Workaround:
  1. Stop the Cloudera Manager Server.
    $ sudo service cloudera-scm-server stop 
  2. Uninstall Java 7u55 from the Cloudera Manager Server host.
  3. Install Java 7u45 (which you can download from http://www.oracle.com/technetwork/java/javase/downloads/java-archive-downloads-javase7-521261.html#jdk-7u45-oth-JPR) on the Cloudera Manager Server host.
  4. Start the Cloudera Manager Server.
    $ sudo service cloudera-scm-server start
  5. Run the AWS installation wizard.
  Note: Due to a bug in Java 7u45 (http://bugs.java.com/bugdatabase/view_bug.do?bug_id=8014618), SSL connections between the Cloudera Manager Server and Cloudera Manager Agents and between the Cloudera Management Service and CDH processes break intermittently. If you do not have SSL enabled on your cluster, there is no impact.

The YARN property ApplicationMaster Max Retries has no effect in CDH 5

The issue arises because yarn.resourcemanager.am.max-retries was replaced with yarn.resourcemanager.am.max-attempts.

Workaround:
  1. Add the following to ResourceManager Advanced Configuration Snippet for yarn-site.xml, replacing MAX_ATTEMPTS with the desired maximum number of attempts:
    <property>
      <name>yarn.resourcemanager.am.max-attempts</name>
      <value>MAX_ATTEMPTS</value>
    </property>
  2. Restart the ResourceManager(s) to pick up the change.
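
To confirm which of the two property names a given yarn-site.xml actually sets, a small check along the following lines may help. This is a sketch under stated assumptions: it expects the standard Hadoop layout of property elements containing name and value children, the function name check_am_retry_properties is hypothetical, and the sample document is made up for illustration rather than taken from a real cluster.

```python
import xml.etree.ElementTree as ET

OLD_NAME = "yarn.resourcemanager.am.max-retries"   # replaced name, per this section
NEW_NAME = "yarn.resourcemanager.am.max-attempts"  # name that replaced it


def check_am_retry_properties(xml_text):
    """Return (old_name_present, new_value) for a Hadoop-style config document.

    new_value is None when the new property is not set.
    """
    root = ET.fromstring(xml_text)
    props = {}
    for prop in root.iter("property"):
        name = prop.findtext("name")
        if name is not None:
            props[name.strip()] = (prop.findtext("value") or "").strip()
    return OLD_NAME in props, props.get(NEW_NAME)


if __name__ == "__main__":
    # Made-up sample that still uses the replaced property name.
    sample = """<configuration>
      <property>
        <name>yarn.resourcemanager.am.max-retries</name>
        <value>4</value>
      </property>
    </configuration>"""
    old_present, new_value = check_am_retry_properties(sample)
    if old_present and new_value is None:
        print("Only the replaced property is set; add", NEW_NAME)
```

The function takes the file contents as a string, so it can be run against a copy of the configuration without touching the cluster.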

(BDR) Replications can be affected by other replications or commands running at the same time

Replications can be affected by other replications or commands running at the same time, causing them to fail unexpectedly or, in some cases, to be skipped silently. When this occurs, a StaleObjectException is logged to the Cloudera Manager logs. This is known to occur with as few as four replications starting at the same time.

CDH 5.x System Requirements:

Supported Operating Systems

Supported JDK Versions

Supported Databases