This is the documentation for CDH 4.7.0.
Documentation for other versions is available at Cloudera Documentation.

CDH Packaging and Tarball Information

Each CDH release series is made up of a collection of CDH project packages that are known to work together. The package version numbers of the CDH projects in each CDH release are listed in the following table.
Note: To see the details of all the changes and bug fixes for a given component in a given release, read the Changes file as well as the Release Notes, following the links in the tables below.

CDH Version 4.7.0 Packaging and Tarballs

To view the overall release notes for CDH 4.7.0, click here.

| Component | Package Version | Tarball | Release Notes | Changes File |
|---|---|---|---|---|
| DataFu | pig-udf-datafu-0.0.4+11 | Tarball | Release notes | Changes |
| Apache Flume | flume-ng-1.4.0+97 | Tarball | Release notes | Changes |
| Apache Hadoop | hadoop-2.0.0+1603 | Tarball | Release notes | Changes |
| Apache HBase | hbase-0.94.15+113 | Tarball | Release notes | Changes |
| Apache HCatalog | hcatalog-0.5.0+13 | Tarball | Release notes | Changes |
| Apache Hive | hive-0.10.0+258 | Tarball | Release notes | Changes |
| Hue | hue-2.5.0+240 | Tarball | Release notes | Changes |
| Apache Mahout | mahout-0.7+15 | Tarball | Release notes | Changes |
| Apache Oozie | oozie-3.3.2+102 | Tarball | Release notes | Changes |
| Parquet | parquet-1.2.5+71 | Tarball | Release notes | Changes |
| Parquet-format | parquet-format-1.0.0+4 | Tarball | Release notes | Changes |
| Apache Pig | pig-0.11.0+43 | Tarball | Release notes | Changes |
| Apache Sentry (incubating) | sentry-1.1.0+22 | Tarball | Release notes | Changes |
| Apache Sqoop | sqoop-1.4.3+94 | Tarball | Release notes | Changes |
| Apache Sqoop2 | sqoop2-1.99.2+99 | Tarball | Release notes | Changes |
| Apache Whirr | whirr-0.8.2+15 | Tarball | Release notes | Changes |
| Apache ZooKeeper | zookeeper-3.4.5+25 | Tarball | Release notes | Changes |

Examples of Versions

Cloudera packages are designed to be transparent and easy to customize. CDH 4 packages are labeled using the following format:

component-base_version+patch_level

where:

  • component-base_version is the version of the open-source component included in the CDH package.
  • patch_level is the number of source commits applied on top of the base version forked from the Apache Hadoop branch. The list of source commits includes all backports and all non-functional changes such as CDH packaging and branding commits. Note that the number of commits does not indicate the number of functional changes or bug fixes in the release; for example, a commit may only amend a version number or make other non-functional changes. The patches that make up this count are in the CDH tarball's ../cloudera/patches/ directory. All of the source commits Cloudera has applied are also available in source form in that directory.

For example:

| Package | Component | Branch | Base Version | Patch Level |
|---|---|---|---|---|
| hadoop-2.0.0+1603 | hadoop | 2.0 | 2.0.0 | 1603 |
| hue-2.5.0+240 | hue | 2.5 | 2.5.0 | 240 |
| parquet-1.2.5+71 | parquet | 1.2 | 1.2.5 | 71 |
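
The labeling scheme above can be split mechanically. The following is a minimal sketch, not Cloudera tooling: the names parse_cdh_label and CdhPackageVersion are hypothetical, and deriving the branch from the first two fields of the base version is an assumption inferred from the example rows (2.0.0 gives 2.0), not a documented rule.

```python
import re
from typing import NamedTuple


class CdhPackageVersion(NamedTuple):
    component: str
    branch: str
    base_version: str
    patch_level: int


# Format: component-base_version+patch_level.
# The component may itself contain hyphens (for example parquet-format),
# so the base version is the last hyphen-separated field of digits and dots.
_LABEL_RE = re.compile(
    r"^(?P<component>.+)-(?P<base>\d+(?:\.\d+)*)\+(?P<patch>\d+)$"
)


def parse_cdh_label(label: str) -> CdhPackageVersion:
    """Split a CDH 4 package label into its parts."""
    match = _LABEL_RE.match(label.strip())
    if match is None:
        raise ValueError(f"not a CDH 4 package label: {label!r}")
    base = match.group("base")
    # Assumption: the branch is the first two fields of the base version.
    branch = ".".join(base.split(".")[:2])
    return CdhPackageVersion(
        match.group("component"), branch, base, int(match.group("patch"))
    )


if __name__ == "__main__":
    for label in ("hadoop-2.0.0+1603", "hue-2.5.0+240", "parquet-1.2.5+71"):
        print(parse_cdh_label(label))
```

The greedy component match is what lets multi-part names such as pig-udf-datafu-0.0.4+11 parse with the full component name intact.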

CDH Package Manifests

Both the CDH patched source and packages contain explicit information about Cloudera modifications. For example, in the patched source there is a top-level cloudera directory with:

  • A CHANGES.cloudera.txt file that lists all the changes to the pristine source
  • A patches directory that contains every patch Cloudera has applied to the pristine source. All Cloudera patches are released with an Apache 2.0 license.
  • A files directory for files Cloudera created from scratch, such as man pages and configuration files. All Cloudera files are released with an Apache 2.0 license.
  • A README.cloudera file that explains explicitly how to recreate the patched source from the pristine source.
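
Because the patch level is described as the number of applied source commits, one way to sanity-check an unpacked source tree is to count the entries in its cloudera/patches directory. The sketch below is illustrative only: the function names are hypothetical, and it assumes that every regular file in that directory is exactly one patch, which the text above does not spell out.

```python
from pathlib import Path


def count_patches(source_root: str) -> int:
    """Count regular files in <source_root>/cloudera/patches.

    Assumption: each regular file in the directory is one patch.
    """
    patches_dir = Path(source_root) / "cloudera" / "patches"
    if not patches_dir.is_dir():
        raise FileNotFoundError(f"no patches directory at {patches_dir}")
    return sum(1 for entry in patches_dir.iterdir() if entry.is_file())


def patch_level_matches(source_root: str, package_label: str) -> bool:
    """Compare the patch count with the number after the final '+'."""
    patch_level = int(package_label.rsplit("+", 1)[1])
    return count_patches(source_root) == patch_level
```

For an unpacked Hadoop source tree, a call such as patch_level_matches(path_to_tree, "hadoop-2.0.0+1603") would be expected to return True if the assumption holds; path_to_tree stands for wherever the tarball was extracted.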

Build and Release Numbering

If you are installing CDH 4 with a package manager, you will also see build and release information as part of the file name. The build and package release fields follow the patch level, for example: hbase-0.92.0+8-1.cdh4.0.0b2.p0.14.el6.noarch.rpm. The suffix -1.cdh4.0.0b2.p0.14.el6.noarch represents:

  • the base of the release field (1)
  • the CDH release (cdh4.0.0b2)
  • the customer patch identifier (p0, which is 0 for all regular CDH releases and increments for customer patches)
  • the build number (14)
  • the distribution (el6 = RHEL/CentOS 6, el5 = RHEL/CentOS 5, sles11 = SLES 11)
  • the processor architecture (noarch, x86_64, i386, amd_64). noarch means the packages are not architecture-specific.
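
The fields listed above can be pulled out of a package file name with a single regular expression. This is a rough sketch under stated assumptions: parse_rpm_name and the field names are hypothetical, only the .rpm layout of the example above is handled, and the CDH release is assumed to start with "cdh" and to end where the ".pN" customer patch field begins.

```python
import re

_RPM_RE = re.compile(
    r"""
    ^(?P<component>.+)-(?P<base_version>\d+(?:\.\d+)*)  # component, base version
    \+(?P<patch_level>\d+)                              # patch level
    -(?P<release_base>\d+)                              # base of the release field
    \.(?P<cdh_release>cdh.+?)                           # CDH release
    \.p(?P<customer_patch>\d+)                          # customer patch identifier
    \.(?P<build>\d+)                                    # build number
    \.(?P<distribution>[^.]+)                           # distribution
    \.(?P<arch>[^.]+)                                   # processor architecture
    \.rpm$
    """,
    re.VERBOSE,
)


def parse_rpm_name(file_name: str) -> dict:
    """Return the named fields of a CDH 4 RPM file name as strings."""
    match = _RPM_RE.match(file_name)
    if match is None:
        raise ValueError(f"unrecognized CDH 4 RPM file name: {file_name!r}")
    return match.groupdict()


if __name__ == "__main__":
    name = "hbase-0.92.0+8-1.cdh4.0.0b2.p0.14.el6.noarch.rpm"
    for field, value in parse_rpm_name(name).items():
        print(f"{field}: {value}")
```

All values are returned as strings so that release identifiers such as cdh4.0.0b2 are preserved exactly as they appear in the file name.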