This is the documentation for CDH 5.0.x. Documentation for other versions is available at Cloudera Documentation.

Requirements

  1. The CDH 5 cluster must have a MapReduce service running on it. This may be MRv1 or YARN (MRv2).
  2. All the MapReduce nodes in the CDH 5 cluster should have full network access to all the nodes of the source cluster. This allows you to perform the copy in a distributed manner.
  Note:

The term source refers to the CDH 4 (or other Hadoop) cluster you want to migrate or copy data from; and destination refers to the CDH 5 cluster.

Page generated September 3, 2015.