Migrating Data between a CDH 4 and CDH 5 Cluster

You can migrate the data from a CDH 4 (or any Apache Hadoop) cluster to a CDH 5 cluster by using a tool that copies out data in parallel, such as the DistCp tool offered in CDH 5. This can be useful if you are not planning to upgrade your CDH 4 cluster itself at this point. The following sections provide information and instructions: