Cloudera Backup and Disaster Recovery (BDR) is available with a Cloudera Enterprise license. Enterprise BDR lets you replicate data from one cluster to another, or from one directory path to another on the same or on a different cluster. In case of data loss, the backup replica can be used to restore data to the production cluster.
The time to start thinking about how to restore data is long before you might ever need to do so. These BDR tutorials take you step-by-step through the process of backing up an example production cluster. The example backup replication schedules are for one-time replication that makes a backup copy of Hive datasets or of HDFS files, respectively, on another cluster designated as a backup cluster.
The restore processes detailed in each tutorial also take you step-by-step through the process of restoring data using two different general approaches:
- How To Back Up and Restore Apache Hive Data Using Cloudera Enterprise BDR highlights a one-off data recovery scenario in which you create the replication schedule immediately after a data loss and use it to restore data.
- How To Back Up and Restore HDFS Data Using Cloudera Enterprise BDR shows you how pre-configure replication schedules so they are available when needed.
Use either or both of these tutorials to help plan your own backup and restore strategy.