Cloudera Essentials for Apache Hadoop | Chapter 3: Hadoop Basic Concepts

Learn about the Hadoop Distributed File System (HDFS): what it is and how it works, MapReduce features and HDFS interaction, and the general topology of a Hadoop cluster.

Date: Friday, Mar 30 2012


There are many components working together in the Apache Hadoop stack. By understanding how each functions, you gain more insight into Hadoop’s functionality in your own IT environment. Dissecting the Apache Hadoop Stack goes beyond the motivation for Apache Hadoop and dissects the Hadoop Distributed File System (HDFS), MapReduce, and the anatomy of a Hadoop cluster.

In this chapter, you will learn:

  • What Hadoop is
  • What features HDFS provides
  • The concepts behind MapReduce
  • How a Hadoop cluster operates

Next Steps