There are many components working together in the Apache Hadoop stack. By understanding how each functions, you gain more insight into Hadoop’s functionality in your own IT environment. Dissecting the Apache Hadoop Stack goes beyond the motivation for Apache Hadoop and dissects the Hadoop Distributed File System (HDFS), MapReduce, and the anatomy of a Hadoop cluster.
In this chapter, you will learn:
- What Hadoop is
- What features HDFS provides
- The concepts behind MapReduce
- How a Hadoop cluster operates