Various projects make up the Apache Hadoop ecosystem, and each improves data storage, management, interaction, and analysis in its own unique way. This chapter takes a close look at these projects, including Hive, Pig, Impala, HBase, Flume, Sqoop, and Oozie, how they function within the stack, and how they help you integrate Hadoop within your environment.
In this chapter, you will learn:
- What other projects exist around core Hadoop
- When to use HBase
- The differences between Hive, Pig, and Impala
- How Flume is typically deployed
- Features of Cloudera Search