Hadoop Training: Writing MapReduce Programs
pdf Download Date

Wednesday, January 27th, 2010

Description

Now that you're familiar with the tools, and have some ideas about how to write a MapReduce program, this exercise will challenge you to perform a common task when working with large data sets - building an inverted index. More importantly, it teaches you the basic skills you need to write your own, more interesting data processing jobs.

Duration

About 90 Minutes

Materials

Virtual Machine: Cloudera Virtual Machine

Next Steps

Introduction to Hive