Flume: An Introduction

Introduction to Flume: a tool that collects data, usually log files, and controls its flow from servers into a destination such as HDFS.

Date: Tuesday, Apr 26 2011

Description

Jonathan Hsieh introduces Flume to the Apache Hadoop community. Flume is used to collect data, usually log files, and to control the flow of that data from servers into a destination such as HDFS.