Open Source

Open source is a central part of Cloudera’s business. We recognize that Hadoop’s success comes in large part from being an open platform. To further this goal, Cloudera’s Distribution for Hadoop is 100% open source, Apache licensed.

Cloudera does not just package and distribute open source software, we actively contribute to it. More than 50% of Cloudera engineering investment goes back to Apache-licensed open source projects. Projects Cloudera contributes to include:

Name Project URL
Apache Avro http://avro.apache.org/
Apache Hadoop Common http://hadoop.apache.org/common
Apache HBase http://hbase.apache.org
Apache HDFS http://hadoop.apache.org/hdfs
Apache Hive http://hadoop.apache.org/hive
Apache MapReduce http://hadoop.apache.org/mapreduce
Apache Pig http://hadoop.apache.org/pig
Apache Whirr http://incubator.apache.org/projects/whirr.html
Apache ZooKeeper http://hadoop.apache.org/zookeeper
Crepo http://github.com/cloudera/crepo
Flume http://github.com/cloudera/flume
Hadoop LZO http://github.com/toddlipcon/hadoop-lzo
Hue http://github.com/cloudera/hue
JCarder http://github.com/toddlipcon/jcarder
MooTools http://github.com/mootools/mootools-core
Oozie http://github.com/cwsteinbach/oozie1
Sqoop http://github.com/cloudera/sqoop