Cloudera named a leader in 2022 Gartner® Magic Quadrant™ for Cloud Database Management Systems Get the report

CDH is Cloudera's software distribution containing Apache Hadoop and related projects. All components are 100% open source (Apache License); see Release Notes. Unless otherwise specified, use these installation instructions for all CDH components.

Apache Avro

Release: 1.8.2
Data serialization: rich data structures, a fast/compact binary format, and RPC.

Apache Kudu

Release:1.6.0
Completes Hadoop's storage layer to enable fast analytics on fast data.

Kite SDK

Release: 1.0.0
APIs, examples, and docs for building apps on top of Hadoop. Only in CDH!

Apache Flume

Release: 1.9.0
Collects/aggregates event data and streams it into HDFS or HBase in real time.

Apache Hadoop

Release: 2.6.0
Infinitely scalable storage, resource management, and processing.

Apache HBase

Release: 1.2.0
Scalable record and table storage for Hadoop with random read/write access.

Apache Hive

Release: 1.1.0
SQL framework for doing batch transformation (ETL) of Hadoop data.

Apache Impala

Release: 2.9.0
For high-concurrency, low-latency SQL queries across HDFS, S3, or HBase.

Apache Kafka®

Release: 0.9.0
Kafka is distributed, resilient, publish-subscribe messaging service.

Apache Oozie

Release: 4.1.0
A workflow scheduler for managing all your Hadoop jobs efficiently.

Apache Parquet

Release: 1.5
Provides compressed, efficient columnar data representation in Hadoop.

Apache Sqoop

Release: 1.4.7 / 1.99.5
Moves data across relational databases and HDFS in a highly scalable way.

Apache Pig

Release: 0.12.0
Offers a framework for batch analysis of large data sets using a high-level language.

Apache Sentry

Release: 1.5.1
Provides granular support, role-based access control for Hadoop users.

Apache Spark™

Release: 2.2.1
Does in-memory processing to make jobs faster and easier to write.

HUE

Release: 3.12.0
Web-based GUI that makes it easy for users to work with Hadoop data.

Apache ZooKeeper

Release: 3.4.5
Highly reliable distributed coordination service used in HBase, among other places.

Cloudera Search

Release: 1.0.0
Free-text, Google-style search of Hadoop data for business users. Only in CDH!

Your form submission has failed.

This may have been caused by one of the following:

  • Your request timed out
  • A plugin/browser extension blocked the submission. If you have an ad blocking plugin please disable it and close this message to reload the page.