Cloudera for Developers and Admins
CDH, Cloudera's 100% open-source distribution of Apache Hadoop and related projects, offers new approaches to data processing and analytics via a variety of APIs and frameworks, in batch or real time. And via Cloudera Manager or its API, admins can deploy, manage, and monitor CDH clusters efficiently in a production environment.
What's New
Meet the Project Founder: Roman Shaposhnik
May 17, 2013
This installment of “Meet the Project Founder” features Apache Bigtop founder and PMC Chair/VP Roman Shaposhnik.
How-to: Configure Eclipse for Hadoop Contributions
May 15, 2013
Contributing to Apache Hadoop or writing custom pluggable modules requires modifying Hadoop’s source code. This how-to covers configuring Eclipse to do that.
Fresh and Hot: HBaseCon 2013 Schedule Finalized!
May 14, 2013
If you lacked motivation to register up until this point, we think that this session line-up will convince you otherwise. HBaseCon is simply an event that you can’t afford to miss.
Migrating Our Hadoop Cluster from CDH3 to CDH4 (via Oliver Meyn)
May 14, 2013
We've written a number of times on the initial setup, eventual upgrade, and continued tuning of our Hadoop cluster. Our latest project has been upgrading from CDH3u3 to CDH4.2.1.
How-to: Automate Your Hadoop Cluster from Java
May 13, 2013
One of the complexities of Apache Hadoop is the need to deploy clusters of servers, potentially on a regular basis. Learn an approach to cluster automation, based on the Cloudera Manager API, that works for Cloudera.
Metrics2: The New Hotness for Apache HBase Metrics
May 8, 2013
An overview of the new Metrics system for HBase, Metrics2.
Cloudera Development Kit (CDK): Hadoop Application Development Made Easier
May 7, 2013
We’re really excited to announce the Cloudera Developer Kit (CDK), a new open source project designed to help developers get up and running to build applications on CDH.