Blog Archives
May 2012
- Apache HBase 0.94 is now released
- Meet the Presenter: Todd Lipcon
- Cloudera Manager 4.0 Beta released
- CDH3 update 4 is now available
- Meet the Presenters: Aaron Myers from Cloudera and Suresh Srinivas from Hortonworks
- Announcing Apache Hive 0.9.0
- How Treato Analyzes Health-related Social Media Big Data with Hadoop and HBase
- Apache MRUnit 0.9.0-incubating has been released!
April 2012
- HBaseCon 2012: A Glimpse into the Operations Track
- Introducing CDH4 Beta 2
- HBaseCon 2012: A Glimpse into the Development Track
- Constructing Case-Control Studies With Hadoop
- Sqoop Graduation Meetup
- HBase Hackathon at Cloudera
- HBaseCon 2012: A Glimpse into the Applications Track
- Apache Bigtop 0.3.0 (incubating) has been released
- Apache Sqoop Graduates from Incubator
- Apache Hadoop Versions: Looking Ahead
March 2012
- March 2012 Bay Area HBase User Group meetup summary
- Apache HBase 0.92.1 now available
- Cloudera Manager 3.7.4 released!
- Apache ZooKeeper 3.3.5 has been released
- Authorization and Authentication In Hadoop
- Apache HBase 0.90.6 is now available
- Apache MRUnit 0.8.1-incubating has been released!
- HBase + Hadoop + Xceivers
- Real-Time Your Hadoop! Join us at HBaseCon 2012
- High Availability for the Hadoop Distributed File System (HDFS)
- Cloudera Manager | Activity Monitoring & Operational Reports Demo Video
- Thoughts on Cloudera and Cisco UCS reference architecture for Hadoop
- Indexing Files via Solr and Java MapReduce
February 2012
- Cloudera Manager | Hadoop Service Monitoring Demo Video
- MapReduce 2.0 in Hadoop 0.23
- Cloudera Manager | Log Management, Event Management and Alerting Demo Video
- Apache ZooKeeper 3.4.3 has been released
- Cloudera Manager | Service and Configuration Management Demo Videos
- Introducing CDH4
- Cloudera Connector for Tableau Has Been Released
January 2012
- CDH3, update 3 now available
- January 2012 Bay Area HBase User Group meetup summary + HBaseCon announcement
- Seismic Data Science: Reflection Seismology and Hadoop
- Apache HBase 0.92.0 has been released
- Hadoop World 2011 Videos and Slides Available
- Apache Sqoop: Highlights of Sqoop 2
- Capacity Planning with Cloudera Manager
- Cloudera Manager - Thank You Customers!
- Oracle selects CDH and Cloudera Manager as the Apache Hadoop Platform for the Oracle Big Data Appliance
- Hadoop Selected for the InfoWorld 2012 Technology of the Year Award
- Hadoop in 2011
- An update on Apache Hadoop 1.0
- Caching in HBase: SlabCache
- Cloudera Connector for Teradata 1.0.0
- Hadoop for Archiving Email - Part 2
- What's New in Apache Sqoop 1.4.0-incubating
December 2011
- Apache ZooKeeper 3.4.2 has been released
- Apache HBase 0.90.5 is now available
- How I found Hadoop
- Apache Whirr 0.7.0 has been released
- Apache Avro at RichRelevance
- Notes from the Flume NG Hackathon
- My Internship at Cloudera
- Apache ZooKeeper 3.4.1 has been released
- Cloudera Manager 3.7 released
- Apache Flume - Architecture of Flume NG
- Crunch for Dummies
- FoneDoktor, A WibiData Application
- Apache HBase Pow-wow Summary 11/29/2011
November 2011
- Recommendation with Apache Mahout in CDH3
- Apache ZooKeeper 3.3.4 has been released
- Apache ZooKeeper 3.4.0 has been released
- Inaugural Sqoop Meetup
- Coming Attractions: Apache Hive 0.8.0
- Using Apache Hadoop to Find Signal in the Noise: Analyzing Adverse Drug Events
- Hadoop World 2011 Final Remarks
- Building and Deploying MR2
- Apache Hadoop 0.23.0 has been released
- CDH3u2: Apache Mahout Integration
October 2011
- Attend a Meetup Surrounding the Hadoop World Conference
- CDH3 update 2 is released
- Hadoop World 2011: A Glimpse into Operations
- Nominations Are Open for the 2011 Government Big Data Solutions Award
- Hadoop World 2011: A Glimpse into Development
- Introducing Crunch: Easy MapReduce Pipelines for Hadoop
- Apache Sqoop - Overview
- Hadoop World 2011: A Glimpse into Enterprise Architecture
- The Community Effect
- My Summer Internship at Cloudera
September 2011
- Hadoop World 2011: A Glimpse into Business Solutions
- Hadoop for Archiving Email
- Hadoop World 2011: A Glimpse of the Applications Track
- Hadoop Applied
- Hadoop Tuesdays: Get a Handle on Unstructured Data with a 7-part Webinar Series Led By Cloudera and Informatica
- Snappy and Hadoop
- Cloudera Training for Apache Hadoop and Certification at Hadoop World
- Which free book will you choose at Hadoop World? Hadoop or HBase?
July 2011
- CDH3 Update 1 Released
- Hoop - Hadoop HDFS over HTTP
- RecordBreaker: Automatic structure for your text-formatted data
- Evolution of Hadoop Ecosystem: AOL Advertising Experience
- Cloudera Service and Configuration Manager Express Edition Screencast (Director's Cut)
- Data Interoperability with Apache Avro
- SCM Express: Now Anyone Can Experience the Power of Apache Hadoop
- The Only Full Lifecycle Management for Apache Hadoop: Introducing Cloudera Enterprise 3.5 and SCM Express
June 2011
- Shopzillas Apache Hadoop Hackathon: Learning To Contribute
- If 80% of data is unstructured, is it the exception or a new rule?
- Reflections from Enzee Universe 2011
- Migrating from Elastic MapReduce to a Clouderas Distribution including Apache Hadoop Cluster
- Biodiversity Indexing: Migration from MySQL to Hadoop
- CDH 3 Demo VM installation on Mac OS X using VirtualBox
May 2011
April 2011
- An Attendee Perspective On Chicago Data Summit
- Solve this Brain Buster for a chance to win a Doug Cutting Bobblehead at the Chicago Data Summit
- HBase Do's and Don'ts
- CDH3 goes GA
- Simple Moving Average, Secondary Sort, and MapReduce (Part 3)
- Adopting Apache Hadoop in the Federal Government
- MapIncrease
March 2011
- London Apache Hadoop User Group Meeting Summarized
- Learn about Apache Hadoop at the Chicago Data Summit
- We messed up.
- Rapleaf Uses Hadoop to Efficiently Scale with Terabytes of Data
- Simple Moving Average, Secondary Sort, and MapReduce (Part 2)
- Simple Moving Average, Secondary Sort, and MapReduce (Part 1)
- Avoiding Full GCs in HBase with MemStore-Local Allocation Buffers: Part 3
- Flume Community Office Hours @ Cloudera HQ, 2/28/2011
February 2011
- Avoiding Full GCs in HBase with MemStore-Local Allocation Buffers: Part 2
- Supported Operating Systems in CDH3
- Gratuitous Hadoop: Stress Testing on the Cheap with Hadoop Streaming and EC2
- Avoiding Full GCs in HBase with MemStore-Local Allocation Buffers: Part 1
- CDH3 Beta 4 Now Available
- Log Event Processing with HBase
- An emerging data management architectural pattern behind interactive web applications
- Strategies for Exploiting Large-scale Data in the Federal Government
- Cloudera in The Cube with Silicon Angle TV at Strata Conference 2011
- Wordnik Bypasses Processing Bottleneck with Hadoop
- Hadoop Availability
- Distributed Flume Setup With an S3 Sink
- Make your Hadoop voice heard!
- Upcoming Apache Hadoop Training Sessions
- Some News Related to the Apache Hadoop Project
January 2011
- CDH2 Update 3 Now Available
- Lessons Learned from Cloudera's Hadoop Developer Training Course
- Introducing Alfredo, Kerberos HTTP SPNEGO for Java
- Top 10 Blog Posts of 2010
- Hadoop I/O: Sequence, Map, Set, Array, BloomMap Files
- How to Include Third-Party Libraries in Your Map-Reduce Job
- Setting up CDH3 Hadoop on my new Macbook Pro
- Configuring Security Features in CDH3
- 2010 Cloudera Apache Hadoop Webinars
- Map-Reduce With Ruby Using Apache Hadoop
December 2010
- New Features in Apache Pig 0.8
- A profile of Apache Hadoop MapReduce computing efficiency (continued)
- A profile of Apache Hadoop MapReduce computing efficiency
- Cloudera and Pentaho team up to simplify data management and business intelligence
- Lessons learned putting Hadoop into production
- Hadoop World 2010 Tweet Analysis
November 2010
- Hadoop Log Location and Retention
- Hadoop training coming to new cities in 2011
- Do the Schimmy: Efficient Large-Scale Graph Analysis with Hadoop, Part 2
- Hadoop and HBase at RIPE NCC
- Do the Schimmy: Efficient Large-Scale Graph Analysis with Hadoop
- Integrating Hadoop in your Existing DW and BI Environment
- Better Workflow Management in CDH with Oozie 2
- Tackling Large Scale Data in Government
- Cloudera Fun & Frightful Halloween Festivities
September 2010
- Hadoop World: More is better!
- Top 10 Reasons to Attend Hadoop World
- Twitter Analytics Lead, Kevin Weil, and a Presenter at Hadoop World Interviewed
- More on Cloudera Enterprise
- Whats Going On Surrounding Hadoop World
- What is in our Kitchen?
- Using Flume to Collect Apache 2 Web Server Logs
- HUE SDK Training - NYC
- CDH2 Update 2 Now Available
- Hadoop World Presentation Track Release
- A Summer Internship with Cloudera
- New York Training Session for Managers Interested In Hadoop
- Flume community update: September 2010
- Purdue Universitys Saptarshi Guha Interviewed Regarding Hadoop, R and Hadoop World
- A Look Back at August Posts
- Tracing with Avro
- Infochimp's President, Philip Kromer, Interviewed Regarding Hadoop and Hadoop World
- Register for Hadoop Training in New York and Get into Hadoop World for Free!
August 2010
- Hadoop World 2010: Speaker Highlights
- Whats New in Apache Hadoop 0.21
- Using Hadoop for Fraud Detection and Prevention
- Hadoop Administrator Training Comes to London
- Improving Hotel Search: Hadoop @ Orbitz Worldwide
- Hadoop World: NYC - Training
- Hadoop/HBase Capacity Planning
- Avoiding Common Hadoop Administration Issues
- CDH3b2 Release Recap
- Cloudera's Henry Robinson to speak at Hadoop Day in Seattle
- Hadoop World: early-bird rate ends on August 11
- Flume community update - the first 30 days!
- Migrating to CDH
July 2010
- How to Get a Job at Cloudera
- Notes From the Hackathon at Cloudera
- Upcoming webinar: 10 Common Hadoop-able Problems
- Announcing Two New Training Classes from Cloudera: Introduction to HBase and Analyzing Data with Hive and Pig
- What's New in CDH3b2: Hive
- Developing Applications for HUE
- What's New in CDH3b2: HUE
- Rackspaces OpenStack shows the way for public cloud vendors
- Whats New in CDH3b2: Sqoop
- Hacking with Cloudera on CDH
- What's New in CDH3b2: Oozie
- What's New in CDH3b2: Pig
- What's New in CDH3b2: Flume
- What's New in CDH3b2: ZooKeeper
- What's New in CDH3b2: HBase
- What's New in CDH3b2: Core Hadoop
- More on Cloudera's Distribution including Apache Hadoop 3
June 2010
- CDH3 and Cloudera Enterprise
- Upcoming webinar: Tackling Big Data Challenges with Vertica and Hadoop
- Cloudera Hosting Hadoop World 2010: Call for Speakers Now Open
- Cloudera to participate at OSCON 2010
- Integrating Hive and HBase
- One word more...
- A transition
- Reporting from the UK Hadoop Users Group
- Considerations for Hadoop and BI (part 2 of 2)
- The Second Apache Hadoop HDFS and MapReduce Contributors Meeting
May 2010
April 2010
- Exciting new Hadoop Training Offerings from Cloudera
- CAP Confusion: Problems with 'partition tolerance'
- Get Hadoop Training from Cloudera at the Hadoop Summit
- Cloudera Hadoop Training Spreads Worldwide
- Cloudera Has Moved!
- Scaling Social Science with Hadoop
- Pushing the Limits of Distributed Processing
March 2010
- Cloudera's Support Team Shares Some Basic Hardware Recommendations
- CDH3 Beta 1 Now Available
- CDH2 is released
- How Raytheon BBN Technologies Researchers are Using Hadoop to Build a Scalable, Distributed Triple Store
- HBase User Group #9: HBase and HDFS
- Natural Language Processing with Hadoop and Python
- Why Europe's Largest Ad Targeting Platform Uses Hadoop
- Trip Report: Utah Java User's Group
- Avro 1.3.0
January 2010
December 2009
- Hadoop World: Making Hadoop Easy on Amazon Web Services
- Hadoop World: Hadoop Applications at Yahoo!
- 7 Tips for Improving MapReduce Performance
- Observers: Making ZooKeeper Scale Even Further
- Hadoop World: Sqoop - Database Import for Hadoop
- Hadoop World: Security and API Compatibility
- Hadoop World: Hadoop for Bioinformatics
November 2009
- Hadoop World: Practical HBase from Jonathan Gray and Ryan Rawson
- Hadoop World: Hadoop + Vertica from Omer Trajman
- Hadoop World: Hadoop + Clojure from Stuart Sierra and Tim Dysinger
- Hadoop World: Protein Alignment from Paul Brown
- Hadoop at Twitter (part 1): Splittable LZO Compression
- Hadoop World: Rethinking the Data Warehouse with Hadoop and Hive from Ashish Thusoo
- Hadoop World: Monitoring Best Practices from Ed Capriolo
- Avro: a New Format for Data Interchange
October 2009
September 2009
July 2009
May 2009
- Common Questions and Requests From Our Users
- Building a distributed concurrent queue with Apache ZooKeeper
- Announcing Cloudera Certification for Hadoop
- Announcing Hadoop World: NYC 2009: RFP Open
- Protecting per-DataNode Metadata
- 10 MapReduce Tips
- 5 Common Questions About Hadoop
- Using Cloudera's Hadoop AMIs to process EBS datasets on EC2
- Whats New in Hadoop Core 0.20
- High Energy Hadoop
April 2009
- Debian packages for Hadoop
- Pig Training Now Available Online
- Using Hadoop to Annotate Billions of Web Documents with Semantics
- The Second Hadoop UK User Group Meeting
- Configuring Eclipse for Hadoop Development (a screencast)
- Hive and JobTracker Needed Logos...
- Cloudera's Distribution for Hadoop: Making Hadoop Easier for a Sysadmin
- Upcoming Functionality in "Fair Scheduler 2.0"
March 2009
January 2009
November 2008
- Overview
- Downloads
- Learn Hadoop
- Get Support
-
Blog
- Avro (11)
- Careers (10)
- CDH (29)
- Cloudera Manager (10)
- Cloudera's Service And Configuration Manager (6)
- Community (86)
- Connector (6)
- Data Collection (13)
- Distribution (34)
- Flume (6)
- General (237)
- Guest (35)
- Hadoop (146)
- HBase (40)
- HDFS (26)
- Hive (22)
- MapReduce (36)
- Oozie (4)
- Pig (15)
- Sqoop (9)
- Testing (5)
- Training (18)
- Use Case (11)
- Whirr (1)
- ZooKeeper (10)
- Archives by Month
