Resource Library

Cloudera offers a variety of materials on big data consolidation, storage and processing. The library includes high-level overviews as well as detailed information on Apache Hadoop and the surrounding ecosystem.

  1. /content/cloudera/en/resources/library/hbasecon2014/state-of-hbase--meet-the-release-managers/jcr:content/mainContent/resourcecomponent.img.jpg/1418343800265.jpg
    State of HBase: Meet the Release Managers
    • Monday, May 05 2014
    • Category: HBaseCon, Video, Presentation
    HBase release managers Lars Hofhansl, Andrew Purtell, Enis Soztutar, Michael Stack, and Liyin Tang jointly present highlights from their releases, and take your questions throughout.
  2. /content/cloudera/en/resources/library/hbasecon2014/a-graph-service-for-global-web-entities-traversal-and-reputation-ppt/jcr:content/mainContent/resourcecomponent.img.png/1418342561185.png
    A Graph Service for Global Web Entities Traversal and Reputation Evaluation Based on HBase
    • Monday, May 05 2014
    • Category: HBaseCon, Presentation Slides
    This presentation covers what problems we try to solve, what and how the design decisions we made, how we design such a graph model, and the graph computation tasks involved.
  3. /content/cloudera/en/resources/library/hbasecon2014/tasmo--building-hbase-applications-from-event-streams/jcr:content/mainContent/resourcecomponent.img.jpg/1418343800265.jpg
    Tasmo: Building HBase Applications From Event Streams
    • Monday, May 05 2014
    • Category: HBaseCon, Video, Presentation
    Tasmo is a system that enables application development on top of event streams and HBase.
  4. /content/cloudera/en/resources/library/hbasecon2014/tales-from-the-cloudera-field---operations-session-4/jcr:content/mainContent/resourcecomponent.img.jpg/1418343800265.jpg
    Tales from the Cloudera Field - Operations Session 4
    • Monday, Jun 16 2014
    • Category: Presentation, HBaseCon, Video
    From supporting the 0.90.x, 0.92, 0.94, and 0.96 HBase installations on clusters ranging from tens to hundreds of nodes, Cloudera has seen it all. Having automated the upgrade paths from the different Apache releases, we have developed a smooth path that can help the community with upcoming upgrades. In addition to automation best practices, in this talk you'll also learn proactive configuration tweaks and operational best practices to keep your HBase cluster always up and running. We'll also walk through how to contain an application bug let loose in production, to minimize the impact on HBase posed by faulty hardware, and the direct correlation between inefficient schema design and HBase performance.
  5. /content/cloudera/en/resources/library/hbasecon2014/hbase--just-the-basics/jcr:content/mainContent/resourcecomponent.img.jpg/1418343800265.jpg
    HBase: Just the Basics
    • Monday, May 05 2014
    • Category: Video, Presentation, HBaseCon
    A brief Cliff's Notes-level talk covering architecture, API, and schema design
  6. /content/cloudera/en/resources/library/hbasecon2014/blackbird--storing-billions-of-rows-a-couple-of-milliseconds-awa/jcr:content/mainContent/resourcecomponent.img.jpg/1418343800265.jpg
    Blackbird: Storing Billions of Rows a Couple of Milliseconds Away
    • Monday, May 05 2014
    • Category: HBaseCon, Presentation, Video
    Would you use HBase to make billions of rows available for real-time lookup under 10 ms with 99% guarantee? We, at Rocket Fuel, do just that.
  7. /content/cloudera/en/resources/library/hbasecon2014/cross-site-bigtable-using-hbase-ppt/jcr:content/mainContent/resourcecomponent.img.png/1418342561185.png
    Cross-Site BigTable using HBase
    • Monday, May 05 2014
    • Category: HBaseCon, Presentation Slides
    As HBase continues to expand in application and enterprise or government deployments, there is a growing demand for storing data across geographically distributed datacenters for improved availability and disaster recovery.
  8. /content/cloudera/en/resources/library/hbasecon2014/real-time-hbase--lessons-from-the-cloud---ppt/jcr:content/mainContent/resourcecomponent.img.png/1418342561185.png
    Real-time HBase: Lessons from the Cloud
    • Monday, May 05 2014
    • Category: Presentation Slides, HBaseCon
    Running HBase in real time in the cloud provides an interesting and ever-changing set of challenges -- instance types are not ideal, neighbors can degrade your performance, and instances can randomly die in unanticipated ways. This talk will cover what HubSpot has learned about running in production on Amazon EC2, how to handle DR and redundancy, and the tooling the team has found to be the most helpful.
  9. /content/cloudera/en/resources/library/hbasecon2014/from-mongodb-to-hbase-in-six-easy-months/jcr:content/mainContent/resourcecomponent.img.png/1418342561185.png
    From MongoDB to HBase in Six Easy Months
    • Monday, May 05 2014
    • Category: Presentation Slides, HBaseCon
    Pushing well past MongoDB's limits (2TB data every week) is an interesting exercise in operational frustration. It also severely hampers flexibility of design for new use cases. This talk covers the architectural journey from MongoDB/Redis to HBase at Optimizely -- including the performance, design flexibility, speed of implementation, and other gains made. It also covers the operational setup needed to monitor and maintain the system as well as lessons learned from the migration process itself.
  10. /content/cloudera/en/resources/library/hbasecon2014/bulk-loading-in-the-wild--ingesting-the-world-s-energy-data/jcr:content/mainContent/resourcecomponent.img.jpg/1418343800265.jpg
    Bulk Loading in the Wild: Ingesting the World's Energy Data
    • Monday, May 05 2014
    • Category: CDH, Presentation, Video
    HBase is designed to store your big data and provide low latency random access to that data. One of its most compelling features is Bulk Loading, which enables the generation of HFiles that can then be passed to the RegionServers. Opower's energy insights platform uses it to ingest the hundreds of millions of meter reads it receives daily from its partner utility companies. This presentation will walk you through the HBase Bulk Loading process and Opower's adoption of it as an important piece of its HBase ecosystem.