Resource Library

Cloudera offers a variety of materials on big data consolidation, storage and processing. The library includes high-level overviews as well as detailed information on Apache Hadoop and the surrounding ecosystem.

  1. Blackbird: Storing Billions of Rows a Couple of Milliseconds Away
    • Monday, May 05 2014
    • Category: HBaseCon, Presentation, Video
    Would you use HBase to make billions of rows available for real-time lookup in under 10 ms with a 99% guarantee? At Rocket Fuel, we do just that.
  2. Digital Library Collection Management using HBase
    • Monday, May 05 2014
    • Category: HBaseCon, Presentation, Video
    This talk covers the value-add that HBase brings to digital collection management.
  3. Data Evolution in HBase
    • Monday, May 05 2014
    • Category: HBaseCon, Presentation, Video
    While the development of this software is often effectively managed through revision control systems, data itself is rarely modeled in a way that affords the same flexibility.
  4. HBase Data Modeling and Access Patterns with Kite SDK
    • Monday, May 05 2014
    • Category: HBaseCon, Presentation Slides
    This talk will focus on Kite's HBase support by covering Kite basics and moving through the specifics of working with HBase as a data source.
  5. Tasmo: Building HBase Applications From Event Streams
    • Monday, May 05 2014
    • Category: HBaseCon, Presentation Slides
    Tasmo is a system that enables application development on top of event streams and HBase.
  6. Design Patterns for Building 360-degree Views with HBase and Kiji
    • Monday, May 05 2014
    • Category: HBaseCon, Presentation Slides
    This talk will introduce the concept of entity-centric storage, discuss what it means, what it enables for businesses, and how to develop an entity-centric system using the open-source Kiji framework and HBase.
  7. Cross-Site BigTable using HBase
    • Monday, May 05 2014
    • Category: HBaseCon, Presentation Slides
    As HBase continues to expand in application and enterprise or government deployments, there is a growing demand for storing data across geographically distributed datacenters for improved availability and disaster recovery.
  8. Large-scale Web Apps @ Pinterest
    • Monday, May 05 2014
    • Category: Video, HBaseCon, Presentation
    This talk briefly describes some of these applications, the underlying schema, and how our HBase setup stays highly available and performant despite billions of requests every week.
  9. Real-time HBase: Lessons from the Cloud
    • Monday, May 05 2014
    • Category: Presentation Slides, HBaseCon
    Running HBase in real time in the cloud provides an interesting and ever-changing set of challenges -- instance types are not ideal, neighbors can degrade your performance, and instances can randomly die in unanticipated ways. This talk will cover what HubSpot has learned about running in production on Amazon EC2, how to handle DR and redundancy, and the tooling the team has found to be the most helpful.
  10. HBase Read High Availability Using Timeline-Consistent Region Replicas
    • Monday, May 05 2014
    • Category: HBaseCon, Video, Presentation
    HBase has ACID semantics within a row, which makes it a strong candidate for many real-time serving workloads. However, single-homing a region to a server implies some periods of unavailability for that region after a server crash. Although the mean time to recovery has improved a lot recently, for some use cases it is still preferable to serve possibly stale reads while the region is recovering. In this talk, you will get an overview of our design and implementation of region replicas in HBase, which provide timeline-consistent reads even when the primary region is unavailable or busy.