Resource Library

Cloudera offers a variety of materials on big data consolidation, storage and processing. The library includes high-level overviews as well as detailed information on Apache Hadoop and the surrounding ecosystem.

  1. HBase Read High Availability Using Timeline-Consistent Region Replicas
    • Monday, May 05 2014
    • Category: HBaseCon, Video, Presentation
    HBase has ACID semantics within a row that make it a perfect candidate for a lot of real-time serving workloads. However, single-homing a region to a server implies some periods of unavailability for that region after a server crash. Although the mean time to recovery has improved a lot recently, for some use cases it is still preferable to serve possibly stale reads while the region is recovering. In this talk, you will get an overview of our design and implementation of region replicas in HBase, which provide timeline-consistent reads even when the primary region is unavailable or busy.
  2. Design Patterns for Building 360-degree Views with HBase and Kiji
    • Monday, May 05 2014
    • Category: HBaseCon, Presentation Slides
    This talk will introduce the concept of entity-centric storage, discuss what it means, what it enables for businesses, and how to develop an entity-centric system using the open-source Kiji framework and HBase.
  3. OpenTSDB 2.0
    • Monday, May 05 2014
    • Category: HBaseCon, Video, Presentation
    The OpenTSDB community continues to grow, with users looking to store massive amounts of time-series data in a scalable manner. In this talk, we will discuss a number of use cases and best practices around naming schemas and HBase configuration. We will also review OpenTSDB 2.0's new features, including the HTTP API, plugins, annotations, millisecond support, and metadata, as well as what's next in the roadmap.
  4. New Security Features in Apache HBase 0.98: An Operator's Guide
    • Monday, May 05 2014
    • Category: HBaseCon, Presentation Slides
    HBase 0.98 introduces several new security features: visibility labels, cell ACLs, transparent encryption, and coprocessor framework changes. This talk will cover the new capabilities available in HBase 0.98+, the threat models and use cases they cover, how these features stack up against other data stores in the Apache big data ecosystem, and how operators and security architects can take advantage of them.
  5. Operations Session 5 - HBase Backups
    • Monday, Jun 16 2014
    • Category: Presentation, Video, HBaseCon
    This talk provides an overview of enterprise-scale backup strategies for HBase. Jesse Yates will describe how Salesforce.com runs backup and recovery on its multi-tenant, enterprise-scale HBase deployments. Demai Ni, Songqinq Ding, and Jing Chen of the IBM InfoSphere BigInsights development team will then follow with a description of IBM's recently open-sourced disaster/recovery solution based on HBase snapshots and replication.
  6. Content Identification using HBase
    • Monday, May 05 2014
    • Category: HBaseCon, Presentation Slides
    This presentation will review the options a developer has for HBase querying and retrieval of hash data.
  7. State of HBase: Meet the Release Managers
    • Monday, May 05 2014
    • Category: HBaseCon, Video, Presentation
    HBase release managers Lars Hofhansl, Andrew Purtell, Enis Soztutar, Michael Stack, and Liyin Tang jointly present highlights from their releases, and take your questions throughout.
  8. A Graph Service for Global Web Entities Traversal and Reputation Evaluation Based on HBase
    • Monday, May 05 2014
    • Category: HBaseCon, Presentation Slides
    This presentation covers the problems we set out to solve, the design decisions we made and how we made them, how we designed the graph model, and the graph computation tasks involved.
  9. Tasmo: Building HBase Applications From Event Streams
    • Monday, May 05 2014
    • Category: HBaseCon, Video, Presentation
    Tasmo is a system that enables application development on top of event streams and HBase.
  10. Tales from the Cloudera Field - Operations Session 4
    • Monday, Jun 16 2014
    • Category: Presentation, HBaseCon, Video
    From supporting 0.90.x, 0.92, 0.94, and 0.96 HBase installations on clusters ranging from tens to hundreds of nodes, Cloudera has seen it all. Having automated the upgrade paths from the different Apache releases, we have developed a smooth path that can help the community with upcoming upgrades. In addition to automation best practices, in this talk you'll also learn proactive configuration tweaks and operational best practices to keep your HBase cluster up and running. We'll also walk through how to contain an application bug let loose in production, how to minimize the impact of faulty hardware on HBase, and the direct correlation between inefficient schema design and HBase performance.