Resource Library

Cloudera offers a variety of materials on big data consolidation, storage and processing. The library includes high-level overviews as well as detailed information on Apache Hadoop and the surrounding ecosystem.

  1. Tales from the Cloudera Field - Operations Session 4
    • Monday, Jun 16 2014
    • Category: Presentation, HBaseCon, Video
    Having supported HBase 0.90.x, 0.92, 0.94, and 0.96 installations on clusters ranging from tens to hundreds of nodes, Cloudera has seen it all. Having automated the upgrade paths from the different Apache releases, we have developed a smooth path that can help the community with upcoming upgrades. In addition to automation best practices, in this talk you'll also learn proactive configuration tweaks and operational best practices to keep your HBase cluster always up and running. We'll also walk through how to contain an application bug let loose in production, how to minimize the impact of faulty hardware on HBase, and how inefficient schema design directly affects HBase performance.
  2. Real-time HBase: Lessons from the Cloud - Operations Session 3
    • Monday, Jun 16 2014
    • Category: HBaseCon, Presentation, Video
    Running HBase in real time in the cloud provides an interesting and ever-changing set of challenges -- instance types are not ideal, neighbors can degrade your performance, and instances can randomly die in unanticipated ways. This talk will cover what HubSpot has learned about running in production on Amazon EC2, how to handle DR and redundancy, and the tooling the team has found to be the most helpful.
  3. Operations Session 5 - HBase Backups
    • Monday, Jun 16 2014
    • Category: Presentation, Video, HBaseCon
    This talk provides an overview of enterprise-scale backup strategies for HBase. Jesse Yates will describe how Salesforce.com runs backup and recovery on its multi-tenant, enterprise-scale HBase deployments. Demai Ni, Songqinq Ding, and Jing Chen of the IBM InfoSphere BigInsights development team will then describe IBM's recently open-sourced disaster recovery solution based on HBase snapshots and replication.
  4. HBaseCon 2014 | Harmonizing Multi-tenant HBase Clusters for Managing Workload Diversity - Operations Session 1
    • Thursday, Jun 05 2014
    • Category: HBaseCon, Video, Presentation
    In early 2013, Yahoo! introduced multi-tenancy to HBase to offer it as a platform service for all Hadoop users. A certain degree of customization per tenant (a user or a project) was achieved through RegionServer groups, namespaces, and customized configs for each tenant. This talk covers how to accommodate the diverse needs of individual tenants on the cluster, as well as operational tips and techniques that allow Yahoo! to automate the management of multi-tenant clusters at petabyte scale without errors.
  5. Content Identification using HBase
    • Monday, May 05 2014
    • Category: HBaseCon, Presentation, Video
    This presentation will review the options a developer has for HBase querying and retrieval of hash data.
  6. HBase at Bloomberg: High Availability Needs for the Financial Industry
    • Monday, May 05 2014
    • Category: HBaseCon, Video, Document
    This talk covers data and analytics use cases at Bloomberg and operational challenges around HA. We'll explore the work currently being done under HBASE-10070, further extensions to it, and how this solution differs qualitatively from the way Apache Cassandra handles failover.
  7. OpenTSDB 2.0
    • Monday, May 05 2014
    • Category: HBaseCon, Presentation Slides
    The OpenTSDB community continues to grow, with users looking to store massive amounts of time-series data in a scalable manner. In this talk, we will discuss a number of use cases and best practices around naming schemas and HBase configuration. We will also review OpenTSDB 2.0's new features, including the HTTP API, plugins, annotations, millisecond support, and metadata, as well as what's next on the roadmap.
  8. Blackbird: Storing Billions of Rows a Couple of Milliseconds Away
    • Monday, May 05 2014
    • Category: HBaseCon, Presentation, Video
    Would you use HBase to make billions of rows available for real-time lookup in under 10 ms with a 99% guarantee? At Rocket Fuel, we do just that.
  9. Digital Library Collection Management using HBase
    • Monday, May 05 2014
    • Category: HBaseCon, Presentation, Video
    This talk covers the value-add that HBase brings to digital collection management.
  10. Data Evolution in HBase
    • Monday, May 05 2014
    • Category: HBaseCon, Presentation, Video
    While the development of software is often effectively managed through revision control systems, data itself is rarely modeled in a way that affords the same flexibility.