Resource Library

Cloudera offers a variety of materials on big data consolidation, storage and processing. The library includes high-level overviews as well as detailed information on Apache Hadoop and the surrounding ecosystem.

  1. /content/cloudera/en/resources/library/hbasecon2014/blackbird--storing-billions-of-rows-a-couple-of-milliseconds-awa/jcr:content/mainContent/resourcecomponent.img.jpg/1418343800265.jpg
    Blackbird: Storing Billions of Rows a Couple of Milliseconds Away
    • Monday, May 05 2014
    • Category: HBaseCon, Presentation, Video
    Would you use HBase to make billions of rows available for real-time lookup under 10 ms with 99% guarantee? We, at Rocket Fuel, do just that.
  2. /content/cloudera/en/resources/library/hbasecon2014/bulk-loading-in-the-wild--ingesting-the-world-s-energy-data/jcr:content/mainContent/resourcecomponent.img.jpg/1418343800265.jpg
    Bulk Loading in the Wild: Ingesting the World's Energy Data
    • Monday, May 05 2014
    • Category: CDH, Presentation, Video
    HBase is designed to store your big data and provide low latency random access to that data. One of its most compelling features is Bulk Loading, which enables the generation of HFiles that can then be passed to the RegionServers. Opower's energy insights platform uses it to ingest the hundreds of millions of meter reads it receives daily from its partner utility companies. This presentation will walk you through the HBase Bulk Loading process and Opower's adoption of it as an important piece of its HBase ecosystem.
  3. /content/cloudera/en/resources/library/hbasecon2014/hbase-at-bloomberg--high-availability-needs-for-the-financial-in/jcr:content/mainContent/resourcecomponent.img.jpg/1418343800265.jpg
    HBase at Bloomberg: High Availability Needs for the Financial Industry
    • Monday, May 05 2014
    • Category: HBaseCon, Video, Document
    This talk covers data and analytics use cases at Bloomberg and operational challenges around HA. We'll explore the work currently being done under HBASE-10070, further extensions to it, and how this solution is qualitatively different to how failover is handled by Apache Cassandra.
  4. /content/cloudera/en/resources/library/hbasecon2014/large-scale-web-apps---pinterest/jcr:content/mainContent/resourcecomponent.img.jpg/1418343800265.jpg
    Large-scale Web Apps @ Pinterest
    • Monday, May 05 2014
    • Category: Video, HBaseCon, Presentation
    This talk briefly describes some of these applications, the underlying schema, and how our HBase setup stays highly available and performant despite billions of requests every week.
  5. /content/cloudera/en/resources/library/hbasecon2014/hbase--where-online-meets-low-latency/jcr:content/mainContent/resourcecomponent.img.jpg/1418343800265.jpg
    HBase: Where Online Meets Low Latency
    • Monday, May 05 2014
    • Category: HBaseCon, Video, Presentation
    HBase is an online database so response latency is critical. This talk will examine sources of latency in HBase, detailing steps along the read and write paths. We'll examine the entire request lifecycle, from client to server and back again.
  6. /content/cloudera/en/resources/library/hbasecon2014/a-graph-service-for-global-web-entities-traversal-and-reputation/jcr:content/mainContent/resourcecomponent.img.jpg/1418343800265.jpg
    A Graph Service for Global Web Entities Traversal and Reputation Evaluation Based on HBase
    • Monday, May 05 2014
    • Category: HBaseCon, Presentation, Video
    This presentation covers what problems we try to solve, what and how the design decisions we made, how we design such a graph model, and the graph computation tasks involved.
  7. /content/cloudera/en/resources/library/hbasecon2014/content-identification-using-hbase/jcr:content/mainContent/resourcecomponent.img.jpg/1418343800265.jpg
    Content Identification using HBase
    • Monday, May 05 2014
    • Category: HBaseCon, Presentation, Video
    This presentation will review the options a developer has for HBase querying and retrieval of hash data.
  8. /content/cloudera/en/resources/library/hbasecon2014/real-time-hbase--lessons-from-the-cloud---operations-session-3/jcr:content/mainContent/resourcecomponent.img.jpg/1418343800265.jpg
    Real-time HBase: Lessons from the Cloud - Operations Session 3
    • Monday, Jun 16 2014
    • Category: HBaseCon, Presentation, Video
    Running HBase in real time in the cloud provides an interesting and ever-changing set of challenges -- instance types are not ideal, neighbors can degrade your performance, and instances can randomly die in unanticipated ways. This talk will cover what HubSpot has learned about running in production on Amazon EC2, how to handle DR and redundancy, and the tooling the team has found to be the most helpful.
  9. /content/cloudera/en/resources/library/hbasecon2014/data-evolution-in-hbase/jcr:content/mainContent/resourcecomponent.img.jpg/1418343800265.jpg
    Data Evolution in HBase
    • Monday, May 05 2014
    • Category: HBaseCon, Presentation, Video
    While the development of this software is often effectively managed through revision control systems, data itself is rarely modeled in a way that affords the same flexibility.
  10. /content/cloudera/en/resources/library/hbasecon2014/hbase-design-patterns---yahoo--/jcr:content/mainContent/resourcecomponent.img.jpg/1418343800265.jpg
    HBase Design Patterns @ Yahoo!
    • Monday, May 05 2014
    • Category: Presentation, Video, HBaseCon
    This talk reviews some recurring HBase design patterns at Yahoo! and shares some learnings and experiences.