Resource Library

Cloudera offers a variety of materials on big data consolidation, storage and processing. The library includes high-level overviews as well as detailed information on Apache Hadoop and the surrounding ecosystem.

  1. Taming HBase with Apache Phoenix and SQL
    • Monday, May 05 2014
    • Category: Video, HBaseCon, Presentation
    Come learn about the fundamentals of Apache Phoenix and how it hides the complexities of HBase while giving you optimal performance, and hear about new features from our recent release, including updatable views that share the same physical HBase table and n-way equi-joins through a broadcast hash join mechanism.
  2. /content/cloudera/en/resources/library/hbasecon2014/hbase--where-online-meets-low-latency-ppt/jcr:content/mainContent/resourcecomponent.img.png/1405466236336.png
    HBase: Where Online Meets Low Latency
    • Monday, May 05 2014
    • Category: HBaseCon, Video, Presentation
    HBase is an online database so response latency is critical. This talk will examine sources of latency in HBase, detailing steps along the read and write paths. We'll examine the entire request lifecycle, from client to server and back again.
  3. HBase: Extreme Makeover
    • Monday, May 05 2014
    • Category: HBaseCon, Video, Presentation
    This talks introduces a totally new implementation of a multilayer caching in HBase called BigBase. BigBase has a big advantage over HBase 0.94/0.96 because of an ability to utilize all available server RAM in the most efficient way, and because of a novel implementation of a L3 level cache on fast SSDs. The talk will show that different type of caches in BigBase work best for different type of workloads, and that a combination of these caches (L1/L2/L3) increases the overall performance of HBase by a very wide margin.
  4. State of HBase: Meet the Release Managers
    • Monday, May 05 2014
    • Category: HBaseCon, Video, Presentation
    HBase release managers Lars Hofhansl, Andrew Purtell, Enis Soztutar, Michael Stack, and Liyin Tang jointly present highlights from their releases, and take your questions throughout.
  5. /content/cloudera/en/resources/library/hbasecon2014/state-of-hbase--meet-the-release-managers-ppt/jcr:content/mainContent/resourcecomponent.img.png/1405466258607.png
    State of HBase: Meet the Release Managers
    • Monday, May 05 2014
    • Category: HBaseCon, Presentation Slides
    HBase release managers Lars Hofhansl, Andrew Purtell, Enis Soztutar, Michael Stack, and Liyin Tang jointly present highlights from their releases, and take your questions throughout.
  6. /content/cloudera/en/resources/library/hbasecon2014/real-time-hbase--lessons-from-the-cloud---ppt/jcr:content/mainContent/resourcecomponent.img.png/1405465981881.png
    Real-time HBase: Lessons from the Cloud
    • Monday, May 05 2014
    • Category: Presentation Slides, HBaseCon
    Running HBase in real time in the cloud provides an interesting and ever-changing set of challenges -- instance types are not ideal, neighbors can degrade your performance, and instances can randomly die in unanticipated ways. This talk will cover what HubSpot has learned about running in production on Amazon EC2, how to handle DR and redundancy, and the tooling the team has found to be the most helpful.
  7. /content/cloudera/en/resources/library/hbasecon2014/from-mongodb-to-hbase-in-six-easy-months/jcr:content/mainContent/resourcecomponent.img.png/1405466034032.png
    From MongoDB to HBase in Six Easy Months
    • Monday, May 05 2014
    • Category: Presentation Slides, HBaseCon
    Pushing well past MongoDB's limits (2TB data every week) is an interesting exercise in operational frustration. It also severely hampers flexibility of design for new use cases. This talk covers the architectural journey from MongoDB/Redis to HBase at Optimizely -- including the performance, design flexibility, speed of implementation, and other gains made. It also covers the operational setup needed to monitor and maintain the system as well as lessons learned from the migration process itself.
  8. /content/cloudera/en/resources/library/hbasecon2014/harmonizing-multi-tenant-hbase-clusters-for-managing-workload-diversity/jcr:content/mainContent/resourcecomponent.img.png/1405465936057.png
    Harmonizing Multi-tenant HBase Clusters for Managing Workload Diversity - Operations Session 1
    • Monday, May 05 2014
    • Category: Presentation Slides, HBaseCon
    In early 2013, Yahoo! introduced multi-tenancy to HBase to offer it as a platform service for all Hadoop users. A certain degree of customization per tenant (a user or a project) was achieved through RegionServer groups, namespaces, and customized configs for each tenant. This talk covers how to accommodate diverse needs to individual tenants on the cluster, as well as operational tips and techniques that allow Yahoo! to automate the management of multi-tenant clusters at petabyte scale without errors.
  9. Smooth Operators Panel - Operations Session 7
    • Monday, May 05 2014
    • Category: Video, Presentation, HBaseCon
    Panel discussion with speaker's from Facebook, Pinterest, and Flurry.
  10. HBase: Where Online Meets Low Latency
    • Monday, May 05 2014
    • Category: HBaseCon, Video, Presentation
    HBase is an online database so response latency is critical. This talk will examine sources of latency in HBase, detailing steps along the read and write paths. We'll examine the entire request lifecycle, from client to server and back again.