Resource Library

Cloudera offers a variety of materials on big data consolidation, storage and processing. The library includes high-level overviews as well as detailed information on Apache Hadoop and the surrounding ecosystem.

  1. HBase at Xiaomi
    • Monday, May 05 2014
    • Category: HBaseCon, Presentation Slides
    This talk covers the HBase environment at Xiaomi, including thoughts and practices around latency, hardware/OS/VM configuration, GC tuning, the use of a new write thread model and reverse scan, and block index optimization. It will also include some discussion of planned JIRAs based on these approaches.
  2. HBase: Extreme Makeover
    • Monday, May 05 2014
    • Category: HBaseCon, Video, Presentation
    This talk introduces BigBase, a completely new implementation of multilayer caching in HBase. BigBase has a big advantage over HBase 0.94/0.96 because it can use all available server RAM efficiently, and because of a novel implementation of an L3 cache on fast SSDs. The talk will show that different types of caches in BigBase work best for different types of workloads, and that a combination of these caches (L1/L2/L3) increases the overall performance of HBase by a very wide margin.
  3. State of HBase: Meet the Release Managers
    • Monday, May 05 2014
    • Category: HBaseCon, Video, Presentation
    HBase release managers Lars Hofhansl, Andrew Purtell, Enis Soztutar, Michael Stack, and Liyin Tang jointly present highlights from their releases, and take your questions throughout.
  4. HBase at Xiaomi
    • Monday, May 05 2014
    • Category: HBaseCon, Video, Presentation
    This talk covers the HBase environment at Xiaomi, including thoughts and practices around latency, hardware/OS/VM configuration, GC tuning, the use of a new write thread model and reverse scan, and block index optimization. It will also include some discussion of planned JIRAs based on these approaches.
  5. HBase: Where Online Meets Low Latency
    • Monday, May 05 2014
    • Category: HBaseCon, Video, Presentation
    HBase is an online database, so response latency is critical. This talk will examine sources of latency in HBase, detailing steps along the read and write paths. We'll examine the entire request lifecycle, from client to server and back again.
  6. New Security Features in Apache HBase 0.98: An Operator's Guide
    • Monday, May 05 2014
    • Category: HBaseCon, Video, Presentation
    HBase 0.98 introduces several new security features: visibility labels, cell ACLs, transparent encryption, and coprocessor framework changes. This talk will cover the new capabilities available in HBase 0.98+, the threat models and use cases they cover, how these features stack up against other data stores in the Apache big data ecosystem, and how operators and security architects can take advantage of them.
  7. HBase Read High Availability Using Timeline-Consistent Region Replicas
    • Monday, May 05 2014
    • Category: HBaseCon, Video, Presentation
    HBase has ACID semantics within a row, which makes it a strong candidate for many real-time serving workloads. However, single-homing a region to one server means the region is unavailable for some time after a server crash. Although the mean time to recovery has improved a lot recently, for some use cases it is still preferable to serve possibly stale reads while the region is recovering. In this talk, you will get an overview of our design and implementation of region replicas in HBase, which provide timeline-consistent reads even when the primary region is unavailable or busy.
  8. From MongoDB to HBase in Six Easy Months
    • Monday, May 05 2014
    • Category: Presentation Slides, HBaseCon
    Pushing well past MongoDB's limits (2TB data every week) is an interesting exercise in operational frustration. It also severely hampers flexibility of design for new use cases. This talk covers the architectural journey from MongoDB/Redis to HBase at Optimizely -- including the performance, design flexibility, speed of implementation, and other gains made. It also covers the operational setup needed to monitor and maintain the system as well as lessons learned from the migration process itself.
  9. Real-time HBase: Lessons from the Cloud
    • Monday, May 05 2014
    • Category: Presentation Slides, HBaseCon
    Running HBase in real time in the cloud provides an interesting and ever-changing set of challenges -- instance types are not ideal, neighbors can degrade your performance, and instances can randomly die in unanticipated ways. This talk will cover what HubSpot has learned about running in production on Amazon EC2, how to handle DR and redundancy, and the tooling the team has found to be the most helpful.
  10. Tales from the Cloudera Field
    • Monday, May 05 2014
    • Category: Presentation Slides, HBaseCon
    From supporting 0.90.x, 0.92, 0.94, and 0.96 HBase installations on clusters ranging from tens to hundreds of nodes, Cloudera has seen it all. Having automated the upgrade paths from the different Apache releases, we have developed a smooth path that can help the community with upcoming upgrades. In addition to automation best practices, in this talk you'll also learn proactive configuration tweaks and operational best practices to keep your HBase cluster always up and running. We'll also walk through how to contain an application bug let loose in production, how to minimize the impact of faulty hardware on HBase, and how inefficient schema design directly correlates with HBase performance.