Resource Library

Cloudera offers a variety of materials on big data consolidation, storage and processing. The library includes high-level overviews as well as detailed information on Apache Hadoop and the surrounding ecosystem.

  1. HBase at Xiaomi
    • Monday, May 05 2014
    • Category: HBaseCon, Video, Presentation
    This talk covers the HBase environment at Xiaomi, including thoughts and practices around latency, hardware/OS/VM configuration, GC tuning, the use of a new write thread model and reverse scan, and block index optimization. It will also include some discussion of planned JIRAs based on these approaches.
  2. HBase Read High Availability Using Timeline-Consistent Region Replicas
    • Monday, May 05 2014
    • Category: HBaseCon, Video, Presentation
    HBase has ACID semantics within a row that make it a perfect candidate for a lot of real-time serving workloads. However, single homing a region to a server implies some periods of unavailability for the regions after a server crash. Although the mean time to recovery has improved a lot recently, for some use cases, it is still preferable to do possibly stale reads while the region is recovering. In this talk, you will get an overview of our design and implementation of region replicas in HBase, which provide timeline-consistent reads even when the primary region is unavailable or busy.
  3. /content/cloudera/en/resources/library/hbasecon2014/hbase--extreme-makeover-ppt/jcr:content/mainContent/resourcecomponent.img.png/1405466278104.png
    HBase: Extreme Makeover
    • Monday, May 05 2014
    • Category: HBaseCon, Presentation Slides
    This talks introduces a totally new implementation of a multilayer caching in HBase called BigBase. BigBase has a big advantage over HBase 0.94/0.96 because of an ability to utilize all available server RAM in the most efficient way, and because of a novel implementation of a L3 level cache on fast SSDs. The talk will show that different type of caches in BigBase work best for different type of workloads, and that a combination of these caches (L1/L2/L3) increases the overall performance of HBase by a very wide margin.
  4. /content/cloudera/en/resources/library/hbasecon2014/taming-hbase-with-apache-phoenix-and-sql-ppt/jcr:content/mainContent/resourcecomponent.img.png/1405466412113.png
    Taming HBase with Apache Phoenix and SQL
    • Monday, May 05 2014
    • Category: HBaseCon, Presentation Slides
    Come learn about the fundamentals of Phoenix and how it hides the complexities of HBase while giving you optimal performance, and hear about new features from our recent release, including updatable views that share the same physical HBase table and n-way equi-joins through a broadcast hash join mechanism.
  5. /content/cloudera/en/resources/library/hbasecon2014/tasmo--building-hbase-applications-from-event-streams-ppt/jcr:content/mainContent/resourcecomponent.img.png/1405466429343.png
    Tasmo: Building HBase Applications From Event Streams
    • Monday, May 05 2014
    • Category: HBaseCon, Presentation Slides
    Tasmo is a system that enables application development on top of event streams and HBase.
  6. Design Patterns for Building 360-degree Views with HBase and Kiji
    • Monday, May 05 2014
    • Category: HBaseCon, Video, Presentation
    This talk will introduce the concept of entity-centric storage, discuss what it means, what it enables for businesses, and how to develop an entity-centric system using the open-source Kiji framework and HBase.
  7. OpenTSDB 2.0
    • Monday, May 05 2014
    • Category: HBaseCon, Video, Presentation
    The OpenTSDB community continues to grow and with users looking to store massive amounts of time-series data in a scalable manner. In this talk, we will discuss a number of use cases and best practices around naming schemas and HBase configuration. We will also review OpenTSDB 2.0's new features, including the HTTP API, plugins, annotations, millisecond support, and metadata, as well as what's next in the roadmap.
  8. HBase Data Modeling and Access Patterns with Kite SDK
    • Monday, May 05 2014
    • Category: HBaseCon, Video, Presentation
    This talk will focus on Kite's HBase support by covering Kite basics and moving through the specifics of working with HBase as a data source.
  9. /content/cloudera/en/resources/library/hbasecon2014/design-patterns-for-building-360-degree-views-with-hbase-and-kij-ppt/jcr:content/mainContent/resourcecomponent.img.png/1405466510068.png
    Design Patterns for Building 360-degree Views with HBase and Kiji
    • Monday, May 05 2014
    • Category: HBaseCon, Presentation Slides
    This talk will introduce the concept of entity-centric storage, discuss what it means, what it enables for businesses, and how to develop an entity-centric system using the open-source Kiji framework and HBase.
  10. /content/cloudera/en/resources/library/hbasecon2014/cross-site-bigtable-using-hbase-ppt/jcr:content/mainContent/resourcecomponent.img.png/1405466484369.png
    Cross-Site BigTable using HBase
    • Monday, May 05 2014
    • Category: HBaseCon, Presentation Slides
    As HBase continues to expand in application and enterprise or government deployments, there is a growing demand for storing data across geographically distributed datacenters for improved availability and disaster recovery.