Resource Library

Cloudera offers a variety of materials on big data consolidation, storage and processing. The library includes high-level overviews as well as detailed information on Apache Hadoop and the surrounding ecosystem.

  1. HBaseCon 2013 | Using Apache HBase for Large Matrices
    • Thursday, Jun 13 2013
    • Category: HBaseCon, Video, Presentation
    Dilisim's Gokhan Capan describes HBase-backed versions of Mahout matrices that allow easy access to and manipulation of matrix elements, support common matrix operations, and feed persistent matrices into existing machine learning algorithms.
  2. HBaseCon 2013 | Streaming Data into Apache HBase using Apache Flume: Experience with High Speed Writes
    • Thursday, Jun 13 2013
    • Category: HBaseCon, Video, Presentation
    Cloudera's Hari Shreedharan discusses lessons learned while using the standard and async APIs, retrying puts and increments, and fine-tuning batches to get optimum performance with a minimal number of duplicates.
  3. HBaseCon 2013 | Apache HBase, Apache Hadoop, DNA and YOU!
    • Thursday, Jun 13 2013
    • Category: HBaseCon, Video, Presentation
    In this talk, Jeremy Pollack explains how Ancestry DNA used Hadoop and HBase to build a scalable cleanroom implementation of the GERMLINE algorithm, resulting in a 1700% performance improvement.
  4. HBaseCon 2013 | Mixing Low Latency with Analytical Workloads for Customer Experience Management
    • Thursday, Jun 13 2013
    • Category: HBaseCon, Video, Presentation
    Causata’s Neil Ferguson shares the challenges they overcame in getting HBase to build customer profiles from many millions of unaggregated data points per second, per server, from many TBs of data.
  5. HBaseCon 2013 | Multi-tenant Apache HBase at Yahoo!
    • Thursday, Jun 13 2013
    • Category: HBaseCon, Video, Presentation
    Sumeet Singh and Francis Liu (Yahoo!) cover traditional use cases for HBase at Yahoo!, as well as the new use cases that have emerged as a result, in content management, advertising, log processing, analytics and reporting, recommendation graphs, and dimension data stores.
  6. HBaseCon 2013 | Evolving a First-Generation Apache HBase Deployment to Second Generation and Beyond
    • Thursday, Jun 13 2013
    • Category: HBaseCon, Video, Presentation
    Explorys has been using HBase and Hadoop since HBase 0.20. Here Doug Meil walks through lessons learned over years of usage from its first HBase implementation through a series of upgrades and changes, including impacts to schema design, data loading, data indexing, data access and analytics, and operational processes.
  7. HBaseCon 2013 | Apache HBase and HDFS - Understanding Filesystem Usage in HBase
    • Thursday, Jun 13 2013
    • Category: HBaseCon, Video, Presentation
    In this talk, Hortonworks' Enis Soztutar takes an HDFS-centric look at the filesystem issues in HBase. He dissects the interface between HBase and HDFS, with a focus on the filesystem services that HBase relies on, durability, crash recovery, and the performance characteristics of HBase that result from using HDFS.
  8. HBaseCon 2013 | Apache HBase Replication
    • Thursday, Jun 13 2013
    • Category: HBaseCon, Video, Presentation
    Twitter's Chris Trezzo gives a brief user tutorial of replication, looks at a few example use cases, explores the high-level core architecture, and takes a detailed look at implementation.
  9. HBaseCon 2013 | Compaction Improvements in Apache HBase
    • Thursday, Jun 13 2013
    • Category: HBaseCon, Video, Presentation
    Compactions are a critical aspect of HBase storage design, yet they are frequently a pain point in cluster management, affecting availability and requiring manual tuning. This talk from Hortonworks' Sergey Shelukhin provides a brief overview of the existing HBase compaction algorithm, the problems it encounters in specific data scenarios and in normal operation, and the improvements recently made to it.
  10. HBaseCon 2013 | Apache HBase Operations at Pinterest
    • Thursday, Jun 13 2013
    • Category: HBaseCon, Video, Presentation
    This presentation from Pinterest's Jeremy Carroll explains how Pinterest successfully operates HBase on Amazon EC2.