Resource Library

Cloudera offers a variety of materials on big data consolidation, storage and processing. The library includes high-level overviews as well as detailed information on Apache Hadoop and the surrounding ecosystem.

  1. /content/cloudera/en/resources/library/hbasecon/hbasecon-2013--compaction-improvements-in-apache-hbase-video/jcr:content/mainContent/resourcecomponent.img.png/1380664968185.png
    HBaseCon 2013 | Compaction Improvements in Apache HBase
    • Thursday, Jun 13 2013
    • Category: HBaseCon, Video, Presentation
    Compactions are a critical aspect of HBase storage design, yet they are frequently a pain point in cluster management, affecting the availability and requiring manual tuning. This talk from Hortonworks' Sergey Shelukhin provides a brief overview of existing HBase compaction algorithm, the problems it encounters in specific data scenarios and in normal operation, as well as improvements recently made to it.
  2. /content/cloudera/en/resources/library/hbasecon/hbasecon-2013--apache-hbase-and-hdfs---understanding-filesystem-usage-in-hbase-video/jcr:content/mainContent/resourcecomponent.img.png/1380664797201.png
    HBaseCon 2013 | Apache HBase and HDFS - Understanding Filesystem Usage in HBase
    • Thursday, Jun 13 2013
    • Category: HBaseCon, Video, Presentation
    In this talk from Hortonworks' Enis Soztutar, Enis takes an HDFS-centric look at the filesystem issues in HBase. He dissects the interface between HBase and HDFS, with a focus on the filesystem services that HBase relies on, durability, crash recovery, and performance characteristics of HBase resulting from using HDFS.
  3. /content/cloudera/en/resources/library/hbasecon/hbasecon-2013--panel---apache-hbase-futures-video/jcr:content/mainContent/resourcecomponent.img.png/1380664653016.png
    HBaseCon 2013 | Panel - Apache HBase Futures
    • Thursday, Jun 13 2013
    • Category: HBaseCon, Video, Presentation
    This panel addresses the new development efforts that are moving the HBase code base into the future — as well as ones on the community’s wish list.
  4. /content/cloudera/en/resources/library/hbasecon/hbasecon-2013--multi-tenant-apache-hbase-at-yahoo-video/jcr:content/mainContent/resourcecomponent.img.png/1380672179570.png
    HBaseCon 2013 | Multi-tenant Apache HBase at Yahoo!
    • Thursday, Jun 13 2013
    • Category: HBaseCon, Video, Presentation
    Sumeet Singh and Francis Liu (Yahoo!) cover traditional use cases for HBase at Yahoo!, and new use cases as a result in content management, advertising, log processing, analytics and reporting, recommendation graphs, and dimension data stores.
  5. /content/cloudera/en/resources/library/hbasecon/hbasecon-2013--apache-hbase-apache-hadoop-dna-and-you-video/jcr:content/mainContent/resourcecomponent.img.png/1380672151153.png
    HBaseCon 2013 | Apache HBase, Apache Hadoop, DNA and YOU!
    • Thursday, Jun 13 2013
    • Category: HBaseCon, Video, Presentation
    In this talk from Jeremy Pollack, find out how Ancestry DNA leveraged Hadoop and HBase to implement a scalable cleanroom implementation of the GERMLINE algorithm, resulting in a 1700% performance improvement.
  6. /content/cloudera/en/resources/library/hbasecon/hbasecon-2013--etl-for-apache-hbase-video/jcr:content/mainContent/resourcecomponent.img.png/1380672105460.png
    HBaseCon 2013 | ETL for Apache HBase
    • Thursday, Jun 13 2013
    • Category: HBaseCon, Video, Presentation
    Experian sends 700+ million emails daily, which it analyzes in real time to see campaign performance and create new segments. Experian’s ETL framework can source data from various systems, transform it, and persist in HBase. Experian's Manoj Khanwalkar and Govind Asawa explain the details here.
  7. /content/cloudera/en/resources/library/hbasecon/hbasecon-2013--apache-hbase-at-pinterest-scaling-our-feed-storage-video/jcr:content/mainContent/resourcecomponent.img.png/1380672039537.png
    HBaseCon 2013 | Apache HBase at Pinterest -- Scaling Our Feed Storage
    • Thursday, Jun 13 2013
    • Category: HBaseCon, Video, Presentation
    At Pinterest, we have been increasingly using HBase for a variety of applications – real-time, interactive, and batch oriented. In this talk, Pinterest's Varun Sharma discusses its experience with architecting and scaling our Feed storage on HBase. “Feeds” are central to user experience at Pinterest and lie on a critical path for user requests.
  8. /content/cloudera/en/resources/library/hbasecon/hbasecon-2013--near-real-time-indexing-for-ebay-search-video/jcr:content/mainContent/resourcecomponent.img.png/1380671992294.png
    HBaseCon 2013 | Near Real Time Indexing for eBay Search
    • Thursday, Jun 13 2013
    • Category: HBaseCon, Video, Presentation
    eBay search powers search on the ebay.com website and is in the critical path of eBay’s user experience and revenue. Sellers and buyer are continuously updating the underlying data ecosystem and the Search system has to process these changes in near real time so that the search results can reflect the updated reality and provide a good user experience. Here Swati Agarwal and Raj Tanneru of eBay talk about eBay’s new search indexing platform and in particular the near real time indexing platform.
  9. /content/cloudera/en/resources/library/hbasecon/hbasecon-2013--being-smarter-than-the-smart-meter-video/jcr:content/mainContent/resourcecomponent.img.png/1380672007550.png
    HBaseCon 2013 | Being Smarter Than the Smart Meter
    • Thursday, Jun 13 2013
    • Category: HBaseCon, Video, Presentation
    Smart Meters and upstream grid sensors are producing a lot of data every day. Harnessing this data for advanced grid analytics is a requirement for the smart utility. As Oracle's Jay Talreja explains, DataRaker, now part of the Oracle Utility Software Suite, was architected on HBase to scale to the largest smart meter deployments in the world.
  10. /content/cloudera/en/resources/library/hbasecon/hbasecon-2013--apache-hadoop-and-apache-hbase-for-real-time-video-analytics-video/jcr:content/mainContent/resourcecomponent.img.png/1380672055505.png
    HBaseCon 2013 | Apache Hadoop and Apache HBase for Real-Time Video Analytics
    • Thursday, Jun 13 2013
    • Category: HBaseCon, Video, Presentation
    LongTail Video recently launched a real-time video analytics service on top of Hadoop and HBase running on Amazon’s AWS cloud. In this talk, LongTail's Suman Srinivasan discusses its architecture, specifically how it uses Hadoop for real-time analytics by processing data in frequent batches, and its experience with HBase for ingesting millions of aggregate data points and providing real-time results.