Resource Library

Cloudera offers a variety of materials on big data consolidation, storage and processing. The library includes high-level overviews as well as detailed information on Apache Hadoop and the surrounding ecosystem.

  1. /content/cloudera/en/resources/library/hbasecon/hbasecon-2013-real-time-model-scoring-in-recommender-systems-video/jcr:content/mainContent/resourcecomponent.img.png/1380671943974.png
    HBaseCon 2013 | Real-Time Model Scoring in Recommender Systems
    • Thursday, Jun 13 2013
    • Category: HBaseCon, Video, Presentation
    WibiData's Jon Natkins and Juliet Hougland discuss how developers can use Apache HBase and Kiji to develop low-latency predictive models, using algorithms like clustering or collaborative filtering, and how to leverage those models in the context of a full application.
  2. /content/cloudera/en/resources/library/hbasecon/hbasecon-2013--honeycomb---mysql-backed-by-apache-hbase-video/jcr:content/mainContent/resourcecomponent.img.png/1380671786272.png
    HBaseCon 2013 | Honeycomb - MySQL Backed by Apache HBase
    • Thursday, Jun 13 2013
    • Category: HBaseCon, Video, Presentation
    Dan Burkert of Near Infinity explores the architecture of Honeycomb, its use cases, and dive into how Honeycomb dynamically implements a relational data model on top of HBase that allows for efficient querying.
  3. /content/cloudera/en/resources/library/hbasecon/hbasecon-2013--integration-of-apache-hive-and-hbase-video/jcr:content/mainContent/resourcecomponent.img.png/1380671765363.png
    HBaseCon 2013 | Integration of Apache Hive and HBase
    • Thursday, Jun 13 2013
    • Category: HBaseCon, Video, Presentation
    Understand the current status of using Hive for querying your data stored in HBase. The presentation includes a running example of a web table storing web crawl data in HBase, and Hive queries to that table for analysis.
  4. /content/cloudera/en/resources/library/hbasecon/hbasecon-2013--using-coprocessors-to-index-columns-in-an-elasticsearch-cluster-video/jcr:content/mainContent/resourcecomponent.img.png/1380671857541.png
    HBaseCon 2013 | Using Coprocessors to Index Columns in an Elasticsearch Cluster
    • Thursday, Jun 13 2013
    • Category: HBaseCon, Video, Presentation
    Dibyendu Bhattacharya of HappiestMinds explores the design and challenges HappiestMinds faced while implementing a storage and search infrastructure for a large publisher where books/documents/artifacts related records are stored in Apache HBase.
  5. /content/cloudera/en/resources/library/hbasecon/hbasecon-2013--high-throughput-transactional-stream-processing-on-apache-hbase-video/jcr:content/mainContent/resourcecomponent.img.png/1380671928170.png
    HBaseCon 2013 | High-Throughput, Transactional Stream Processing on Apache HBase
    • Thursday, Jun 13 2013
    • Category: HBaseCon, Video, Presentation
    Continuuity's Andreas Neumann and Alex Baranau discuss transactional stream processing implementation on top of HBase, evaluate performance, scalability and reliability, and share experiences, best practices, and lessons learned.
  6. /content/cloudera/en/resources/library/hbasecon/hbasecon-2013-using-apache-hbase-for-large-matrices-video/jcr:content/mainContent/resourcecomponent.img.png/1380671959672.png
    HBaseCon 2013 | Using Apache HBase for Large Matrices
    • Thursday, Jun 13 2013
    • Category: HBaseCon, Video, Presentation
    Dilisim's Gokhan Capan describes HBase-backed versions of Mahout matrices that allow easy access and manipulation of matrix elements, do common matrix operations, and input persistent matrices to existing machine learning algorithms.
  7. /content/cloudera/en/resources/library/hbasecon/hbasecon-2013-streaming-data-into-apache-hbase-using-apache-flume-experience-with-high-speed-writes/jcr:content/mainContent/resourcecomponent.img.png/1380671911824.png
    HBaseCon 2013 | Streaming Data into Apache HBase using Apache Flume: Experience with High Speed Writes
    • Thursday, Jun 13 2013
    • Category: HBaseCon, Video, Presentation
    Cloudera's Hari Shreedharan discusses lessons learned while using the standard and async API, retrying puts and increments, and fine tuning batches to make sure we get optimum performance with minimal number of duplicates.
  8. /content/cloudera/en/resources/library/hbasecon/hbasecon-2013--apache-hbase-apache-hadoop-dna-and-you-video/jcr:content/mainContent/resourcecomponent.img.png/1380672151153.png
    HBaseCon 2013 | Apache HBase, Apache Hadoop, DNA and YOU!
    • Thursday, Jun 13 2013
    • Category: HBaseCon, Video, Presentation
    In this talk from Jeremy Pollack, find out how Ancestry DNA leveraged Hadoop and HBase to implement a scalable cleanroom implementation of the GERMLINE algorithm, resulting in a 1700% performance improvement.
  9. /content/cloudera/en/resources/library/hbasecon/hbasecon-2013--mixing-low-latency-with-analytical-workloads-for-customer-experience-management-video/jcr:content/mainContent/resourcecomponent.img.png/1380672120620.png
    HBaseCon 2013 | Mixing Low Latency with Analytical Workloads for Customer Experience Management
    • Thursday, Jun 13 2013
    • Category: HBaseCon, Video, Presentation
    Causata’s Neil Ferguson will shares the challenges they overcame getting HBase to build customer profiles from many millions of unaggregated data points per second, per server, from many TBs of data.
  10. /content/cloudera/en/resources/library/hbasecon/hbasecon-2013--multi-tenant-apache-hbase-at-yahoo-video/jcr:content/mainContent/resourcecomponent.img.png/1380672179570.png
    HBaseCon 2013 | Multi-tenant Apache HBase at Yahoo!
    • Thursday, Jun 13 2013
    • Category: HBaseCon, Video, Presentation
    Sumeet Singh and Francis Liu (Yahoo!) cover traditional use cases for HBase at Yahoo!, and new use cases as a result in content management, advertising, log processing, analytics and reporting, recommendation graphs, and dimension data stores.