Resource Library

Cloudera offers a variety of materials on big data consolidation, storage and processing. The library includes high-level overviews as well as detailed information on Apache Hadoop and the surrounding ecosystem.

  1. /content/cloudera/en/resources/library/hbasecon/hbasecon-2013--apache-hadoop-and-apache-hbase-for-real-time-video-analytics-video/jcr:content/mainContent/resourcecomponent.img.png/1380672055505.png
    HBaseCon 2013 | Apache Hadoop and Apache HBase for Real-Time Video Analytics
    • Thursday, Jun 13 2013
    • Category: HBaseCon, Video, Presentation
    LongTail Video recently launched a real-time video analytics service on top of Hadoop and HBase running on Amazon’s AWS cloud. In this talk, LongTail's Suman Srinivasan discusses its architecture, specifically how it uses Hadoop for real-time analytics by processing data in frequent batches, and its experience with HBase for ingesting millions of aggregate data points and providing real-time results.
  2. /content/cloudera/en/resources/library/hbasecon/hbasecon-2013-project-valta-a-resource-management-layer-over-apache-hbase-video/jcr:content/mainContent/resourcecomponent.img.png/1380671976509.png
    HBaseCon 2013 | Project Valta -- A Resource Management Layer over Apache HBase
    • Thursday, Jun 13 2013
    • Category: HBaseCon, Video, Presentation
    Valta is an open-source project that acts as a layer between the user and the HBase API, employing client and server side mechanisms to guard precious resources. Lars George and Andrew Wang of Cloudera present.
  3. /content/cloudera/en/resources/library/hbasecon/hbasecon-2013-deal-personalization-engine-with-hbase--groupon-video/jcr:content/mainContent/resourcecomponent.img.png/1380672023340.png
    HBaseCon 2013 | Deal Personalization Engine with HBase @ Groupon
    • Thursday, Jun 13 2013
    • Category: HBaseCon, Video, Presentation
    As Groupon's Ameya Kantikar explains, HBase now powers most of the backend technology for real time delivery of “deal” experience across all platforms, as well as powers our batch clusters for consolidated user data. We have over 40 billion data points in our HBase clusters.
  4. /content/cloudera/en/resources/library/hbasecon/hbasecon-2013--rebuilding-for-scale-on-apache-hbase-video/jcr:content/mainContent/resourcecomponent.img.png/1380672072907.png
    HBaseCon 2013 | Rebuilding for Scale on Apache HBase
    • Thursday, Jun 13 2013
    • Category: HBaseCon, Video, Presentation
    Simply Measured originally built out its entire data storage platform on MongoDB. Things seemed rosy for a while, but they were crumbling at the edges. Here Rob Roland covers why it chose HBase, how it integrated HBase with the least amount of downtime and impact to our customers, the financial costs of this migration, and where it's going in the future with it.
  5. /content/cloudera/en/resources/library/hbasecon/hbasecon-2013--evolving-a-first-generation-apache-hbase-deployment-to-second-generation-and-beyond-v/jcr:content/mainContent/resourcecomponent.img.png/1380672136397.png
    HBaseCon 2013 | Evolving a First-Generation Apache HBase Deployment to Second Generation and Beyond
    • Thursday, Jun 13 2013
    • Category: HBaseCon, Video, Presentation
    Explorys has been using HBase and Hadoop since HBase 0.20. Here Doug Meil walks through lessons learned over years of usage from its first HBase implementation through a series of upgrades and changes, including impacts to schema design, data loading, data indexing, data access and analytics, and operational processes.
  6. /content/cloudera/en/resources/library/hbasecon/hbasecon-2013--realtime-user-segmentation-using-apache-hbase-architectural-case-study-video/jcr:content/mainContent/resourcecomponent.img.png/1380672088324.png
    HBaseCon 2013 | Realtime User Segmentation using Apache HBase: Architectural Case Study
    • Thursday, Jun 13 2013
    • Category: HBaseCon, Video, Presentation
    RichRelevance's Murtaza Doctor and Giang Nguyen demonstrate not only how the events are captured, but also how they are stored in HBase in real-time.
  7. /content/cloudera/en/resources/library/hbasecon/hbasecon-2013--mixing-low-latency-with-analytical-workloads-for-customer-experience-management-video/jcr:content/mainContent/resourcecomponent.img.png/1380672120620.png
    HBaseCon 2013 | Mixing Low Latency with Analytical Workloads for Customer Experience Management
    • Thursday, Jun 13 2013
    • Category: HBaseCon, Video, Presentation
    Causata’s Neil Ferguson will shares the challenges they overcame getting HBase to build customer profiles from many millions of unaggregated data points per second, per server, from many TBs of data.
  8. /content/cloudera/en/resources/library/hbasecon/hbasecon-2013-streaming-data-into-apache-hbase-using-apache-flume-experience-with-high-speed-writes/jcr:content/mainContent/resourcecomponent.img.png/1380671911824.png
    HBaseCon 2013 | Streaming Data into Apache HBase using Apache Flume: Experience with High Speed Writes
    • Thursday, Jun 13 2013
    • Category: HBaseCon, Video, Presentation
    Cloudera's Hari Shreedharan discusses lessons learned while using the standard and async API, retrying puts and increments, and fine tuning batches to make sure we get optimum performance with minimal number of duplicates.
  9. /content/cloudera/en/resources/library/hbasecon/hbasecon-2013-real-time-model-scoring-in-recommender-systems-video/jcr:content/mainContent/resourcecomponent.img.png/1380671943974.png
    HBaseCon 2013 | Real-Time Model Scoring in Recommender Systems
    • Thursday, Jun 13 2013
    • Category: HBaseCon, Video, Presentation
    WibiData's Jon Natkins and Juliet Hougland discuss how developers can use Apache HBase and Kiji to develop low-latency predictive models, using algorithms like clustering or collaborative filtering, and how to leverage those models in the context of a full application.
  10. /content/cloudera/en/resources/library/hbasecon/hbasecon-2013-using-apache-hbase-for-large-matrices-video/jcr:content/mainContent/resourcecomponent.img.png/1380671959672.png
    HBaseCon 2013 | Using Apache HBase for Large Matrices
    • Thursday, Jun 13 2013
    • Category: HBaseCon, Video, Presentation
    Dilisim's Gokhan Capan describes HBase-backed versions of Mahout matrices that allow easy access and manipulation of matrix elements, do common matrix operations, and input persistent matrices to existing machine learning algorithms.