Resource Library

Cloudera offers a variety of materials on big data consolidation, storage and processing. The library includes high-level overviews as well as detailed information on Apache Hadoop and the surrounding ecosystem.

  1. Cloudera Search Webinar: Big Data Search, Bigger Insights
    • Wednesday, Jun 19 2013
    • Category: CDH, Recorded Webinars, Video
    Cloudera Search brings full-text, interactive search and scalable indexing to data in HDFS and Apache HBase. Powered by and adding to Apache Solr, Cloudera Search fully integrates with CDH to bring scale and reliability for next-generation open source search — Big Data search.
  2. HBaseCon 2013 | OpenTSDB at Box
    • Thursday, Jun 13 2013
    • Category: HBaseCon, Video, Presentation
    This video discusses how to leverage data through HBase and why the implementation in your company would be strategic.
  3. HBaseCon 2013 | Evolving a First-Generation Apache HBase Deployment to Second Generation and Beyond
    • Thursday, Jun 13 2013
    • Category: HBaseCon, Video, Presentation
    Explorys has been using HBase and Hadoop since HBase 0.20. Here Doug Meil walks through lessons learned over years of usage from its first HBase implementation through a series of upgrades and changes, including impacts to schema design, data loading, data indexing, data access and analytics, and operational processes.
  4. HBaseCon 2013 | Multi-tenant Apache HBase at Yahoo!
    • Thursday, Jun 13 2013
    • Category: HBaseCon, Video, Presentation
    Sumeet Singh and Francis Liu (Yahoo!) cover traditional use cases for HBase at Yahoo!, as well as the new use cases that have emerged in content management, advertising, log processing, analytics and reporting, recommendation graphs, and dimension data stores.
  5. HBaseCon 2013 | Mixing Low Latency with Analytical Workloads for Customer Experience Management
    • Thursday, Jun 13 2013
    • Category: HBaseCon, Video, Presentation
    Causata’s Neil Ferguson shares the challenges the company overcame in getting HBase to build customer profiles from many millions of unaggregated data points per second, per server, from many TBs of data.
  6. HBaseCon 2013 | Apache HBase, Apache Hadoop, DNA and YOU!
    • Thursday, Jun 13 2013
    • Category: HBaseCon, Video, Presentation
    In this talk from Jeremy Pollack, find out how Ancestry DNA leveraged Hadoop and HBase to implement a scalable cleanroom implementation of the GERMLINE algorithm, resulting in a 1700% performance improvement.
  7. HBaseCon 2013 | ETL for Apache HBase
    • Thursday, Jun 13 2013
    • Category: HBaseCon, Video, Presentation
    Experian sends 700+ million emails daily, which it analyzes in real time to see campaign performance and create new segments. Experian’s ETL framework can source data from various systems, transform it, and persist it in HBase. Experian's Manoj Khanwalkar and Govind Asawa explain the details here.
  8. HBaseCon 2013 | Apache Hadoop and Apache HBase for Real-Time Video Analytics
    • Thursday, Jun 13 2013
    • Category: HBaseCon, Video, Presentation
    LongTail Video recently launched a real-time video analytics service on top of Hadoop and HBase running on Amazon’s AWS cloud. In this talk, LongTail's Suman Srinivasan discusses its architecture, specifically how it uses Hadoop for real-time analytics by processing data in frequent batches, and its experience with HBase for ingesting millions of aggregate data points and providing real-time results.
  9. HBaseCon 2013 | Using Apache HBase for Large Matrices
    • Thursday, Jun 13 2013
    • Category: HBaseCon, Video, Presentation
    Dilisim's Gokhan Capan describes HBase-backed versions of Mahout matrices that allow easy access to and manipulation of matrix elements, support common matrix operations, and feed persistent matrices into existing machine learning algorithms.
  10. HBaseCon 2013 | Project Valta -- A Resource Management Layer over Apache HBase
    • Thursday, Jun 13 2013
    • Category: HBaseCon, Video, Presentation
    Valta is an open-source project that acts as a layer between the user and the HBase API, employing client and server side mechanisms to guard precious resources. Lars George and Andrew Wang of Cloudera present.