Resource Library

Cloudera offers a variety of materials on big data consolidation, storage and processing. The library includes high-level overviews as well as detailed information on Apache Hadoop and the surrounding ecosystem.

  1. Best Practices for the Hadoop Data Warehouse: EDW 101 for Hadoop Professionals
    • Thursday, May 29 2014
    • Category: Video, Why Consolidation Data Platform, Data processing ETL offload, Presentation Slides
    Dr. Ralph Kimball and Eli Collins describe standard data warehouse best practices and how to implement them within a Hadoop environment. This includes identifying dimensions and facts, managing primary keys, and handling slowly changing dimensions (SCDs) and conformed dimensions.
  2. Large Scale Machine Learning with Apache Spark
    • Wednesday, May 21 2014
    • Category: Recorded Webinars, Video, CDH, Predictive modeling, Cyber security, Fraud detection
    Spark offers a number of advantages over its predecessor MapReduce that make it ideal for large-scale machine learning. For example, Spark includes MLlib, a library of machine learning algorithms for large data sets. The presentation covers the state of MLlib and the details of some of the scalable algorithms it includes, mainly k-means.
  3. SAS and Cloudera Demo
    • Wednesday, May 07 2014
    • Category: Predictive modeling, Software Vendor (ISV), Video, Product Demos
    Watch this demo of SAS Visual Analytics, in which we explore example data from a potential supermarket that wishes to create a new line of organic products.
  4. SAS® and Cloudera Analytics at Scale and Speed
    • Wednesday, May 07 2014
    • Category: Predictive modeling, Data hub, Business process optimization, Software Vendor (ISV), Video, CDH, Recorded Webinars
    Learn about SAS and Cloudera technical integration, how SAS builds on the enterprise data hub, and SAS In-Memory solutions for Hadoop and machine learning capabilities.
  5. Content Identification using HBase
    • Monday, May 05 2014
    • Category: HBaseCon, Presentation, Video
    This presentation will review the options a developer has for HBase querying and retrieval of hash data.
  6. HBase at Bloomberg: High Availability Needs for the Financial Industry
    • Monday, May 05 2014
    • Category: HBaseCon, Video, Document
    This talk covers data and analytics use cases at Bloomberg and the operational challenges around high availability (HA). We'll explore the work currently being done under HBASE-10070, further extensions to it, and how this solution differs qualitatively from the way Apache Cassandra handles failover.
  7. Blackbird: Storing Billions of Rows a Couple of Milliseconds Away
    • Monday, May 05 2014
    • Category: HBaseCon, Presentation, Video
    Would you use HBase to make billions of rows available for real-time lookup in under 10 ms with a 99% guarantee? At Rocket Fuel, we do just that.
  8. Digital Library Collection Management using HBase
    • Monday, May 05 2014
    • Category: HBaseCon, Presentation, Video
    This talk covers the value that HBase adds to digital collection management.
  9. Data Evolution in HBase
    • Monday, May 05 2014
    • Category: HBaseCon, Presentation, Video
    While software development is often managed effectively through revision control systems, the data itself is rarely modeled in a way that affords the same flexibility.
  10. Large-scale Web Apps @ Pinterest
    • Monday, May 05 2014
    • Category: Video, HBaseCon, Presentation
    This talk briefly describes some of these applications, the underlying schema, and how our HBase setup stays highly available and performant despite billions of requests every week.