Resource Library

Cloudera offers a variety of materials on big data consolidation, storage and processing. The library includes high-level overviews as well as detailed information on Apache Hadoop and the surrounding ecosystem.

  1. /content/cloudera/en/resources/library/recordedwebinar/video--hadoop-security-iii---protecting-data-at-rest-and-in-moti/jcr:content/mainContent/resourcecomponent.img.png/1407174732858.png
    Comprehensive Security for the Enterprise III: Protecting Data at Rest and In Motion
    • Tuesday, Jul 22 2014
    • Category: Data hub, Financial Services, Security, Video, Recorded Webinars
    This webinar discusses how you can use Navigator capabilities such as Encrypt and Key Trustee to secure data and enable compliance. Additionally, we will discuss our joint work with Intel on Project Rhino (an initiative to improve data security in Hadoop). We also hear from a security architect at a financial services company that is using encryption and key management to meet financial regulatory requirements.
  2. /content/cloudera/en/resources/library/recordedwebinar/video--kite-sdk--working-with-datasets/jcr:content/mainContent/resourcecomponent.img.png/1407174600679.png
    Kite SDK: Working with Datasets
    • Thursday, Jul 17 2014
    • Category: apache hadoop, Open Source Cloudera, Recorded Webinars, Video
    The Kite SDK is an open source set of libraries, tools, examples, and documentation focused on helping developers build systems on top of the Apache Hadoop ecosystem. Learn (via examples) how Kite makes it easier to work with data in HDFS and Apache HBase as records and datasets, just as you would with a relational database.
  3. /content/cloudera/en/resources/library/recordedwebinar/video--hadoop-security-ii---guarding-the-perimeter-and-controlli/jcr:content/mainContent/resourcecomponent.img.png/1405383832994.png
    Comprehensive Security for the Enterprise II: Guarding the Perimeter and Controlling Access
    • Thursday, Jul 10 2014
    • Category: Video, Recorded Webinars, Cyber security
    One of the benefits of Hadoop is that it easily allows for multiple entry points both for data flow and user access. Here we discuss how Cloudera allows you to preserve the agility of having multiple entry points while also providing strong, easy to manage authentication. Additionally, we discuss how Cloudera provides unified authorization to easily control access for multiple data processing engines.
  4. /content/cloudera/en/resources/library/recordedwebinar/machine-learning-loves-hadoop/jcr:content/mainContent/resourcecomponent.img.png/1405383779033.png
    Machine Learning Loves Hadoop
    • Tuesday, Jun 24 2014
    • Category: Predictive modeling, Video, Recorded Webinars
    Watch this webinar to learn what machine learning is, why you should use machine learning algorithms, what the common challenges of machine learning are, and how Cloudera’s enterprise data hub supports machine learning.
  5. /content/cloudera/en/resources/library/recordedwebinar/compliance-ready-hadoop-comprehensive-security-for-the-enterpris/jcr:content/mainContent/resourcecomponent.img.png/1408058341746.png
    Comprehensive Security for the Enterprise I: Compliance Ready Hadoop
    • Thursday, Jun 19 2014
    • Category: Cyber security, Fraud detection, Financial Services, Healthcare & Life Sciences, Recorded Webinars, Video
    Learn how security in Hadoop is quickly changing, and what the key requirements are for taking Hadoop to the next level in your organization.
  6. /content/cloudera/en/resources/library/recordedwebinar/cognitive-computing--technology---infrastructure-driving-situati/jcr:content/mainContent/resourcecomponent.img.png/1408383616635.png
    Cognitive Computing, Technology & Infrastructure
    Driving Situational Awareness throughout the Financial Institution
    • Tuesday, Jun 17 2014
    • Category: Data hub, Financial Services, Video, Recorded Webinars
    Learn current state and future risk and compliance challenges facing the financial services industry, how cognitive computing paired with a scalable, secure and managed infrastructure can close current surveillance gaps, while reducing false positives and revealing real risks, and what is needed to operationalize and manage a bank-wide Hadoop environment.
  7. /content/cloudera/en/resources/library/recordedwebinar/intel-and-cloudera--accelerating-enterprise-big-data-success-video/jcr:content/mainContent/resourcecomponent.img.png/1405383703159.png
    Intel and Cloudera: Accelerating Enterprise Big Data Success
    • Thursday, Jun 12 2014
    • Category: Video, Recorded Webinars, Big Data, Data hub
    Learn how Cloudera and Intel are jointly innovating through open source software to enable Hadoop to run best on IA (Intel Architecture) and to foster the evolution of a vibrant Big Data ecosystem.
  8. /content/cloudera/en/resources/library/recordedwebinar/best-practices-for-the-hadoop-data-warehouse-video/jcr:content/mainContent/resourcecomponent.img.png/1405383645562.png
    Best Practices for the Hadoop Data Warehouse: EDW 101 for Hadoop Professionals
    • Thursday, May 29 2014
    • Category: Recorded Webinars, Video, Why Consolidation Data Platform, Data processing ETL offload
    Dr. Ralph Kimball and Eli Collins describe standard data warehouse best practices in Hadoop and how to implement them within a Hadoop environment. This includes identification of dimensions and facts, managing primary keys, and handling slowly changing dimensions (SCDs) and conformed dimensions.
  9. /content/cloudera/en/resources/library/recordedwebinar/large-scale-machine-learning-with-apache-spark/jcr:content/mainContent/resourcecomponent.img.png/1405383605390.png
    Large Scale Machine Learning with Apache Spark
    • Wednesday, May 21 2014
    • Category: Recorded Webinars, Video, CDH, Predictive modeling, Cyber security, Fraud detection
    Spark offers a number of advantages over its predecessor MapReduce that make it ideal for large-scale machine learning. For example, Spark includes MLLib, a library of machine learning algorithms for large data. The presentation will cover the state of MLLib and the details of some of the scalable algorithms it includes, mainly K-means.
  10. /content/cloudera/en/resources/library/recordedwebinar/sas-and-cloudera--analytics-at-scale/jcr:content/mainContent/resourcecomponent.img.png/1405383569146.png
    SAS® and Cloudera Analytics at Scale and Speed
    • Wednesday, May 07 2014
    • Category: Predictive modeling, Data hub, Business process optimization, Software Vendor (ISV), Video, CDH, Recorded Webinars
    Learn about SAS and Cloudera technical integration, how SAS builds on the enterprise data hub, and SAS In-Memory solutions for Hadoop and machine learning capabilities.