Resource Library

Cloudera offers a variety of materials on big data consolidation, storage and processing. The library includes high-level overviews as well as detailed information on Apache Hadoop and the surrounding ecosystem.

  1. Big Data Retail Therapy: How big data tempts customers to buy more
    • Thursday, Jul 24 2014
    • Category: Data hub, Retail, eCommerce & Consumer Products, Ad offer targeting, Video, Recorded Webinars, Case Studies
    Learn how to tap consumer data for your business analysts, data scientists, and marketers, to tailor offers and customize promotions that build customer loyalty. Gain real-world insights from Pentaho, Cloudera, and edo Interactive, a big data trailblazer.
  2. Introduction to Apache Spark Developer Training
    • Wednesday, Jul 23 2014
    • Category: Video, Recorded Webinars, About Training
    Learn what Apache Spark is and how it compares to Hadoop MapReduce; how to filter, map, reduce, and save Resilient Distributed Datasets (RDDs); who is best suited to attend the course and what prior knowledge you should have; and the benefits of building Spark applications as part of an enterprise data hub.
  3. Comprehensive Security for the Enterprise III: Protecting Data at Rest and In Motion
    • Tuesday, Jul 22 2014
    • Category: Data hub, Financial Services, Security, Video, Recorded Webinars
    This webinar discusses how you can use Navigator capabilities such as Encrypt and Key Trustee to secure data and enable compliance. It also covers our joint work with Intel on Project Rhino, an initiative to improve data security in Hadoop, and features a security architect at a financial services company that uses encryption and key management to meet financial regulatory requirements.
  4. Kite SDK: Working with Datasets
    • Thursday, Jul 17 2014
    • Category: Apache Hadoop, Open Source Cloudera, Recorded Webinars, Video
    The Kite SDK is an open source set of libraries, tools, examples, and documentation focused on helping developers build systems on top of the Apache Hadoop ecosystem. Learn (via examples) how Kite makes it easier to work with data in HDFS and Apache HBase as records and datasets, just as you would with a relational database.
  5. Comprehensive Security for the Enterprise II: Guarding the Perimeter and Controlling Access
    • Thursday, Jul 10 2014
    • Category: Video, Recorded Webinars, Cyber security
    One of the benefits of Hadoop is that it easily allows for multiple entry points, both for data flow and for user access. Here we discuss how Cloudera lets you preserve the agility of multiple entry points while also providing strong, easy-to-manage authentication. We also discuss how Cloudera provides unified authorization to control access across multiple data processing engines.
  6. Capitalizing on Big Data Opportunities with Capgemini and Cloudera
    • Monday, Jul 07 2014
    • Category: Video, Cloudera Enterprise
    The Enterprise Data Hub Accelerator helps organizations execute their first Big Data projects quickly and effectively by providing a clear and complete roadmap on how to scale the data platform, governance, and analytics.
  7. Machine Learning Loves Hadoop
    • Tuesday, Jun 24 2014
    • Category: Predictive modeling, Video, Recorded Webinars
    Watch this webinar to learn what machine learning is, why you should use machine learning algorithms, what the common challenges of machine learning are, and how Cloudera’s enterprise data hub supports machine learning.
  8. Comprehensive Security for the Enterprise I: Compliance Ready Hadoop
    • Thursday, Jun 19 2014
    • Category: Cyber security, Fraud detection, Financial Services, Healthcare & Life Sciences, Recorded Webinars, Video
    Learn how security in Hadoop is quickly changing, and what the key requirements are for taking Hadoop to the next level in your organization.
  9. Cognitive Computing, Technology & Infrastructure: Driving Situational Awareness throughout the Financial Institution
    • Tuesday, Jun 17 2014
    • Category: Data hub, Financial Services, Video, Recorded Webinars
    Learn about the current and future risk and compliance challenges facing the financial services industry; how cognitive computing, paired with a scalable, secure, and managed infrastructure, can close current surveillance gaps while reducing false positives and revealing real risks; and what is needed to operationalize and manage a bank-wide Hadoop environment.
  10. Operations Session 5 - HBase Backups
    • Monday, Jun 16 2014
    • Category: Presentation, Video, HBaseCon
    This talk provides an overview of enterprise-scale backup strategies for HBase. Jesse Yates describes how Salesforce.com runs backup and recovery on its multi-tenant, enterprise-scale HBase deployments. Demai Ni, Songqinq Ding, and Jing Chen of the IBM InfoSphere BigInsights development team then describe IBM's recently open-sourced disaster recovery solution based on HBase snapshots and replication.