Resource Library

Cloudera offers a variety of materials on big data consolidation, storage and processing. The library includes high-level overviews as well as detailed information on Apache Hadoop and the surrounding ecosystem.

  1. Comprehensive Security for the Enterprise III: Protecting Data at Rest and In Motion
    • Tuesday, Jul 22 2014
    • Category: Data hub, Financial Services, Security, Video, Recorded Webinars
    This webinar discusses how you can use Navigator capabilities such as Encrypt and Key Trustee to secure data and enable compliance. Additionally, we will discuss our joint work with Intel on Project Rhino (an initiative to improve data security in Hadoop). We also hear from a security architect at a financial services company that is using encryption and key management to meet financial regulatory requirements.
  2. Kite SDK: Working with Datasets
    • Thursday, Jul 17 2014
    • Category: Apache Hadoop, Open Source Cloudera, Recorded Webinars, Video
    The Kite SDK is an open source set of libraries, tools, examples, and documentation focused on helping developers build systems on top of the Apache Hadoop ecosystem. Learn (via examples) how Kite makes it easier to work with data in HDFS and Apache HBase as records and datasets, just as you would with a relational database.
  3. Comprehensive Security for the Enterprise II: Guarding the Perimeter and Controlling Access
    • Thursday, Jul 10 2014
    • Category: Video, Recorded Webinars, Cyber security
    One of the benefits of Hadoop is that it readily supports multiple entry points, both for data flow and for user access. Here we discuss how Cloudera lets you preserve the agility of multiple entry points while also providing strong, easy-to-manage authentication. Additionally, we discuss how Cloudera provides unified authorization to control access across multiple data processing engines.
  4. Capitalizing on Big Data Opportunities with Capgemini and Cloudera
    • Monday, Jul 07 2014
    • Category: Video, Cloudera Enterprise
    The Enterprise Data Hub Accelerator helps organizations execute their first Big Data projects quickly and effectively by providing a clear and complete roadmap on how to scale the data platform, governance, and analytics.
  5. Machine Learning Loves Hadoop
    • Tuesday, Jun 24 2014
    • Category: Predictive modeling, Video, Recorded Webinars
    Watch this webinar to learn what machine learning is, why you should use machine learning algorithms, what the common challenges of machine learning are, and how Cloudera’s enterprise data hub supports machine learning.
  6. Comprehensive Security for the Enterprise I: Compliance Ready Hadoop
    • Thursday, Jun 19 2014
    • Category: Cyber security, Fraud detection, Financial Services, Healthcare & Life Sciences, Recorded Webinars, Video
    Learn how security in Hadoop is quickly changing, and what the key requirements are for taking Hadoop to the next level in your organization.
  7. Cognitive Computing, Technology & Infrastructure: Driving Situational Awareness throughout the Financial Institution
    • Tuesday, Jun 17 2014
    • Category: Data hub, Financial Services, Video, Recorded Webinars
    Learn about the current and future risk and compliance challenges facing the financial services industry, how cognitive computing paired with a scalable, secure, and managed infrastructure can close current surveillance gaps while reducing false positives and revealing real risks, and what is needed to operationalize and manage a bank-wide Hadoop environment.
  8. Operations Session 5 - HBase Backups
    • Monday, Jun 16 2014
    • Category: Presentation, Video, HBaseCon
    This talk provides an overview of enterprise-scale backup strategies for HBase. Jesse Yates will describe how Salesforce.com runs backup and recovery on its multi-tenant, enterprise-scale HBase deployments. Demai Ni, Songqinq Ding, and Jing Chen of the IBM InfoSphere BigInsights development team will then follow with a description of IBM's recently open-sourced disaster recovery solution based on HBase snapshots and replication.
  9. Real-time HBase: Lessons from the Cloud - Operations Session 3
    • Monday, Jun 16 2014
    • Category: HBaseCon, Presentation, Video
    Running HBase in real time in the cloud provides an interesting and ever-changing set of challenges -- instance types are not ideal, neighbors can degrade your performance, and instances can randomly die in unanticipated ways. This talk will cover what HubSpot has learned about running in production on Amazon EC2, how to handle DR and redundancy, and the tooling the team has found to be the most helpful.
  10. Tales from the Cloudera Field - Operations Session 4
    • Monday, Jun 16 2014
    • Category: Presentation, HBaseCon, Video
    Having supported HBase 0.90.x, 0.92, 0.94, and 0.96 installations on clusters ranging from tens to hundreds of nodes, Cloudera has seen it all. Having automated the upgrade paths from the different Apache releases, we have developed a smooth path that can help the community with upcoming upgrades. In addition to automation best practices, in this talk you'll also learn proactive configuration tweaks and operational best practices to keep your HBase cluster always up and running. We'll also walk through how to contain an application bug let loose in production, how to minimize the impact of faulty hardware on HBase, and the direct correlation between inefficient schema design and HBase performance.