Resource Library

Cloudera offers a variety of materials on big data consolidation, storage and processing. The library includes high-level overviews as well as detailed information on Apache Hadoop and the surrounding ecosystem.

  1. /content/cloudera/en/resources/library/recordedwebinar/introduction-to-apache-spark-developer-training-slides/jcr:content/mainContent/resourcecomponent.img.png/1407174301105.png
    Introduction to Apache Spark Developer Training
    • Wednesday, Jul 23 2014
    • Category: Presentation Slides, About Training
    Learn what Apache Spark is and how it compares to Hadoop MapReduce, how to filter, map, reduce, and save Resilient Distributed Datasets (RDDs), who is best suited to attend the course and what prior knowledge you should have, and the benefits of building Spark applications as part of an enterprise data hub.
  2. /content/cloudera/en/resources/library/recordedwebinar/introduction-to-apache-spark-developer-training/jcr:content/mainContent/resourcecomponent.img.png/1407174414484.png
    Introduction to Apache Spark Developer Training
    • Wednesday, Jul 23 2014
    • Category: Video, Recorded Webinars, About Training
    Learn what Apache Spark is and how it compares to Hadoop MapReduce, how to filter, map, reduce, and save Resilient Distributed Datasets (RDDs), who is best suited to attend the course and what prior knowledge you should have, and the benefits of building Spark applications as part of an enterprise data hub.
  3. /content/cloudera/en/resources/library/recordedwebinar/slides-hadoop-security-iii-protecting-data-at-rest-and-in-motion/jcr:content/mainContent/resourcecomponent.img.png/1407174714552.png
    Comprehensive Security for the Enterprise III: Protecting Data at Rest and In Motion
    • Tuesday, Jul 22 2014
    • Category: Data hub, Financial Services, Security, Presentation Slides
    This webinar discusses how you can use Navigator capabilities such as Encrypt and Key Trustee to secure data and enable compliance. Additionally, we will discuss our joint work with Intel on Project Rhino (an initiative to improve data security in Hadoop). We also hear from a security architect at a financial services company that is using encryption and key management to meet financial regulatory requirements.
  4. /content/cloudera/en/resources/library/recordedwebinar/video--hadoop-security-iii---protecting-data-at-rest-and-in-moti/jcr:content/mainContent/resourcecomponent.img.png/1407174732858.png
    Comprehensive Security for the Enterprise III: Protecting Data at Rest and In Motion
    • Tuesday, Jul 22 2014
    • Category: Data hub, Financial Services, Security, Video, Recorded Webinars
    This webinar discusses how you can use Navigator capabilities such as Encrypt and Key Trustee to secure data and enable compliance. Additionally, we will discuss our joint work with Intel on Project Rhino (an initiative to improve data security in Hadoop). We also hear from a security architect at a financial services company that is using encryption and key management to meet financial regulatory requirements.
  5. /content/cloudera/en/resources/library/recordedwebinar/video--kite-sdk--working-with-datasets/jcr:content/mainContent/resourcecomponent.img.png/1407174600679.png
    Kite SDK: Working with Datasets
    • Thursday, Jul 17 2014
    • Category: apache hadoop, Open Source Cloudera, Recorded Webinars, Video
    The Kite SDK is an open source set of libraries, tools, examples, and documentation focused on helping developers build systems on top of the Apache Hadoop ecosystem. Learn (via examples) how Kite makes it easier to work with data in HDFS and Apache HBase as records and datasets, just as you would with a relational database.
  6. /content/cloudera/en/resources/library/recordedwebinar/slides-kite-sdk--working-with-datasets/jcr:content/mainContent/resourcecomponent.img.png/1407174581224.png
    Kite SDK: Working with Datasets
    • Thursday, Jul 17 2014
    • Category: apache hadoop, Open Source Cloudera, Presentation Slides
    The Kite SDK is an open source set of libraries, tools, examples, and documentation focused on helping developers build systems on top of the Apache Hadoop ecosystem. Learn (via examples) how Kite makes it easier to work with data in HDFS and Apache HBase as records and datasets, just as you would with a relational database.
  7. /content/cloudera/en/resources/library/video/capitalizing-on-big-data-opportunities-with-capgemini-and-cloudera/jcr:content/mainContent/resourcecomponent.img.jpg/1405457477011.jpg
    Capitalizing on Big Data Opportunities with Capgemini and Cloudera
    • Monday, Jul 07 2014
    • Category: Video, Cloudera Enterprise
    The Enterprise Data Hub Accelerator helps organizations execute their first Big Data projects quickly and effectively by providing a clear and complete roadmap on how to scale the data platform, governance, and analytics.
  8. /content/cloudera/en/resources/library/recordedwebinar/compliance-ready-hadoop-comprehensive-security-for-the-enterpris/jcr:content/mainContent/resourcecomponent.img.png/1408058341746.png
    Comprehensive Security for the Enterprise I: Compliance Ready Hadoop
    • Thursday, Jun 19 2014
    • Category: Cyber security, Fraud detection, Financial Services, Healthcare & Life Sciences, Recorded Webinars, Video
    Learn how security in Hadoop is quickly changing, and what the key requirements are for taking Hadoop to the next level in your organization.
  9. /content/cloudera/en/resources/library/recordedwebinar/cognitive-computing--technology---infrastructure-driving-situati/jcr:content/mainContent/resourcecomponent.img.png/1408383616635.png
    Cognitive Computing, Technology & Infrastructure
    Driving Situational Awareness throughout the Financial Institution
    • Tuesday, Jun 17 2014
    • Category: Data hub, Financial Services, Video, Recorded Webinars
    Learn current state and future risk and compliance challenges facing the financial services industry, how cognitive computing paired with a scalable, secure and managed infrastructure can close current surveillance gaps, while reducing false positives and revealing real risks, and what is needed to operationalize and manage a bank-wide Hadoop environment.
  10. /content/cloudera/en/resources/library/hbasecon2014/tales-from-the-cloudera-field---operations-session-4/jcr:content/mainContent/resourcecomponent.img.jpg/1405465861861.jpg
    Tales from the Cloudera Field - Operations Session 4
    • Monday, Jun 16 2014
    • Category: Presentation, HBaseCon, Video
    From supporting the 0.90.x, 0.92, 0.94, and 0.96 HBase installations on clusters ranging from tens to hundreds of nodes, Cloudera has seen it all. Having automated the upgrade paths from the different Apache releases, we have developed a smooth path that can help the community with upcoming upgrades. In addition to automation best practices, in this talk you'll also learn proactive configuration tweaks and operational best practices to keep your HBase cluster always up and running. We'll also walk through how to contain an application bug let loose in production, to minimize the impact on HBase posed by faulty hardware, and the direct correlation between inefficient schema design and HBase performance.