Resource Library

Cloudera offers a variety of materials on big data consolidation, storage and processing. The library includes high-level overviews as well as detailed information on Apache Hadoop and the surrounding ecosystem.

  1. /content/cloudera/en/resources/library/recordedwebinar/compliance-ready-hadoop-comprehensive-security-for-the-enterpris/jcr:content/mainContent/resourcecomponent.img.png/1408058341746.png
    Comprehensive Security for the Enterprise I: Compliance Ready Hadoop
    • Thursday, Jun 19 2014
    • Category: Cyber security, Fraud detection, Financial Services, Healthcare & Life Sciences, Recorded Webinars, Video
    Learn how security in Hadoop is quickly changing, and what the key requirements are for taking Hadoop to the next level in your organization.
  2. /content/cloudera/en/resources/library/recordedwebinar/cognitive-computing--technology---infrastructure-driving-situati/jcr:content/mainContent/resourcecomponent.img.png/1408383616635.png
    Cognitive Computing, Technology & Infrastructure
    Driving Situational Awareness throughout the Financial Institution
    • Tuesday, Jun 17 2014
    • Category: Data hub, Financial Services, Video, Recorded Webinars
    Learn current state and future risk and compliance challenges facing the financial services industry, how cognitive computing paired with a scalable, secure and managed infrastructure can close current surveillance gaps, while reducing false positives and revealing real risks, and what is needed to operationalize and manage a bank-wide Hadoop environment.
  3. /content/cloudera/en/resources/library/hbasecon2014/tales-from-the-cloudera-field---operations-session-4/jcr:content/mainContent/resourcecomponent.img.jpg/1405465861861.jpg
    Tales from the Cloudera Field - Operations Session 4
    • Monday, Jun 16 2014
    • Category: Presentation, HBaseCon, Video
    From supporting the 0.90.x, 0.92, 0.94, and 0.96 HBase installations on clusters ranging from tens to hundreds of nodes, Cloudera has seen it all. Having automated the upgrade paths from the different Apache releases, we have developed a smooth path that can help the community with upcoming upgrades. In addition to automation best practices, in this talk you'll also learn proactive configuration tweaks and operational best practices to keep your HBase cluster always up and running. We'll also walk through how to contain an application bug let loose in production, to minimize the impact on HBase posed by faulty hardware, and the direct correlation between inefficient schema design and HBase performance.
  4. /content/cloudera/en/resources/library/hbasecon2014/real-time-hbase--lessons-from-the-cloud---operations-session-3/jcr:content/mainContent/resourcecomponent.img.jpg/1405465841711.jpg
    Real-time HBase: Lessons from the Cloud - Operations Session 3
    • Monday, Jun 16 2014
    • Category: HBaseCon, Presentation, Video
    Running HBase in real time in the cloud provides an interesting and ever-changing set of challenges -- instance types are not ideal, neighbors can degrade your performance, and instances can randomly die in unanticipated ways. This talk will cover what HubSpot has learned about running in production on Amazon EC2, how to handle DR and redundancy, and the tooling the team has found to be the most helpful.
  5. /content/cloudera/en/resources/library/hbasecon2014/hbase-backups---operations-session-5/jcr:content/mainContent/resourcecomponent.img.jpg/1405465878235.jpg
    Operations Session 5 - HBase Backups
    • Monday, Jun 16 2014
    • Category: Presentation, Video, HBaseCon
    This talk provides an overview of enterprise-scale backup strategies for HBase: Jesse Yates will describe how Salesforce.com runs backup and recovery on its multi-tenant, enterprise scale HBase deploys; Demai Ni, Songqinq Ding, and Jing Chen of the IBM InfoSphere BigInsights development team will then follow with a description of IBM's recently open-sourced disaster/recovery solution based on HBase snapshots and replication.
  6. /content/cloudera/en/resources/library/recordedwebinar/intel-and-cloudera--accelerating-enterprise-big-data-success/jcr:content/mainContent/resourcecomponent.img.png/1407188813596.png
    Intel and Cloudera: Accelerating Enterprise Big Data Success
    • Thursday, Jun 12 2014
    • Category: Data hub, Business process optimization, Big Data, Presentation, Presentation Slides
    Learn how Cloudera and Intel are jointly innovating through open source software to enable Hadoop to run best on IA (Intel Architecture) and to foster the evolution of a vibrant Big Data ecosystem.
  7. /content/cloudera/en/resources/library/hbasecon2014/Harmonizing-Multi-Tenant-HBase-Clusters-for-Managing-Workload-Diversity/jcr:content/mainContent/resourcecomponent.img.jpg/1405465783974.jpg
    HBaseCon 2014 | Harmonizing Multi-tenant HBase Clusters for Managing Workload Diversity -Operations Session 1
    • Thursday, Jun 05 2014
    • Category: HBaseCon, Video, Presentation
    In early 2013, Yahoo! introduced multi-tenancy to HBase to offer it as a platform service for all Hadoop users. A certain degree of customization per tenant (a user or a project) was achieved through RegionServer groups, namespaces, and customized configs for each tenant. This talk covers how to accommodate diverse needs to individual tenants on the cluster, as well as operational tips and techniques that allow Yahoo! to automate the management of multi-tenant clusters at petabyte scale without errors.
  8. /content/cloudera/en/resources/library/video/merkle-delivers-connected-consumer-recognition-with-its-enterpri/jcr:content/mainContent/resourcecomponent.img.png/1405457401118.png
    Merkle Delivers Connected Consumer Recognition with Its Enterprise Data Hub
    • Wednesday, Jun 04 2014
    • Category: Video, Case Studies
    The Cloudera-powered EDH that Merkle deployed at the center of its big data infrastructure in about six months, "is a foundational component for our entire business because data is at the core of our marketing."
  9. /content/cloudera/en/resources/library/analystreport/funny-name--serious-security--cloudera-buys-encryption-vendor-ga/jcr:content/mainContent/resourcecomponent.img.png/1405379635902.png
    451 Report: Funny name, serious security: Cloudera buys encryption vendor Gazzang
    • Tuesday, Jun 03 2014
    • File Type: .PDF
    • Category: Cyber security, Document, Analyst Reports
    Gazzang, a partner of Cloudera since 2012, was acquired as a technology buy.
  10. /content/cloudera/en/resources/library/recordedwebinar/best-practices-for-the-hadoop-data-warehouse-slides/jcr:content/mainContent/resourcecomponent.img.png/1407188576036.png
    Best Practices for the Hadoop Data Warehouse: EDW 101 for Hadoop Professionals
    • Thursday, May 29 2014
    • Category: Video, Why Consolidation Data Platform, Data processing ETL offload, Presentation Slides
    Dr. Ralph Kimball and Eli Collins describe standard data warehouse best practices in Hadoop and how to implement them within a Hadoop environment. This includes identification of dimensions and facts, managing primary keys, and handling slowly changing dimensions (SCDs) and conformed dimensions.