Resource Library

Cloudera offers a variety of materials on big data consolidation, storage and processing. The library includes high-level overviews as well as detailed information on Apache Hadoop and the surrounding ecosystem.

  1. SAS® and Cloudera Analytics at Scale and Speed
    • Wednesday, May 07 2014
    • Category: Predictive modeling, Data hub, Business process optimization, Software Vendor (ISV), Video, CDH, Recorded Webinars
    Learn about SAS and Cloudera technical integration, how SAS builds on the enterprise data hub, and SAS In-Memory solutions for Hadoop and machine learning capabilities.
  2. The State of HBase Replication - Operations Session 2
    • Monday, May 05 2014
    • Category: Video, HBaseCon, Recorded Webinars
    HBase Replication has come a long way since its inception in HBase 0.89 almost four years ago. Today, master-master and cyclic replication setups are supported; many bug fixes and new features like log compression, per-family peers configuration, and throttling have been added; and a major refactoring has been done. This presentation will recap the work done during the past four years, present a few use cases that are currently in production, and take a look at the roadmap.
  3. Govern This! Data Discovery and the Application of Data Governance with New Stack Technologies
    • Thursday, May 01 2014
    • Category: Analytics & Business Intelligence, Business process optimization, Video, Recorded Webinars, CDH, Cloudera Enterprise
    Tableau joins us to share and demo how to apply governance to the discovery layer in an enterprise data hub while still meeting the speed and agility requirements of the business user. We also provide a Cloudera Navigator demo along with the Tableau demo.
  4. Introduction to Designing and Building Big Data Applications
    • Thursday, Apr 24 2014
    • Category: Recorded Webinars, Video, About Training
    Learn what the course covers, from capturing data to building a search interface; the spectrum of processing engines, Apache projects, and ecosystem tools available for converged analytics; who is best suited to attend the course and what prior knowledge you should have; and the benefits of building applications with an enterprise data hub.
  5. GovLoop Training: The Foundation for Data Innovation – The Enterprise Data Hub
    • Tuesday, Apr 15 2014
    • Category: Government, Counter-terrorism, Fraud detection, Video, Recorded Webinars
    Government and industry experts share how an EDH is a scalable solution that provides the right tools for effective outcomes; cuts waste, fraud, and abuse; creates a better, safer platform for collaboration; and provides methods to help accelerate your time-to-value.
  6. Get Hired as a Certified Data Scientist
    • Thursday, Apr 10 2014
    • Category: Predictive modeling, Video, Recorded Webinars, About Training
    Learn the details of the new Cloudera Data Science Challenge on healthcare claim anomalies from Sean Owen, the inventor of Myrrix and founder of the Oryx project; how to prepare for the challenge and the resources available to you; what types of insights drive business value from advanced Big Data analytics; and the goals that have been achieved by Cloudera Certified Professional: Data Scientists.
  7. Building a Hadoop Data Warehouse with Impala
    • Wednesday, Apr 09 2014
    • Category: Data warehousing offload, Data hub, Video, CDH, Recorded Webinars
    Explore how Impala's architecture supports query speed over Hadoop data that convincingly exceeds not only that of Hive but also that of a proprietary analytic DBMS over its own native columnar format; understand the current state of, and roadmap for, Impala's analytic SQL functionality; and see an example configuration and benchmark suite that demonstrate how Impala offers a high level of performance, functionality, and ability to handle a multi-user workload while retaining Hadoop’s traditional strengths of flexibility and ease of scaling.
  8. Building a Hadoop Data Warehouse: Hadoop 101 for EDW Professionals
    • Wednesday, Apr 02 2014
    • Category: Data processing ETL offload, Data warehousing offload, Video, Recorded Webinars
    Dr. Ralph Kimball explains how Hadoop can be both a destination data warehouse and an efficient staging and ETL source for an existing data warehouse. Learn how enterprise conformed dimensions can be used as the basis for integrating Hadoop and conventional data warehouses.
  9. The Benefits of Predictive and Proactive Support for an Enterprise Data Hub
    • Thursday, Mar 27 2014
    • Category: Recorded Webinars, About Cloudera, Video, Presentation, Cloudera Professional Support, Hadoop Services
    Learn how Cloudera helps eliminate known issues and avoid common cluster misconfigurations, guides better utilization of the EDH based on comparative analysis, and ensures that enterprises optimize support resources for faster issue resolution.
  10. Design Patterns for Large-Scale Real-Time Learning
    • Wednesday, Mar 19 2014
    • Category: Recorded Webinars, Video
    Building a production-ready, large-scale operational analytics system remains a difficult and ad hoc endeavor, especially when real-time answers are required. Design patterns for effective implementations are emerging: they take advantage of relaxed assumptions, adopt a new tiered "lambda" architecture, and pick the right scale-friendly algorithms. Drawing on experience from customer problems, this session presents a reference architecture and algorithm design choices for a successful implementation.