Learn how Apache Hadoop addresses the limitations of traditional computing, helps businesses overcome real challenges, and powers new types of big data analytics. This series also introduces the rest of the Apache Hadoop ecosystem and outlines how to prepare the data center and manage Hadoop in production.
Cloudera Manager simplifies deployment, configuration, diagnostics, and reporting for CDH in production. Learn how to set up and customize Cloudera Manager to monitor and improve the performance of any size Hadoop cluster, increase compliance, and reduce costs.
Watch this free, online webinar to learn more about the course’s objectives, outline, prerequisites, and technical benefits, including a portion of Cloudera's full Data Analyst Training. We discuss the fundamentals of Cloudera Impala, Apache Hive, and Apache Pig, how they relate to each other, and for which jobs each is used.
At their core, YARN and MapReduce 2’s improvements separate cluster resource management capabilities from MapReduce-specific logic. YARN enables Hadoop to share resources dynamically between multiple parallel processing frameworks such as Cloudera Impala, allows more sensible and finer-grained resource configuration for better cluster utilization, and scales Hadoop to accommodate more and larger jobs.
This webinar delineates the topics, learning objectives, audience, and prerequisites for Cloudera's live Hadoop Administrator Training. We present two short portions of the actual course, providing an overview of HDFS High Availability and best practice settings for some of Hadoop's more advanced configuration options.
Learn how HBase Training can help you develop your HBase use case, design optimal schemas, and identify, avoid, and resolve performance bottlenecks. In this on-demand webinar, we present two short portions of Cloudera's full HBase Training, providing an overview of accessing data with the HBase API and executing Scan operations with both Java and Python, followed by Q&A.
Learn the new Parcel format for installing and upgrading CDH and other Hadoop ecosystem components. Parcels enable the new rolling upgrade functionality inCloudera Manager, provide rollback functionality, and make maintenance windows short and painless. In this e-learning module, we discuss the benefits of Parcels, compare Parcels and packages, and understand what a Parcel file contains. The module finishes with a complete demonstration of a CDH upgrade and several component installations, including Cloudera Impala and Cloudera Search.