Resources for Data Analysts
Extend Your Learning Path with E-Learning
Cloudera University's e-learning courses present a deeper dive into the projects, skills, and techniques that aid and complement the core topics covered by the data analyst learning path. These on-demand videos address the concepts required to achieve true expertise. They also include interactive demonstrations and lab instruction so that you can work your way through technical challenges in your own time and at your own pace.
Learn how Apache Hadoop addresses the limitations of traditional computing, helps businesses overcome real challenges, and powers new types of Big Data analytics. This series also introduces the rest of the Apache Hadoop ecosystem and outlines how to prepare the data center and manage Hadoop in production.
Watch this free, online webinar to learn more about the course’s objectives, outline, prerequisites, and technical benefits, including a portion of Cloudera's full Data Analyst Training. We discuss the fundamentals of Cloudera Impala, Apache Hive, and Apache Pig, how they relate to each other, and for which jobs each is used.
Work at the speed of thought! This e-learning course explores Cloudera Impala's features, architecture, and benefits over legacy Hadoop platforms. Learn how to run interactive queries inside Impala and understand how it optimizes data systems. This free online course includes a training module, homework, and an Impala demo VM download to experiment with this powerful new tool.
Learn how to use interactive, full-text search to quickly find relevant data in Hadoop and solve critical business problems simply and in real time. Cloudera Search combines the established, feature-rich, open-source search platform of Apache Solr and its extensible APIs for easy integration with CDH. In this e-learning module, you will learn the fundamentals, use cases, and features of Cloudera Search. The module includes a short discussion of Cloudera Search architecture and a product demonstration.
Pig is an Apache project that uses a scripting language to query and analyze large data sets. With Apache Pig, users can create MapReduce programs without writing Java code. This e-learning module teaches you how to write user-defined functions (UDFs) that can be executed inside of Pig to extend performance and develop a custom library of operations. We discuss what Pig UDFs are, supported functions and languages, and how to write custom UDFs in Java and Python. The module includes a hands-on exercise where you will write your own UDF in Python, complete with a sample solution.
Hive is an Apache project that facilitates ad hoc queries and analyses of large data sets in the Hadoop cluster using a SQL-like language. This e-learning module teaches you how to write user-defined functions (UDFs) to augment Hive's built-in capabilities. We discuss why UDFs are necessary, what kinds of UDFs exist, and how to write custom UDFs in Java. The module includes a hands-on exercise where you will write your own UDF, complete with a sample solution.
In this video, you will learn what data scientists do, how they think about problems, the relationship between data science and Hadoop, and how Cloudera‘s Introduction to Data Science course can help you join this growing and increasingly important profession, followed by Q&A with Cloudera Senior Director of Data Science Josh Wills.
- Webinar: Impala: Real-Time Queries in Hadoop
Cloudera Impala: A Modern SQL Engine for Hadoop
Video: How Cloudera Impala Unlocks New Productivity and Insights
Find out about the technology powering Impala and the use cases driving its adoption from the engineers who developed it.
- Webinar: Big Data Search, Bigger Insights
Go deeper into how Cloudera integrates Apache Solr with CDH to bring scale and reliability for next-generation open-source search.
- Webinar: Enterprise Data Hub
Check out the next big thing driving business value from Big Data.
- Webinar: Insight into the EDH
Learn how the enterprise data hub can transform your business and deliver competitive advantage.
- If you're a developer, administrator, data analyst, HBase specialist, or aspiring data scientist, Cloudera offers training and certification to meet your needs.