We will take you through an overview of Cloudera’s interactive data science four-day workshop. Cloudera’s interactive four-day workshop covers data science concepts and enabling technologies in a hands-on lab setting (utilizing CDSW) where you can apply these techniques toward real-world use cases.
In this course students will learn to:
• Use Cloudera Data Science Workbench (CDSW) to manage and share data science projects and to develop and run Python, R, and Apache Spark programs
• Use the Spark SQL library to read, write, inspect, cleanse, transform, join, aggregate, and explore data stored in the Apache Hadoop ecosystem
• Use the Spark MLlib library to extract, transform, and select features for machine learning algorithms and fit, evaluate, tune, and apply machine learning models