This course includes video lectures, assessments, and access to hands-on exercises. It provides an introduction to Machine Learning, including coverage of collaborative filtering, clustering, classification, algorithms, and data volume.

Hands-on Hadoop

Through instructor-led discussion, as well as hands-on exercises, participants will learn topics including:

  • Data types, statistics support, feature extraction, transforming vectors, using the StandardScaler class
  • An overview of dimensionality reduction
  • Machine learning models, regression, linear regression support, and regularization
  • Machine learning with Spark ML: using data frames, transformers and estimators, an introduction to pipelines, using pipelines to generate models, and regularization

Audience and prerequisites

Introduction to Machine Learning has no formal prerequisites, but students must know Python or Scala to follow the material covered.

Please note that this course does not teach big data concepts, nor does it cover how to use Cloudera software. Instead, it is meant as a follow-up to our Developer Training for Spark and Hadoop course.


Spark and Hadoop Developer Training

Scala and Python developers new to Hadoop will learn the key concepts and expertise they need to ingest and process data on a Hadoop cluster using the most up-to-date tools and techniques, including Apache Spark™, Impala, Hive, Flume, and Sqoop.


Cloudera Developer Training was great. I believe Cloudera is the best vendor evangelizing the big data movement and doing a great service in promoting Hadoop in the industry. Thanks for all your help getting me started on this journey.

Cisco Systems


Data Engineer Certification

This course is excellent preparation for the CCP: Data Engineer certification. Although we recommend further training and experience before attempting the exam, this course covers many of the subjects tested in the CCP: Data Engineer exam. CCP: Data Engineer lets you prove your skills with a rigorous hands-on exam and promote them to potential and current employers.

Advance your career

Hadoop developers hold some of the world's most in-demand and highly compensated technical roles. Check out some of the job opportunities currently listed that match this professional profile, many of which seek CCP qualification.

Private training

We also provide private training at your site, at your pace, and tailored to your needs.