NOTE: Although the exam is currently in beta (meaning that there may be some small edits made to the content) the exam allows you to earn your certification if you pass. 

CDP Data Analyst Exam Guide CDP-4001

Audience

This CDP Data Analyst exam tests the required Cloudera skills and knowledge required for data analysts to be successful in their role. The exam tests the use of Cloudera products such as Cloudera Data Visualization, Cloudera Machine Learning, Cloudera Data Science Workbench, Cloudera Data Warehouseas well as SQL, Apache Nifi, Apache Hive and other open source technologies.

Exam Details

  • Exam Number: CDP-4001

  • Number of questions: 50

  • Duration: 120 minutes

  • Pass Score: unpublished

    • We do not publish exam pass scores. Candidates should not be trying to achieve any particular score. Rather they should be aiming for the highest score possible.

  • Delivery: online, proctored

    • Please review the system requirements to enable online, proctored testing through QuestionMark

  • Allowed resources: none. 

    • You may not use reference materials, white papers, user guides or any other resources during your exam.

  • Support: if you need help, please email us

Cloudera Skills & Knowledge Measured

This exam measures the skills and knowledge topics listed in Table 1. below. The weighting of each topic is also listed.

Topic WEIGHT (% of exam)

Use Cloudera Data Visualization Building dashboards to display summary data

7.50%

Use Apache Hive or Apache Impala to combine data sets using unions or joins

27.50%

Use Cloudera Data Science Workbench to create machine learning applications and prototype advanced logic

7.50%

Use Cloudera Machine Learning to create machine learning applications and prototype advanced logic

5.00%
Use Apache Ranger and Atlas to secure database tables 7.50%
Use Apache Hive or Apache Impala to provide SQL access to data to browse existing databases and tables in big data systems 7.50%
Calculate aggregate statistics, such as sums and averages, during a query using Apache Hive or Apache Impala 30.00%
Develop and implement databases (and data collection systems?) using Cloudera Data Warehouse 7.50%

Table 1: Exam topics and weighting

Other Skills & Knowledge

Although, not directly tested on this exam, it is assumed that the candidate is familiar with the technologies listed below to help answer exam questions:

  • Salesforce

  • BI tools (e.g. Tableau, PowerBI, Spotfire, Sisense, Yellowfin, Looker, Palantir, etc.)

  • Google Sheets

  • Microsoft Excel

  • Python/R

Related Training

Although optional, the Cloudera Educational Services courses listed below cover some of the same topics as listed in the table above. Real world, hands-on experience is highly recommended whether you participate in training or not.

  • Data Analyst Training

  • Cloudera DataFlow: Flow Management with Apache NiFi

  • Cloudera Training for Apache Kafka

  • Cloudera Streaming Analytics: Using Apache Flink and SQL Stream Builder on CDP

  • Cloudera Data Engineering: Developing Applications with Apache Spark

  • Spark Application Performance Tuning Workshop

  • Cloudera Essentials for CDP (FREE!)

  • Introduction to Cloudera Machine Learning (FREE!)

  • Introduction to Cloudera Data Warehouse (FREE!)

Your form submission has failed.

This may have been caused by one of the following:

  • Your request timed out
  • A plugin/browser extension blocked the submission. If you have an ad blocking plugin please disable it and close this message to reload the page.