Cloudera Summer School provides the content you need to sharpen your skills so you can manage and secure the data lifecycle in any cloud or data center.
How does it work?
5 topics, 2 weeks, to start your knowledge journey. You can choose as many topics as you like; scroll down to see the list of what we have on offer. Each topic is delivered in weekly installments, with a variety of material such as demos, webinars, blog posts, research, and reports. And don’t worry, we won’t overwhelm you with material!
Get your grades!
It wouldn’t be school without grades. Test your knowledge in the last week of the program with a short exam to assess your improvement and get your online certification in big data proficiency.
Here's what's on offer
Topic | Description |
DataFlow: Collect flows, streams, and analytics for the lifecycle | Easily ingest, route, manage, and deliver data-at-rest and data-in-motion from the edge, any cloud, or the data center to any downstream system with built-in end-to-end security and provenance. CDP Data Hub uses Apache NiFi for flow management and Apache Kafka for streams messaging, both part of Cloudera DataFlow, a comprehensive, real-time streaming data platform that delivers key insights and immediate actionable intelligence. |
Data Engineering: Enrich, refine, structure, and prepare data for the lifecycle | Cloudera Data Engineering helps enrich, transform, and cleanse data, making it easy to create, execute, and manage end-to-end data pipelines. It runs a wide range of data processing workloads at high performance, including batch and real-time stream processing with Apache Spark and Spark Streaming, supported by multiple storage options including Apache HBase, Apache Kudu, and cloud object storage. |
Data Warehouse: Provide self-service access to reporting for the lifecycle | Deliver business insights on massive amounts of verified data to thousands of users at extreme speed and scale without compromising compliance or blowing budgets. Seamlessly and securely moving on-premises workloads to any cloud, Cloudera Data Warehouse outperforms shadow IT by keeping up with evolving business requirements and meeting SLAs with self-service access to reports, dashboards, and advanced analytics. |
Operational DB: Serve all types of data from all sources for the lifecycle | Cloudera Operational Database serves structured data alongside unstructured data within a unified end-to-end open-source platform, ensuring decision making is driven by stream processing and real-time analytics on continuously changing data. Users can serve real-time data at scale, with high concurrency and low latency, and run data science at scale to easily build, score, and deploy machine learning models into production. |
Machine Learning: Operationalize prediction for the lifecycle | Accelerate enterprise data science from research to production at scale with self-service, collaborative workflows for building and operationalizing machine learning models. Using Python, R, and Scala directly in the web browser, Cloudera Machine Learning delivers a powerful self-service experience for data science teams to develop and prototype new machine learning capabilities and easily deploy them to production. |