Cloudera named a leader in 2022 Gartner® Magic Quadrant™ for Cloud Database Management Systems Get the report
How to deploy interpretable models to business users

Fast Forward Labs Research now available without a subscription

All of our applied machine learning research is now publicly available and free to download from the Cloudera Fast Forward Labs landing page.

Dive in

Navigating the machine learning landscape

Despite its promise, machine learning can be downright daunting. Best efforts can be quickly undermined by uncertainty about a rapidly changing technical landscape, bewilderment on how best to build and organize teams, and difficulty separating hype from reality. 

Cloudera Fast Forward Labs Research focuses on emerging trends that are still changing due to algorithmic breakthrough, hardware breakthrough, technological commoditization, and data availability. These are encapsulated in our reports, blog posts, and applied machine learning prototypes (AMPs), which exhibit the capabilities of ML algorithms while adhering to best practices.

Who are we? 

Cloudera Fast Forward Labs is an applied machine learning research group. Our mission is to empower enterprise data science practitioners to apply emergent academic research to production machine learning use cases in practical and socially responsible ways, while also driving innovation through the Cloudera ecosystem.  Our team brings thoughtful, creative, and diverse perspectives to deeply researched work. In this way, we strive to help organizations make the most of their ML investment as well as educate and inspire the broader machine learning and data science community.

Want to learn more? 

Subscribe to our newsletter for updates on new research, new applied machine learning prototypes, and evolving data science and machine learning developments.

Latest research


Inferring Concept Drift Without Labeled Data

Concept drift occurs when the statistical properties of a target domain change over time causing model performance to degrade. Drift detection is generally achieved by monitoring a performance metric of interest and triggering a retraining pipeline when that metric falls below some designated threshold. However, this approach assumes ample labeled data is available at prediction time - an unrealistic constraint for many production systems. In this report, we explore various approaches for dealing with concept drift when labeled data is not readily accessible.

Read the report

Explore the code

 Inferring Concept Drift Without Labeled Data

Exploring Multi-Objective Hyperparameter Optimization

We develop machine learning models against the “usual suspect” metrics like predictive accuracy, recall, and precision. However, these metrics are rarely truly all we care about. Production models must also satisfy physical requirements such as latency or memory footprint, or fairness constraints. Hyperparameter optimization becomes even more challenging when we have multiple metrics to optimize. Our latest research examines this “multi-objective” hyperparameter optimization scenario in detail.

Read the report

Explore the code

Exploring Multi-Objective Hyperparameter Optimization

The Fast Forward Labs Blog

  • CFFL November Newsletter
    November 2022 Perhaps November conjures thoughts of holiday feasts and festivities, but for us, it’s...
  • Implementing CycleGAN
    Introduction This post documents the first part of a research effort to quantify the impact of synth...
  • CFFL October Newsletter
    October 2022 We’ve got another action-packed newsletter for October! Highlights this month include t...

Keep up with tomorrow

Sign up for our monthly newsletter and get the latest on advances in applied artificial intelligence, as well as company news and events.


Learn to how to build an automated question answering system with deep learning


Learn why and when to use deep learning for anomaly detection.


Learn how to combine causal inference with machine learning

Your form submission has failed.

This may have been caused by one of the following:

  • Your request timed out
  • A plugin/browser extension blocked the submission. If you have an ad blocking plugin please disable it and close this message to reload the page.