Cloudera named a leader in 2022 Gartner® Magic Quadrant™ for Cloud Database Management Systems Get the report

 Collaborative, Self-Service Environment for Secure Data Exploration, Visualization, and Modeling Brings Data Scientists, Analysts, and Business Teams Together

STRATA+HADOOP WORLD SAN JOSE, Calif., March 14, 2017 – Cloudera, the provider of the leading global platform for machine learning and advanced analytics built on the latest open source technologies, today unveiled Cloudera Data Science Workbench, a new self-service environment for data science on Cloudera Enterprise which is currently in beta. Based on the company's acquisition of data science startup last year, Data Science Workbench allows data scientists to use their favorite open source languages -- including R, Python, and Scala -- and libraries on a secure enterprise platform with native Apache Spark and Apache Hadoop integration, to accelerate analytics projects from exploration to production.

“Cloudera is focused on improving the user experience for data science and engineering teams, in particular those who want to scale their analytics using Spark for data processing and machine learning,” said Charles Zedlewski, senior vice president, Products at Cloudera. “The acquisition of and its team provided a strong foundation, and Data Science Workbench now puts self-service data science at scale within reach for our customers.”

Cloudera Data Science Workbench’s benefits include:

For data scientists -

  • Use R, Python, or Scala with your favorite libraries and frameworks, directly from a web browser.
  • Directly access data in secure Hadoop clusters with Spark and Impala.
  • Share insights with your whole team for reproducible, collaborative research.

For IT professionals -

  • Give your data science team the freedom to work how they want, when they want.
  • Stay compliant with out-of-the-box support for full Hadoop security, especially Kerberos.
  • Run on-premises or in the cloud, wherever you manage your data.

Beyond the extensive Python and R ecosystems, as open data science expands to include deep learning frameworks like Tensorflow, Microsoft Cognitive Toolkit, MXnet, BigDL, and more, data science teams are looking for ways to bring these tools to their data, which is increasingly stored in Hadoop environments Cloudera Data Science Workbench delivers a safe and secure environment to combine the latest open source innovations with the unified platform Cloudera customers trust.

“By providing ready access to data, Cloudera Data Science Workbench decreases time to value of AI applications delivered with the DataRobot automated machine learning platform,” said Jeremy Achin, DataRobot CEO and co-founder. “DataRobot is fully integrated which allows Cloudera users to increase business value from the world's best algorithms and data science techniques through an easy to use interface.”

“Our customers’ IT groups often struggle to onboard data scientists to shared environments because their needs are so diverse, especially where open source tools are involved. The result is usually duplication, analytic silos, and limited security and governance. Meanwhile, data scientists are constantly looking to scale their work to larger datasets and more powerful compute platforms,” continued Zedlewski. “With Data Science Workbench, Cloudera is helping IT groups and data scientists  work together, bringing more users to shared environments in a way that delivers both flexibility and compliance.”

Meet at Strata + Hadoop World San Jose

To learn more about Data Science Workbench and Cloudera’s overall data science and machine learning strategy, visit our booth 809 on the show floor at Strata+Hadoop World San Jose.

To learn about the common problems to look out for when scaling data science on Hadoop, attend Cloudera’s session entitled, ‘Making Self-service Data Science a Reality’ with Matt Brandwein (director, Products, Cloudera), Tristan Zajonc (senior engineering manager, Cloudera) from 1:50pm–2:30pm on Thursday, March 16, 2017 in 210 A/E at Strata+Hadoop World San Jose.

About Cloudera

Cloudera delivers the modern data management and analytics platform built on Apache Hadoop and the latest open source technologies. The world’s leading organizations trust Cloudera to help solve their most challenging business problems with Cloudera Enterprise, the fastest, easiest and most secure data platform available for the modern world. Our customers efficiently capture, store, process and analyze vast amounts of data, empowering them to use advanced analytics to drive business decisions quickly, flexibly and at lower cost than has been possible before. To ensure our customers are successful, we offer comprehensive support, training and professional services. Learn more at

Connect with Cloudera

About Cloudera:
Read our blog: 
Follow us on Twitter:
Visit us on Facebook:
Join the Cloudera Community:


Cloudera, Hue and associated marks and trademarks or registered trademarks of Cloudera Inc. All other company and product names may be trademarks of their respective owners.

Press Contact

Deborah Wiltshire


+1 (650) 644-3900


Your form submission has failed.

This may have been caused by one of the following:

  • Your request timed out
  • A plugin/browser extension blocked the submission. If you have an ad blocking plugin please disable it and close this message to reload the page.