Cloudera named a leader in 2022 Gartner® Magic Quadrant™ for Cloud Database Management Systems Get the report

Why Cloudera + Pentaho?

With Pentaho and Cloudera, organizations get a comprehensive view of data across the enterprise. Optimized for Cloudera Enterprise, Pentaho's Big Data Integration and Analytics platform gives organizations a competitive edge in a data-driven enterprise. Pentaho and Cloudera continuously collaborate to address the big data market and customer challenges use cases. The close partnership and an open platform enable Pentaho to optimize performance and access to Cloudera Enterprise featuring integrations with Impala, YARN, and Spark.

Pentaho data integration prepares and blends data to create a complete picture of your business that drives actionable insights. The platform delivers accurate, “analytics ready” data to end users from any source. With visual tools to eliminate coding and complexity, Pentaho puts big data and all data sources at the fingertips of business and IT users.

Pentaho and Cloudera share a common history and approach to simplifying complex, but powerful technologies to integrate and analyze big data. Our common open source heritage means that we can innovate at the speed of our customers businesses.

Eddie White, EVP Business Development, Pentaho

Joint Solution Overview

With Cloudera Enterprise, leading organizations are changing the way they think—transforming data from an expense to an asset. Pentaho’s comprehensive analytics platform, data integration, and a spectrum of data visualization and analysis capabilities can bring this data to life. Deeply integrated with technologies such as Impala, Search, and YARN, Pentaho’s Analytics Platform is optimized for Cloudera Enterprise.

Fast and Flexible Data Ingestion and Transformation

  • Visual Data Integration interface eliminates the need for highly specialized Java and Hadoop MapReduce skills.
  • Support for Spark, HDFS, HBase, Impala, YARN, Sqoop, Flume, Hive, Pig

Governed Data Delivery   

  • Streamlines the delivery of governed analytics-ready data sets balancing business users’ need to analyze the right data with IT Management’s need to control access.

Visual Data Preparation and Modeling Tools for Data Scientists

  • Build, train and execute analytic models at scale
  • Simplified support for R and Weka included in Pentaho’s Data Science Pack
  • Easily deliver analytic results into downstream processes

Pentaho Monetize My Data in the Partner Solutions Gallery

Pentaho Optimize Data Warehouse in the Partner Solutions Gallery

Learn More

About Pentaho

Pentaho, a Hitachi Group company, is a leading data integration and business analytics company with an enterprise-class, open source-based platform for diverse big data deployments. Pentaho’s unified data integration and analytics platform is comprehensive, completely embeddable and delivers governed data to power any analytics in any environment.

Your form submission has failed.

This may have been caused by one of the following:

  • Your request timed out
  • A plugin/browser extension blocked the submission. If you have an ad blocking plugin please disable it and close this message to reload the page.