Why Cloudera + Pentaho?
With Pentaho and Cloudera, organizations get a comprehensive view of data across the enterprise. Optimized for Cloudera Enterprise, Pentaho's Big Data Integration and Analytics platform gives organizations a competitive edge in a data-driven enterprise. Pentaho and Cloudera continuously collaborate to address the big data market and customer challenges use cases. The close partnership and an open platform enable Pentaho to optimize performance and access to Cloudera Enterprise featuring integrations with Impala, YARN, and Spark.
Pentaho data integration prepares and blends data to create a complete picture of your business that drives actionable insights. The platform delivers accurate, “analytics ready” data to end users from any source. With visual tools to eliminate coding and complexity, Pentaho puts big data and all data sources at the fingertips of business and IT users.
Pentaho and Cloudera share a common history and approach to simplifying complex, but powerful technologies to integrate and analyze big data. Our common open source heritage means that we can innovate at the speed of our customers businesses.
Eddie White, EVP Business Development, Pentaho
Joint Solution Overview
With Cloudera Enterprise, leading organizations are changing the way they think—transforming data from an expense to an asset. Pentaho’s comprehensive analytics platform, data integration, and a spectrum of data visualization and analysis capabilities can bring this data to life. Deeply integrated with technologies such as Impala, Search, and YARN, Pentaho’s Analytics Platform is optimized for Cloudera Enterprise.
Fast and Flexible Data Ingestion and Transformation
- Visual Data Integration interface eliminates the need for highly specialized Java and Hadoop MapReduce skills.
- Support for Spark, HDFS, HBase, Impala, YARN, Sqoop, Flume, Hive, Pig
Governed Data Delivery
- Streamlines the delivery of governed analytics-ready data sets balancing business users’ need to analyze the right data with IT Management’s need to control access.
Visual Data Preparation and Modeling Tools for Data Scientists
- Build, train and execute analytic models at scale
- Simplified support for R and Weka included in Pentaho’s Data Science Pack
- Easily deliver analytic results into downstream processes
- Watch the webinar: Big Data Retail Therapy- How Big Data Tempts Customers to Buy More
- Watch the video: Demonstrating Cloudera Search and Impala using data from Chicago Crime Data Warehouse
- Success story: EDO Optimizes Data Warehouse, increases loyalty and targets new customers
- Read the Joint Solution Brief
Pentaho, a Hitachi Group company, is a leading data integration and business analytics company with an enterprise-class, open source-based platform for diverse big data deployments. Pentaho’s unified data integration and analytics platform is comprehensive, completely embeddable and delivers governed data to power any analytics in any environment.
Analytics and Business Intelligence, Data Integration
- Continuous collaboration to address the big data market and customer challenges use cases
- Pentaho’s Analytics Platform is deeply integrated and optimized for Cloudera Enterprise
- Pentaho's Big Data Integration and Analytics platform gives organizations a competitive edge in a data-driven enterprise