A modern data warehouse has the flexibility to play the role of a traditional data warehouse, data lake, data mart, and whatever comes next.
Cloudera Data Warehouse delivers:
- Traditional data warehouse functions for BI reporting and modeling
- Data analytics for structured data, machine logs, text, and IoT data
- On-demand self-service data access that encourages collaboration
- Support for diverse groups of users with varying levels of analytical skills
- Interoperability with machine learning engines and algorithms for easier experimentation
- Hybrid choice to run on-premises, on public clouds, or in any combination
- Security, control, and governance for diverse data and analytics
Helping customers drive actionable insights from data
Enterprise data warehouse (EDW) optimization
Augment what you’re doing today and relieve unnecessary EDW pressure.
- Decrease storage costs and focus on high-value reporting
- Discover how unlimited scale keeps data accessible and out of archive
- Eliminate contention and meet SLAs for routine reporting
- Enable ad hoc and exploratory analytics for new insights
Operations data warehouse
Manage fast, flexible ETL over large data volumes, so data is always ready for your business.
- Large unstructured data volumes of any type, such as web, log, and IoT data
- Distributed processing and best-of-breed technologies for the fastest performance
- Prepared data available immediately for analytics with shared storage and metadata
Research & discovery data warehouse
Support high-performance, ad hoc access for more users.
- No rigid data modeling encumbrances for agile acquisition
- Interactive responses for iterative exploration and modeling
- Ability to handle all BI and SQL users and integrate with the leading BI tools
- Easy addition of nodes to handle more data and users
Enterprise-grade open-source integration
We carefully curate, integrate, and support open-source tooling for a modern data warehouse. Our interactive SQL engine, powered by Apache Impala, delivers high-scale, high-concurrency queries. With Apache Kudu, you can run discovery and operational workloads on fast-changing and IoT data. Apache Hive on Spark delivers rapid ETL/ELT processing for data preparation. Apache Solr optimizes search and discovery. You can interact with each of these analytics engines through an intelligent SQL workbench powered by HUE.
Security, control, and governance for diverse data and analytics
Unlike traditional analytics systems, Cloudera Data Warehouse goes beyond SQL to deliver insights that can be leveraged for machine learning and real-time operations. Also unlike those systems, it is built with Cloudera SDX, a shared data experience. SDX is a software framework that applies a consistent set of security and governance policies, backed by a common data catalog, to diverse analytic workloads running in the cloud, on premises, or in hybrid environments. A Cloudera shared data experience makes it easier and safer to get the most from a modern data warehouse.