Overview

Data lake flexibility & data warehouse performance in a single platform.

Data lakes store data of all shapes and sizes and provide the flexibility to run multiple compute engines. They are best suited to trend and statistical analytics with varying degrees of accuracy.

Data warehouses, on the other hand, are best suited to SQL-style analytics where speed and accuracy are critical, but they can only store and process structured or semi-structured data.

For today’s business challenges, you need the best of both worlds: more precise analyses on all data, structured and unstructured, at the speed of a data warehouse and with the flexibility of a data lake, without unnecessary data transformation or movement.

A lakehouse architecture delivers reliability and analytics performance for any data at scale. Run business intelligence, AI, and machine learning use cases on the same data with your choice of engine.

Key Characteristics

Multi-function analytics on both streaming and stored data in a cloud-native object store, across hybrid and multi-cloud environments.

Open

Cloudera’s Data Lakehouse is 100% open: open source, open standards-based, and widely adopted by the community. It can store multiple data formats without requiring any transformations, and it enables multiple applications to work together transactionally. Because it is built on Apache Iceberg’s open table format, best-of-breed compute engines can operate on the same dataset for AI, ML, BI, log analytics, graph processing, and more. Your data is never locked in, and you stay in control of it.

Easy

Cloudera offers the easiest path to adopting, deploying, and using a lakehouse. By integrating Iceberg right into SDX, we provide the fastest path to adopting Iceberg and simplifying your data architecture. Migration is metadata-only and runs with a single command, without touching any of the underlying large datasets.

Iceberg also maintains and tracks the state of dataset evolution and changes over time, simplifying data management for large datasets.
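
As a rough illustration of what "metadata-only" and "tracking changes over time" can mean, here is a minimal toy sketch in Python. It is not Iceberg's or Cloudera's API: the names ToyTable, Snapshot, and migrate_metadata_only, and the file paths, are hypothetical and exist only for this example. The sketch assumes that a migration simply records the existing data files in a first snapshot, and that each later change adds a new snapshot instead of rewriting earlier ones.

```python
from dataclasses import dataclass, field
from typing import List, Tuple


@dataclass(frozen=True)
class Snapshot:
    """One immutable table state: an id plus the data files it references."""
    snapshot_id: int
    operation: str
    data_files: Tuple[str, ...]


@dataclass
class ToyTable:
    """Hypothetical stand-in for a table's metadata layer (not Iceberg's API)."""
    name: str
    snapshots: List[Snapshot] = field(default_factory=list)

    def _commit(self, operation: str, files: Tuple[str, ...]) -> Snapshot:
        # Every change appends a new snapshot; older snapshots are never edited.
        snap = Snapshot(len(self.snapshots) + 1, operation, files)
        self.snapshots.append(snap)
        return snap

    def current_files(self) -> Tuple[str, ...]:
        return self.snapshots[-1].data_files if self.snapshots else ()

    def append(self, new_files: List[str]) -> Snapshot:
        return self._commit("append", self.current_files() + tuple(new_files))

    def files_as_of(self, snapshot_id: int) -> Tuple[str, ...]:
        # Look up the file list recorded by an earlier snapshot.
        for snap in self.snapshots:
            if snap.snapshot_id == snapshot_id:
                return snap.data_files
        raise KeyError(f"unknown snapshot {snapshot_id}")


def migrate_metadata_only(name: str, existing_files: List[str]) -> ToyTable:
    """Register existing files in a first snapshot; no file is read or copied."""
    table = ToyTable(name)
    table._commit("migrate", tuple(existing_files))
    return table


if __name__ == "__main__":
    # Hypothetical paths; only the path strings are recorded, never the data.
    legacy = ["s3://bucket/sales/part-0.parquet", "s3://bucket/sales/part-1.parquet"]
    table = migrate_metadata_only("sales", legacy)
    table.append(["s3://bucket/sales/part-2.parquet"])
    print(len(table.files_as_of(1)), len(table.current_files()))
```

In this toy model the migration cost depends on the number of files listed, not on their size, and the state of the table at any earlier snapshot stays queryable after later appends.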

 

Portable

Your lakehouse can run anywhere: on any cloud or in your own data center. Cloudera provides the same data services and best-of-breed engines across all form factors, so you get a consistent experience and full portability of data services across clouds. Build once and run anywhere.

Zero Ops

CDP One is a SaaS data lakehouse with powerful end-to-end analytic and ML tools that accelerate insights without the need for specialized operations staff. The production-ready service is continuously optimized and secured to reduce risk and lower total cost of ownership (TCO). With low-code tools and powerful analytics, CDP One enables anyone to build easy, integrated workflows.

Secure

Iceberg tables in CDP integrate with the Shared Data Experience (SDX) metastore, which provides unified security, fine-grained policies, governance, lineage, and metadata management across multiple clouds. You can focus on analyzing your data while we take care of keeping it secure and interoperable.

Resources
 

Discover more insights on managing data anywhere

Webinar

Is your data lakehouse really open?

Whitepaper

Data architecture series: Open data lakehouse
