Cloudera's commitment to an open data lakehouse empowers customers with the flexibility to use any engine or tool of choice—whether from Cloudera, other vendors, or open source. We understand the complexity of modern data ecosystems, and our engine-neutral approach ensures seamless collaboration across teams accessing data to build analytical or AI applications and agents. We continuously enhance our lakehouse with innovative features for speed, security, automation, and interoperability, ensuring all engines run concurrently and efficiently and have access to all features and optimizations.
The Cloudera Lakehouse Optimizer provides predictive and intelligent optimizations, automating Apache Iceberg table maintenance and ensuring your open data lakehouse remains performant, scalable, and cost-effective. This service empowers data teams with a cost-efficient lakehouse for all their AI and analytical workloads.
We know that performance and cost efficiency are paramount, which is why we're sharing compelling results from our internal benchmarks. We tested Cloudera Lakehouse Optimizer using 7 TPC-DS tables (107 GB of data), executing TPC-DS queries before and after optimization. Even after accounting for caching and removing outliers, the results are significant:
13x faster queries: Our data shows an average 13x query time improvement, reducing average query time from 24 seconds to a mere 1.8 seconds after optimization!
36% storage cost reduction: Cloudera Lakehouse Optimizer also drives substantial cost savings by optimizing your storage footprint. Our benchmarks revealed a 36% reduction in dataset size, from 107 GB to 68 GB. This directly translates to a lower total cost of ownership (TCO).
These results demonstrate how Cloudera Lakehouse Optimizer improves query performance for downstream AI, reporting, and analytics, and also significantly reduces your storage costs.
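The headline figures follow directly from the numbers quoted above. The short Python check below reproduces that arithmetic; the variable names are ours, and equating the size reduction with a storage cost reduction assumes that storage is billed in proportion to the bytes stored.

```python
# Figures quoted in the benchmark summary above.
avg_query_before_s = 24.0   # average query time before optimization
avg_query_after_s = 1.8     # average query time after optimization
size_before_gb = 107        # dataset size before optimization
size_after_gb = 68          # dataset size after optimization

# Speedup is the ratio of the old average to the new average.
speedup = avg_query_before_s / avg_query_after_s

# Size reduction is the drop in size relative to the starting size.
size_reduction_pct = (size_before_gb - size_after_gb) / size_before_gb * 100

print(f"speedup: {speedup:.1f}x")                    # speedup: 13.3x
print(f"size reduction: {size_reduction_pct:.1f}%")  # size reduction: 36.4%
```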
Whether you're a platform lead focused on cost controls, a data architect designing scalable solutions, or a data engineer streamlining processes, Cloudera Lakehouse Optimizer is built for you. It comes with policy templates and defaults, enabling immediate optimization without extensive configuration. For specific requirements, the graphical user interface (GUI) and application programming interface (API) offer best-in-class controls.
Let's explore how Cloudera Lakehouse Optimizer uniquely tackles table optimization to deliver these performance and storage benefits:
Intelligent policies: Cloudera Lakehouse Optimizer assesses whether a table requires optimization and autonomously runs only the actions that are needed, when they are needed. It exposes configurable action arguments for all Iceberg optimizations, so each action can be tuned for maximum performance.
Engine and storage agnostic: Once the Lakehouse Optimizer has optimized a table, any engine accessing that data from the lakehouse sees the same query performance improvements, whether the engine is Cloudera-owned, open source, or from another vendor. These optimizations also apply to data stored in any cloud object storage or on-premises object store.
Unmatched scope and control: Cloudera Lakehouse Optimizer allows granular control over policy application. You can create and apply policies at the table, namespace, or even entire catalog level, which keeps management flexible and scalable as your lakehouse evolves. Optimizations can be defined against nearly all arguments, enabling the best policy definition for your tables. This broad scope is a significant differentiator compared to other solutions with more limited policy application. The optimizer also includes a dedicated GUI, enabling all users to comfortably configure and monitor optimizations. For programmatic control, comprehensive API and command line interface (CLI) access is also available. It also provides flexibility and control over when and how optimizations run.
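To make the ideas of scoped policies and need-based optimization concrete, here is a minimal Python sketch. It is not the Cloudera Lakehouse Optimizer API: the names `Policy`, `TableStats`, `resolve_policy`, and `needs_compaction`, the small-file thresholds, and the rule that the most specific scope wins (table over namespace over catalog) are all assumptions made for illustration.

```python
from dataclasses import dataclass
from typing import Dict, Optional, Tuple


@dataclass(frozen=True)
class Policy:
    """Hypothetical policy: compact once enough small files accumulate."""
    name: str
    min_small_files: int   # assumed trigger: number of small files
    small_file_mb: float   # assumed cutoff below which a file is "small"


@dataclass(frozen=True)
class TableStats:
    """Hypothetical table statistics: size of each data file in MB."""
    file_sizes_mb: Tuple[float, ...]


def resolve_policy(
    policies: Dict[Tuple[str, ...], Policy],
    catalog: str,
    namespace: str,
    table: str,
) -> Optional[Policy]:
    """Return the most specific policy that covers the table, if any.

    Assumed precedence: table scope, then namespace scope, then catalog scope.
    """
    for scope in ((catalog, namespace, table), (catalog, namespace), (catalog,)):
        if scope in policies:
            return policies[scope]
    return None


def needs_compaction(stats: TableStats, policy: Policy) -> bool:
    """Decide whether the table has enough small files to justify a rewrite."""
    small = sum(1 for size in stats.file_sizes_mb if size < policy.small_file_mb)
    return small >= policy.min_small_files


# One catalog-wide default plus a stricter override for a single table.
policies = {
    ("lake",): Policy("catalog-default", min_small_files=100, small_file_mb=32.0),
    ("lake", "sales", "orders"): Policy("orders-aggressive", min_small_files=3, small_file_mb=64.0),
}
stats = TableStats(file_sizes_mb=(4.0, 8.0, 16.0, 512.0))

orders_policy = resolve_policy(policies, "lake", "sales", "orders")
returns_policy = resolve_policy(policies, "lake", "sales", "returns")

print(orders_policy.name, needs_compaction(stats, orders_policy))
print(returns_policy.name, needs_compaction(stats, returns_policy))
```

In this sketch the same file layout triggers compaction under the table-level override but not under the catalog-wide default, which is the kind of outcome that scoped policies combined with a need check are meant to allow.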
Experience the power of automated, intelligent Iceberg table optimization and realize significant performance and cost benefits today.
Learn more about the Cloudera Lakehouse Optimizer by watching a demo.
Take advantage of our special promotional offer: All data processed through Cloudera Lakehouse Optimizer will be free until April 26th, 2026! While there is a minimal base cost, this promotion ensures you can explore Cloudera Lakehouse Optimizer’s capabilities without worrying about data processing fees. Furthermore, you can set consumption limits via the Cloudera Management Console to ensure costs never exceed your expectations.