ClouderaNOW   Navigate data architectures, sovereign clouds, & edge data for AI   |   July 15

Register

Overview

In this 2-day instructor-led training, participants will learn how to address the challenges of modern data architectures.

Modern enterprises struggle with fragmented data spread across data lakes, warehouses, and operational systems. Traditional ETL pipelines introduce latency, increase costs, and slow decision-making.

This training enables organizations to adopt real-time data federation using Trino, allowing teams to query distributed data sources instantly—without duplication or movement.

Participants will learn how to build enterprise-grade federated analytics architectures that reduce data engineering overhead, accelerate insights, and unlock business value from existing data ecosystems.

Business Value & Outcomes

By the end of this training, organizations can:

  • Eliminate ETL bottlenecks and reduce pipeline complexity through query-in-place federation

  • Accelerate time-to-insight from hours/days to seconds with real-time distributed querying

  • Reduce data movement and infrastructure costs by avoiding unnecessary data duplication

  • Enable real-time, cross-system analytics across data lakes, databases, and cloud platforms

  • Improve governance, security, and team productivity through unified and controlled data access

Enterprise Use Case

Real-Time Fraud Detection (Banking)

  • Analyze transactions and customer data across multiple systems in real time using Trino-based federation — without data movement.

  • Outcome: Faster fraud detection, unified insights, and reduced infrastructure cost

Capabilities You Will Build

  • Design enterprise-grade data federation architectures across systems

  • Execute and optimize high-performance distributed SQL queries

  • Configure and manage Trino connectors for cross-system access

  • Diagnose and resolve query bottlenecks and performance issues

  • Build scalable, production-ready federation solutions


Download full course description 

Book the course

Day 1 – Distributed SQL & Federation Foundations

Introduction to Trino in the Enterprise

  • Distributed SQL architecture

  • Trino vs Hive/Impala

  • Enterprise use cases

  • Lab: Explore Trino Metadata & SQL Dialect

Architecture & Query Execution Deep Dive

  • Coordinator and worker architecture

  • Query lifecycle

  • Memory and spill management

  • Lab: Deep-Dive Query Plan Analysis

Enterprise Data Federation & Virtualization

  • Data virtualization

  • Federation vs replication

  • Cross-domain analytics

  • Lab: Design a Multi-Domain Federation Architecture

Multi-Catalog & Cross-Domain Queries

  • Catalog configuration

  • Cross-catalog joins

  • Query pushdown

  • Lab: Build and Query a Multi-Catalog Federation

Day 2 – Performance & Enterprise Deployment

Advanced Query Optimization

  • Predicate pushdown

  • Dynamic filtering

  • Join optimization

  • Cost-based optimization

  • Lab: Tune a Complex Federated Query

Monitoring & Troubleshooting

  • Query metrics

  • Bottleneck analysis

  • Memory tuning

  • Lab: Diagnose and Remediate a Performance Problem

Workload Management & Resource Control

  • Resource groups

  • Query prioritization

  • Concurrency control

  • Multi-tenancy

  • Lab: Configure Resource Groups for Multi-Tenant Isolation

Enterprise Architecture & Production Best Practices

  • Hybrid federation

  • High availability

  • Scaling strategies

  • Production deployment patterns

  • Lab: Extended Capstone — Trino Federation Design Review

Ready to Get Started?

Your form submission has failed.

This may have been caused by one of the following:

  • Your request timed out
  • A plugin/browser extension blocked the submission. If you have an ad blocking plugin please disable it and close this message to reload the page.