Aug - Oct 2025
Apache Iceberg Lakehouse Workshop
Simplify Apache Iceberg Lakehouse Management with Dremio. This Workshop covers working with Iceberg Lakehouses using PySpark and Dremio. This will illustrate working with Dremio's Intelligent Lakehouse Management for unlocking Iceberg's Full Potential.
Who should join?
Data Engineers and Data Architects Interested in Apache Iceberg
Are there prerequisites?
Docker and High-Level Familiarity with Apache Iceberg
Mastering the Iceberg Lakehouse
As modern organizations grow more data-driven, data teams often rely on a diverse array of tools—making interoperability and governance critical challenges. Apache Iceberg, an open table format, provides the foundation for a scalable and consistent lakehouse architecture. However, managing an Iceberg-based lakehouse at scale introduces complexities, from catalog deployment to query acceleration and performance tuning.
This hands-on workshop explores ingesting data into the lakehouse using pySpark and how Dremio simplifies these challenges by delivering an intelligent lakehouse experience. With its integrated Apache Polaris catalog, automated performance optimization, and semantic layer, Dremio empowers data teams to harness the full potential of Apache Iceberg without the operational burden.
Aug - Oct 2025
Apache Iceberg
Lakehouse Workshop
You will learn to:
- Use PySpark to write data into Iceberg tables managed by Dremio
- Run SQL queries on those tables with Dremio’s intuitive interface
- Explore how Dremio delivers autonomous performance management for Iceberg tables
By the end the session,
you will learn to:
01
|
Use PySpark to write data into Iceberg tables managed by Dremio
02
|
Run SQL queries on those tables with Dremio’s intuitive interface
03
|
Optimize Iceberg Tables with Dremio's Autonomous Performance
Fully Managed Environment
Participants will receive temporary access to a fully managed Dremio Enterprise environment to follow along or watch live demonstrations. Whether you're new to Iceberg or looking to streamline your lakehouse architecture, this workshop offers a practical introduction to building and managing a modern, high-performance data platform.
Speaker: Alex Merced
Alex Merced is Head of DevRel at Dremio with experience as a developer and instructor. His professional journey includes roles at GenEd Systems, Crossfield Digital, CampusGuard, and General Assembly. He co-authored "Apache Iceberg: The Definitive Guide" published by O'Reilly and has spoken at notable events such as Data Day Texas and Data Council. Alex is passionate about technology, sharing his expertise through blogs, videos, podcasts like Datanation and Web Dev 101, and contributions to the JavaScript and Python communities with libraries like SencilloDB and CoquitoJS.
Agenda
Housekeeping and Introduction
|
5m
Setting Up Local Spark Environment
|
10m
Running Queries on Data with Dremio
|
10m
Reviewing Dremio Autonomous Peformance
|
10m
Q&A
|
15m
Upcoming dates
Explore and discover our selection of upcoming hands-on Lakehouse Platform workshops, where you'll gain practical experience and master the latest techniques for building and optimizing your data strategy.
Dates
Hour
August 13
9:00 aM - 10:00 AM PT CLOSED
August 26
6:00 AM - 7:00 AM PT CLOSED
September 17
9:00 aM - 10:00 AM PT
September 24
6:00 aM - 7:00 AM PT
October 8
9:00 aM - 10:00 AM PT
October 22
6:00 aM - 7:00 AM PT
Aug - Oct 2025
Apache Iceberg
Lakehouse Workshop
You will learn to:
- Use PySpark to write data into Iceberg tables managed by Dremio
- Run SQL queries on those tables with Dremio’s intuitive interface
- Explore how Dremio delivers autonomous performance management for Iceberg tables