July 14, 2025

Lessons Learned from Running Merge-on-Read Iceberg Pipelines at Scale

Organizations are leveraging merge-on-read Apache Iceberg operations to efficiently handle sparse updates. This talk will share insights from running such operations on tables with tens of petabytes of data. You’ll learn when to choose merge-on-read over copy-on-write execution mode, how to optimize the write performance, and the best practices for maintaining such tables using Apache Iceberg’s built-in tools. This presentation will benefit engineers considering Apache Iceberg adoption, as well as those who already use it and seek to enhance their existing production environments.

Topics Covered

ELT/ETL

Video Unavailable

Check again soon!

Speaker

Anton Okolnychyi

Software Engineer, Data Infrastructure at Apple