Tuning Row-Level Operations in Apache Iceberg

Companies leverage Apache Iceberg to build reliable and efficient data lakes with features that are normally present only in data warehouses. As users begin to use Apache Iceberg in a bigger range of data processing scenarios, it is essential to support efficient and transactional delete/update/merge operations even in read-mostly data lake environments.

This talk will be a deep dive into the copy-on-write and merge-on-read approaches for executing row-level operations in Apache Iceberg so that users can pick the correct implementation for a given use case. In addition, the presentation will help data engineers to avoid common mistakes and tune delete/update/merge operations at scale.

Topics Covered

Apache Iceberg
Table Formats


Anton Okolnychyi

Anton Okolnychyi

Anton is a committer and PMC member of Apache Iceberg as well as an Apache Spark contributor at Apple.

Ready to Get Started? Here Are Some Resources to Help

Case Study

When E-Commerce Explodes – The More Data the More Dremio

read more


Real-World Strategies to Optimize Data Platform Cost

read more
On-Demand webinar graphic


Centralize Data Security Governance on your Open Data Lakehouse with Dremio & Privacera

read more

Get Started Free

No time limit - totally free - just the way you like it.

Sign Up Now

See Dremio in Action

Not ready to get started today? See the platform in action.

Watch Demo

Talk to an Expert

Not sure where to start? Get your questions answered fast.

Contact Us