Table Formats

A table format determines how you manage, organize, and track all of the files that make up a table. You can think of it as an abstraction layer between your physical data files (written in formats such as Parquet or ORC) and the way those files are structured to form a table.
Compaction in Apache Iceberg: Fine-Tuning Your Iceberg Table’s Data Files

Learn how to optimize the data files in your Apache Iceberg table using compaction and its different strategies, including Z-order.
Puffins and Icebergs: Additional Stats for Apache Iceberg Tables

A short introduction to Puffin, a new file format in Apache Iceberg that stores additional table statistics.
The Life of a Read Query for Apache Iceberg Tables

What happens under the hood in Apache Iceberg when you run a read query.
Apache Iceberg and the Right to Be Forgotten

Time travel capabilities and privacy laws like GDPR and CCPA are at odds with each other. Here’s how to make sure you’re GDPR/CCPA compliant while using time travel in Apache Iceberg.