Table Formats

A table format determines how you manage, organize, and track all of the files that make up a table. You can think of it as an abstraction layer between your physical data files (written in Parquet, ORC, etc.) and how they are structured to form a table.
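To make the "abstraction layer" idea concrete, here is a minimal sketch in Python. It is a toy model only, not the metadata layout of Iceberg, Hudi, or any other real format: the names `Snapshot`, `ToyTable`, `append`, `current_files`, and `files_as_of`, as well as the file paths, are hypothetical and chosen purely for illustration. The assumption is that a table is just a log of snapshots, each listing the data files that belong to the table at that point.

```python
from dataclasses import dataclass, field
from typing import List, Tuple


@dataclass(frozen=True)
class Snapshot:
    """Immutable list of the data files that make up the table at one point in time."""
    snapshot_id: int
    data_files: Tuple[str, ...]


@dataclass
class ToyTable:
    """Hypothetical table metadata: a log of snapshots, the last one being current."""
    name: str
    snapshots: List[Snapshot] = field(default_factory=list)

    def current_files(self) -> Tuple[str, ...]:
        # Readers ask the metadata which files to scan instead of listing a directory.
        return self.snapshots[-1].data_files if self.snapshots else ()

    def append(self, new_files: List[str]) -> Snapshot:
        # A commit records a new snapshot; earlier snapshots are left untouched.
        snap = Snapshot(len(self.snapshots) + 1, self.current_files() + tuple(new_files))
        self.snapshots.append(snap)
        return snap

    def files_as_of(self, snapshot_id: int) -> Tuple[str, ...]:
        # Older snapshots stay readable (snapshot ids start at 1 in this sketch).
        return self.snapshots[snapshot_id - 1].data_files


# Usage: two commits, mixing file types under one logical table.
table = ToyTable("events")
table.append(["data/part-0001.parquet"])
table.append(["data/part-0002.parquet", "data/part-0003.orc"])
print(table.current_files())
print(table.files_as_of(1))
```

The point of the sketch is that the query engine never needs to know how files are laid out on storage; it only asks the table metadata which files are part of the table, either now or as of an earlier snapshot.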
...

Project Nessie: Transactional Catalog for Data Lakes with Git-like semantics

...

Lessons Learned From Running Apache Iceberg at Petabyte Scale

How to maintain Iceberg tables in their optimal shapes while running at petabyte scale.

...

Covering Indexes in the Data Lake with Hyperspace

...

Introducing the Apache Hudi Table Format, Purpose-Built for Low-Latency Data Lake Use Cases

...

Iceberg Case Studies

...

Enabling Analysts to Build a Lakehouse with Spark SQL and Iceberg

...

What Is Apache Iceberg?

...

Hiveberg: Integrating Apache Iceberg with the Hive Metastore

...

The Future of Intelligent Storage in Big Data

...

Apache Iceberg: What’s New

...

Iceberg at Adobe: Challenges, Lessons & Achievements

...

Deep Dive into Iceberg SQL Extensions

...

High Frequency Small Files vs. Slow Moving Datasets

...

The New Data Tier
