AVRO

Avro is a row-oriented remote procedure call and data serialization framework developed within Apache's Hadoop project. It uses JSON for defining data types and protocols, and serializes data in a compact binary format.
...
Optimized Row Columnar (ORC) AVRO Apache Parquet Apache Iceberg Apache Icerbg Big Data Data Lake Data Lakehouse File Format Open Table Format

Puffins and Icebergs: Additional Stats for Apache Iceberg Tables

A short introduction to the new file format called Puffin in Apache Iceberg that helps with additional table statistics
Read more
...

AWS Data Lake Architectures

Read more