Browse all Subsurface content →
Menu
Data Lake Storage
Amazon S3
Azure Data Lake Storage
Google Cloud Storage
File Formats
Apache Parquet
Table Formats
Apache Iceberg
Delta Lake
Metastores
AWS Glue
Hive Metastore
Nessie
Data Lake Engines
Dremio
Apache Spark
Interfaces
Apache Arrow Flight
In-Memory Formats
Apache Arrow
Data Catalog
Amundsen
Business Intelligence
Power BI
Tableau
Apache Superset
Events
Contact Us
Subsurface LIVE Winter 2022 sessions are now online!
Nearly 60 live keynotes and breakout sessions.
Watch On Demand Now
Data Lake Storage
Amazon S3
Azure Data Lake Storage
Google Cloud Storage
File Formats
Apache Parquet
Table Formats
Apache Iceberg
Delta Lake
Metastores
AWS Glue
Hive Metastore
Nessie
Data Lake Engines
Dremio
Apache Spark
Interfaces
Apache Arrow Flight
In-Memory Formats
Apache Arrow
Data Catalog
Amundsen
Business Intelligence
Power BI
Tableau
Apache Superset
Search for:
Data Lake Storage
Amazon S3
Azure Data Lake Storage
Google Cloud Storage
File Formats
Apache Parquet
Table Formats
Apache Iceberg
Delta Lake
Metastores
AWS Glue
Hive Metastore
Nessie
Data Lake Engines
Dremio
Apache Spark
Interfaces
Apache Arrow Flight
In-Memory Formats
Apache Arrow
Data Catalog
Amundsen
Business Intelligence
Power BI
Tableau
Apache Superset
Dremio Subsurface: Advanced Storage Solutions
File Formats
Table Formats
Metastores
Data Lake Engines
Interfaces
File Formats
A file format is a standard way that information is encoded for storage in a computer file. It specifies how bits are used to encode information in a digital storage medium.
Unlocking Potential with Apache Iceberg
Optimized Row Columnar (ORC)
Dremio Subsurface for Apache Parquet
AVRO
October 27, 2022
Puffins and Icebergs: Additional Stats for Apache Iceberg Tables
A short introduction to the new file format called Puffin in Apache Iceberg that helps with additional table statistics
Read more
File Formats
Dremio Subsurface for Apache Parquet
March 2, 2022
1 Stone, 3 Birds: Finer – Grained Encryption @ Apache Parquet
Read more
Unlocking Potential with Apache Iceberg
Table Formats
Subsurface: Nessie Project Insights
Metastores
File Formats
Dremio Subsurface for Apache Spark
Data Lake Engines
CSV
September 27, 2021
Project Nessie: Transactional Catalog for Data Lakes with Git-like semantics
Read more
Unlocking Potential with Apache Iceberg
Table Formats
Subsurface: Nessie Project Insights
Metastores
In-Memory Formats
Dremio Subsurface for Apache Parquet
Dremio Subsurface for Apache Arrow
Apache Arrow Flight
July 22, 2021
Panel: Open Data Architecture
Read more
Load More