Browse all Subsurface content →
Menu
Data Lake Storage
Amazon S3
Azure Data Lake Storage
Google Cloud Storage
File Formats
Apache Parquet
Table Formats
Apache Iceberg
Delta Lake
Metastores
AWS Glue
Hive Metastore
Nessie
Data Lake Engines
Dremio
Apache Spark
Interfaces
Apache Arrow Flight
In-Memory Formats
Apache Arrow
Data Catalog
Amundsen
Business Intelligence
Power BI
Tableau
Apache Superset
Events
Contact Us
Glossary
Subsurface LIVE Winter 2022 sessions are now online!
Nearly 60 live keynotes and breakout sessions.
Watch On Demand Now
Data Lake Storage
Amazon S3
Azure Data Lake Storage
Google Cloud Storage
File Formats
Apache Parquet
Table Formats
Apache Iceberg
Delta Lake
Metastores
AWS Glue
Hive Metastore
Nessie
Data Lake Engines
Dremio
Apache Spark
Interfaces
Apache Arrow Flight
In-Memory Formats
Apache Arrow
Data Catalog
Amundsen
Business Intelligence
Power BI
Tableau
Apache Superset
Search for:
Data Lake Storage
Amazon S3
Azure Data Lake Storage
Google Cloud Storage
File Formats
Apache Parquet
Table Formats
Apache Iceberg
Delta Lake
Metastores
AWS Glue
Hive Metastore
Nessie
Data Lake Engines
Dremio
Apache Spark
Interfaces
Apache Arrow Flight
In-Memory Formats
Apache Arrow
Data Catalog
Amundsen
Business Intelligence
Power BI
Tableau
Apache Superset
Data Lake Storage
File Formats
Table Formats
Metastores
Data Lake Engines
Interfaces
JSON
JSON is an open standard file format that uses human-readable text to store and transmit data objects consisting of attribute–value pairs and arrays.
Optimized Row Columnar (ORC)
JSON
Data Lake Storage
CSV
AWS Glue
AVRO
Apache Parquet
Amazon S3
July 29, 2020
AWS Data Lake Architectures
Read more