Menu
Data Lake Storage
Amazon S3
Azure Data Lake Storage
Google Cloud Storage
File Formats
Apache Parquet
Table Formats
Apache Iceberg
Delta Lake
Metastores
AWS Glue
Hive Metastore
Nessie
Data Lake Engines
Dremio
Apache Spark
Interfaces
Apache Arrow Flight
In-Memory Formats
Apache Arrow
Data Catalog
Amundsen
Business Intelligence
Power BI
Tableau
Apache Superset
Events
Contact Us
Subsurface LIVE Winter 2022 sessions are now online!
Nearly 60 live keynotes and breakout sessions.
Resource Library
Sharing ideas and information is a cornerstone of Subsurface. Our Subsurface LIVE sessions and informational resources offer a quick and engaging way to learn more about data lakes.
May 14, 2022
Row-Level Changes on the Lakehouse: Copy-On-Write vs. Merge-On-Read in Apache Iceberg
How copy-on-write and merge-on-read work in Apache Iceberg.
January 27, 2021
Centralized Security and Governance in the Cloud
July 29, 2020
Smart Data Lakes for Predictive and Prescriptive Analytics
Migrating to Parquet – The Veraset Story
Introducing InfluxDB IOx, a Federated In-Memory Columnar Store Backed by Object Storage
Implementing a Data Mesh Architecture at JPMC
Iceberg at Adobe: Challenges, Lessons & Achievements
High-Performance Big Data Analytics Processing Using Hardware Acceleration
High Frequency Small Files vs. Slow Moving Datasets
GOing Native with Arrow Flight and Dremio
Flexible Data Lake Architectures for Seamless Real-time Data and Machine Learning Integrations
Enabling Real-Time Analytics for Data Lakes with Apache Ignite
Designing Performant, Scalable, and Secure Data Lakes
Data Observability for Data Lakes: The Next Frontier of Data Engineering
Data Lineage with Apache Airflow