Menu
  • Data Lake Storage
    • Amazon S3
    • Azure Data Lake Storage
    • Google Cloud Storage
  • File Formats
    • Apache Parquet
  • Table Formats
    • Apache Iceberg
    • Delta Lake
  • Metastores
    • AWS Glue
    • Hive Metastore
    • Nessie
  • Data Lake Engines
    • Dremio
    • Apache Spark
  • Interfaces
    • Apache Arrow Flight
  • In-Memory Formats
    • Apache Arrow
  • Data Catalog
    • Amundsen
  • Business Intelligence
    • Power BI
    • Tableau
    • Apache Superset
  • Subsurface LIVE Winter 2022 sessions are now online!

    Nearly 60 live keynotes and breakout sessions.

    Watch On Demand Now
  • Data Lake Storage
    • Amazon S3
    • Azure Data Lake Storage
    • Google Cloud Storage
  • File Formats
    • Apache Parquet
  • Table Formats
    • Apache Iceberg
    • Delta Lake
  • Metastores
    • AWS Glue
    • Hive Metastore
    • Nessie
  • Data Lake Engines
    • Dremio
    • Apache Spark
  • Interfaces
    • Apache Arrow Flight
  • In-Memory Formats
    • Apache Arrow
  • Data Catalog
    • Amundsen
  • Business Intelligence
    • Power BI
    • Tableau
    • Apache Superset
  • Resource Library

    Sharing ideas and information is a cornerstone of Subsurface. Our Subsurface LIVE sessions and informational resources offer a quick and engaging way to learn more about data lakes.

    Filter by Event

    Filter by Topic

    Reset

    Read Now

    May 14, 2022

    Row-Level Changes on the Lakehouse: Copy-On-Write vs. Merge-On-Read in Apache Iceberg

    How copy-on-write and merge-on-read work in Apache Iceberg.

    January 27, 2021

    Centralized Security and Governance in the Cloud

     

    July 29, 2020

    Smart Data Lakes for Predictive and Prescriptive Analytics

     

    Migrating to Parquet – The Veraset Story

     

    Introducing InfluxDB IOx, a Federated In-Memory Columnar Store Backed by Object Storage

     

    Implementing a Data Mesh Architecture at JPMC

     

    Iceberg at Adobe: Challenges, Lessons & Achievements

     

    High-Performance Big Data Analytics Processing Using Hardware Acceleration

     

    High Frequency Small Files vs. Slow Moving Datasets

     

    GOing Native with Arrow Flight and Dremio

     

    Flexible Data Lake Architectures for Seamless Real-time Data and Machine Learning Integrations

     

    Enabling Real-Time Analytics for Data Lakes with Apache Ignite

     

    Designing Performant, Scalable, and Secure Data Lakes

     

    Data Observability for Data Lakes: The Next Frontier of Data Engineering

     

    Data Lineage with Apache Airflow

     

    Centralized Security and Governance in the Cloud

     

    The Cloud Data Lake Community

    The Subsurface Community is a forum for sharing trends and strategies propelling today's cloud data lake ecosystem, including data lakehouses, ETL, orchestration, data quality, and visualization.

    Follow Us on Twitter
    Join Us on Slack
    Connect With Us
    © Copyright - Subsurface Live 2022