Browse all Subsurface content →
Menu
Data Lake Storage
Amazon S3
Azure Data Lake Storage
Google Cloud Storage
File Formats
Apache Parquet
Table Formats
Apache Iceberg
Delta Lake
Metastores
AWS Glue
Hive Metastore
Nessie
Data Lake Engines
Dremio
Apache Spark
Interfaces
Apache Arrow Flight
In-Memory Formats
Apache Arrow
Data Catalog
Amundsen
Business Intelligence
Power BI
Tableau
Apache Superset
Events
Contact Us
Glossary
Subsurface LIVE Winter 2022 sessions are now online!
Nearly 60 live keynotes and breakout sessions.
Watch On Demand Now
Data Lake Storage
Amazon S3
Azure Data Lake Storage
Google Cloud Storage
File Formats
Apache Parquet
Table Formats
Apache Iceberg
Delta Lake
Metastores
AWS Glue
Hive Metastore
Nessie
Data Lake Engines
Dremio
Apache Spark
Interfaces
Apache Arrow Flight
In-Memory Formats
Apache Arrow
Data Catalog
Amundsen
Business Intelligence
Power BI
Tableau
Apache Superset
Search for:
Data Lake Storage
Amazon S3
Azure Data Lake Storage
Google Cloud Storage
File Formats
Apache Parquet
Table Formats
Apache Iceberg
Delta Lake
Metastores
AWS Glue
Hive Metastore
Nessie
Data Lake Engines
Dremio
Apache Spark
Interfaces
Apache Arrow Flight
In-Memory Formats
Apache Arrow
Data Catalog
Amundsen
Business Intelligence
Power BI
Tableau
Apache Superset
Data Lake Storage
File Formats
Table Formats
Metastores
Data Lake Engines
Interfaces
Apache Spark
Apache Spark is an open-source unified analytics engine for large-scale
data processing
. Spark provides an interface for programming entire clusters with implicit data parallelism and fault tolerance.
Table Formats
Data Lake Storage
Data Lake Engines
Apache Spark
Apache Iceberg
Amazon S3
September 13, 2022
How Z-Ordering in Apache Iceberg Helps Improve Performance
This tutorial introduces the Z-order clustering algorithm in Apache Iceberg and explains how it adds value to the file optimization strategy.
Read more
Table Formats
Data Lake Engines
Apache Spark
Apache Iceberg
September 12, 2022
Apache Iceberg 101 – Your Guide to Learning Apache Iceberg Concepts and Practices
This article provides an introductory course on the concepts and practices of Apache Iceberg tables for running scalable data lakehouses.
Read more
Table Formats
Data Lake Engines
Apache Spark
Apache Iceberg
September 9, 2022
Getting Started with Apache Iceberg in Databricks
Getting started with Apache Iceberg in Databricks is straightforward. This article walks through the setup and usage step by step.
Read more
Data Lake Engines
Apache Spark
March 30, 2022
Tracking & Triggering Pattern with Spark Stateful Streaming
Read more
Load More