The Cloud Data Lake Site

Featured Talks from Subsurface LIVE

Watch the recordings from Subsurface LIVE

The Role of Dremio in a Data Mesh Architecture

The Role of Dremio in a Data Mesh Architecture

Read more
Building a Data Factory: A Generic ETL Pipeline Utility Case Study

Building a Data Factory: A Generic ETL Pipeline Utility Case Study

Read more
Compression, Dedupe and Encryption Conundrums in Cloud Data Lakes

Compression, Dedupe and Encryption Conundrums in Cloud Data Lakes

Read more
Data Discovery at Lyft and Convoy

Data Discovery at Lyft and Convoy

Read more
Data Mesh – Enabled with a Self-Serve Platform

Data Mesh – Enabled with a Self-Serve Platform

Read more
Using Data & Analytics for Equitable Social Good

Using Data & Analytics for Equitable Social Good

Read more
Persistent Memory and the Data Lake

Persistent Memory and the Data Lake

Read more
Build a Big Data Interaction Platform

Build a Big Data Interaction Platform

Read more
Inside Apache Druid’s Storage and Query Engine

Inside Apache Druid’s Storage and Query Engine

Read more
Increasing Performance with Arrow and Gandiva

Increasing Performance with Arrow and Gandiva

Read more
Making Cloud Big Data Platforms Open and Secure with Dynamic Data Authorization

Making Cloud Big Data Platforms Open and Secure with Dynamic Data Authorization

Read more
Dagster: An Orchestrator for the Full Data Lifecycle

Dagster: An Orchestrator for the Full Data Lifecycle

Read more
How to Build a Modern Data Lake and/or Warehouse On-Prem

How to Build a Modern Data Lake and/or Warehouse On-Prem

Read more
Using the Data Mesh Architecture to Democratize Data and Accelerate Time to Insight

Using the Data Mesh Architecture to Democratize Data and Accelerate Time to Insight

Read more
Eliminating the Ugly Plumbing of Data Lake Engineering

Eliminating the Ugly Plumbing of Data Lake Engineering

Read more
Fireside Chat: Extracting Insights through Data Control and an Open Data Lake Architecture

Fireside Chat: Extracting Insights through Data Control and an Open Data Lake Architecture

Read more
Rethinking Ingestion: CI/CD for Data Lakes

Rethinking Ingestion: CI/CD for Data Lakes

Read more
Why and How Netflix Created and Migrated to a New Table Format: Iceberg

Why and How Netflix Created and Migrated to a New Table Format: Iceberg

Read more
Avoiding the Architecture Undertow: Building Lighting-Fast Queries with Blazing Fast Object Storage

Avoiding the Architecture Undertow: Building Lighting-Fast Queries with Blazing Fast Object Storage

Read more
Root Cause Analysis for Your Data Lake

Root Cause Analysis for Your Data Lake

Read more
Orchestrating Data Validation Workflows with Prefect

Orchestrating Data Validation Workflows with Prefect

Read more
Platform-Agnostic Lineage Validates and Creates Trust in Your Data Lake

Platform-Agnostic Lineage Validates and Creates Trust in Your Data Lake

Read more
Coral and Transport: Portable SQL and UDFs for Modern Data Lakes

Coral and Transport: Portable SQL and UDFs for Modern Data Lakes

Read more
Introducing the Apache Hudi Table Format, Purpose-Built for Low-Latency Data Lake Use Cases

Introducing the Apache Hudi Table Format, Purpose-Built for Low-Latency Data Lake Use Cases

Read more
Scaling Python Data Science with Dask

Scaling Python Data Science with Dask

Read more
Eliminate Data Pipeline Downtime with Reliable Data Processing, Quality and Consistency

Eliminate Data Pipeline Downtime with Reliable Data Processing, Quality and Consistency

Read more
Azure Storage Types and Use Cases

Azure Storage Types and Use Cases

Read more