Dremio Subsurface: Advanced Storage Solutions

A data lake is a storage repository that holds a vast amount of raw data in its native format until it is needed. While a hierarchical data warehouse stores data in files or folders, a data lake uses a flat architecture to store data. In addition to S3, ADLS, and GCS, storage options include MinIO, Dell ECS, IBM, Alibaba, and other smaller cloud providers.
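As a rough illustration of what a flat architecture means in practice, the Python sketch below models a bucket as a single mapping from full key to object. The `bucket` dictionary and the `list_keys` helper are hypothetical stand-ins, not the API of S3, ADLS, GCS, or any other store; the assumption is simply that "folders" are nothing more than shared key prefixes, as is common in object storage.

```python
# Hypothetical in-memory stand-in for an object store bucket.
# Every object lives in one flat mapping keyed by its full name;
# there are no directory objects, only keys that share a prefix.
bucket = {
    "sales/2023/01/orders.parquet": b"...",
    "sales/2023/02/orders.parquet": b"...",
    "logs/app.json": b"...",
}


def list_keys(store, prefix=""):
    """Return the sorted keys that start with `prefix`.

    This mimics a prefix listing: a "folder" is just a filter
    applied to the flat key space, not a separate structure.
    """
    return sorted(key for key in store if key.startswith(prefix))


if __name__ == "__main__":
    # "Browsing a folder" is a prefix filter over the flat key space.
    for key in list_keys(bucket, "sales/2023/"):
        print(key)
```

Calling `list_keys(bucket, "sales/")` and `list_keys(bucket, "sales/2023/01/")` runs the same scan over the same flat key space; only the filter changes, which is the contrast with a hierarchy of files and folders.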
The Future of Intelligent Storage in Big Data
Unlocking Potential with Apache Iceberg Table Formats Subsurface LIVE Sessions

Compaction in Apache Iceberg: Fine-Tuning Your Iceberg Table’s Data Files

Learn how to optimize the data files in your Apache Iceberg table using compaction and its different strategies, including z-order.
Puffins and Icebergs: Additional Stats for Apache Iceberg Tables

A short introduction to Puffin, a new file format in Apache Iceberg that holds additional table statistics.
The Life of a Read Query for Apache Iceberg Tables

What happens under the hood with Apache Iceberg when you run a read query.
Apache Iceberg and the Right to Be Forgotten

Time travel capabilities and privacy laws like GDPR and CCPA are at odds with each other. Here’s how to make sure you’re GDPR/CCPA compliant while using time travel in Apache Iceberg.