Data Lake Storage

A data lake is a storage repository that holds a vast amount of raw data in its native format until it is needed. While a hierarchical data warehouse stores data in files or folders, a data lake uses a flat architecture to store data. Beyond Amazon S3, Azure Data Lake Storage (ADLS), and Google Cloud Storage (GCS), storage options include MinIO, Dell ECS, IBM, Alibaba, and other smaller cloud providers.
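
As a rough illustration of what a flat architecture means in practice, the sketch below models a bucket as a single map from key to bytes, with "folders" emulated by filtering on a key prefix. The class `FlatObjectStore` and the example keys are hypothetical and exist only for this illustration; it is an in-memory toy, not the API of S3, ADLS, GCS, or any other provider.

```python
from typing import Dict, List


class FlatObjectStore:
    """Toy in-memory stand-in for a bucket: one flat map from key to bytes."""

    def __init__(self) -> None:
        self._objects: Dict[str, bytes] = {}

    def put(self, key: str, data: bytes) -> None:
        # The whole key is an opaque string; "/" has no special meaning here.
        self._objects[key] = data

    def get(self, key: str) -> bytes:
        return self._objects[key]

    def list_keys(self, prefix: str = "") -> List[str]:
        # "Folders" are emulated by filtering keys on a shared prefix.
        return sorted(k for k in self._objects if k.startswith(prefix))


# Hypothetical keys: raw data stored in its native format (JSON, PNG).
store = FlatObjectStore()
store.put("raw/events/2024-01-01.json", b'{"id": 1}')
store.put("raw/events/2024-01-02.json", b'{"id": 2}')
store.put("raw/images/logo.png", b"\x89PNG")

event_keys = store.list_keys("raw/events/")
print(event_keys)
```

Because there is no directory tree to traverse, listing everything under a "folder" is just a prefix match over keys, which is the kind of operation object stores typically expose.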

The Future of Intelligent Storage in Big Data

Table Format Partitioning Comparison: Apache Iceberg, Apache Hudi, and Delta Lake

Learn about the differences in partitioning with Apache Iceberg, Apache Hudi, and Delta Lake.

Laying the foundation of a Data Lakehouse with AWS Glue, Apache Iceberg and Dremio

Migrating a Hive Table to an Iceberg Table Hands-on Tutorial

Learn how to migrate your existing Hive tables into Apache Iceberg tables to take full advantage of features like Version Rollback, Partition Evolution and more.

Apache Iceberg: An Architectural Look Under the Covers
