
Dremio Subsurface for Apache Spark

Apache Spark is an open-source unified analytics engine for large-scale data processing. Spark provides an interface for programming entire clusters with implicit data parallelism and fault tolerance.
How Z-Ordering in Apache Iceberg Helps Improve Performance

This tutorial introduces the Z-order clustering algorithm in Apache Iceberg and explains how it fits into a file optimization strategy.
Apache Iceberg 101 – Your Guide to Learning Apache Iceberg Concepts and Practices

This article provides an introductory course on the concepts and practices of Apache Iceberg tables for running scalable data lakehouses.
Getting Started with Apache Iceberg in Databricks

Getting started with Apache Iceberg in Databricks is straightforward. This article walks through the setup and usage step by step.
Tracking & Triggering Pattern with Spark Stateful Streaming
