Apache Spark

Apache Spark is an open-source unified analytics engine for large-scale data processing. Spark provides an interface for programming entire clusters with implicit data parallelism and fault tolerance.
...

Project Nessie: Transactional Catalog for Data Lakes with Git-like semantics

Read more
...

Driving Better Analytics Using Cloud Data Lakes

Read more
...

Enabling Analysts to Build a Lakehouse with Spark SQL and Iceberg

Read more
...

Iceberg at Adobe: Challenges, Lessons & Achievements

Read more
...

Serverless Cloud Data Lake with Spark for Serving Weather Data

Read more
...

Building an Efficient Data Pipeline for Data Intensive Workloads

Read more