Apache Spark

Apache Spark is an open-source unified analytics engine for large-scale data processing. Spark provides an interface for programming entire clusters with implicit data parallelism and fault tolerance.
...

Tracking & Triggering Pattern with Spark Stateful Streaming

Use Apache Spark stateful streaming to build services that minimize processing time while staying within defined SLAs.
...

Project Nessie: Transactional Catalog for Data Lakes with Git-like semantics

...

Build a Big Data Interaction Platform

...

Driving Better Analytics Using Cloud Data Lakes
