Data Lake Engines

A data lake engine is an application or service which queries and/or processes the vast sets of data stored in data lake storage. Data lake processing engines like Apache Spark are often used for batch data transformation jobs and machine learning. Data lake query engines such as Dremio and Presto are used to analyze structured and semi-structured data in place for business intelligence (BI) and data science.
...
Data Lake Engines Apache Spark Subsurface LIVE Sessions

Tracking & Triggering Pattern with Spark Stateful Streaming

Using Apache Spark Stateful Streaming to create services that minimize processing time while keeping everything under defined SLAs.
Read more
...

Project Nessie: Transactional Catalog for Data Lakes with Git-like semantics

Read more
...
Dremio Data Lake Storage Data Lake Engines Subsurface LIVE Sessions

How to Build a Modern Data Lake and/or Warehouse On-Prem

Read more
...
Dremio Data Lake Engines Apache Spark Subsurface LIVE Sessions

Build a Big Data Interaction Platform

Read more