Data Lake Engines

A data lake engine is an application or service which queries and/or processes the vast sets of data stored in data lake storage. Data lake processing engines like Apache Spark are often used for batch data transformation jobs and machine learning. Data lake query engines such as Dremio and Presto are used to analyze structured and semi-structured data in place for business intelligence (BI) and data science.
...

Project Nessie: Transactional Catalog for Data Lakes with Git-like semantics

Read more
...

DataOps in a Manufacturing Company – Anomaly or Solution?

Read more
...

Unified Analytics and Workloads with Dremio and HPE Ezmeral

Read more
...

Building a Data Lake Applied Platform at Raiffeisenbank

Read more
...

Driving Better Analytics Using Cloud Data Lakes

Read more
...

Enabling Analysts to Build a Lakehouse with Spark SQL and Iceberg

Read more
...

Iceberg at Adobe: Challenges, Lessons & Achievements

Read more
...

GOing Native with Arrow Flight and Dremio

Read more
...

Serverless Cloud Data Lake with Spark for Serving Weather Data

Read more
...

Building an Efficient Data Pipeline for Data Intensive Workloads

Read more
...

Reducing Time to Market with S7 Airlines Self Service Data Platform

Read more