Data Lake Engines

A data lake engine is an application or service which queries and/or processes the vast sets of data stored in data lake storage. Data lake processing engines like Apache Spark are often used for batch data transformation jobs and machine learning. Data lake query engines such as Dremio and Presto are used to analyze structured and semi-structured data in place for business intelligence (BI) and data science.
...

Project Nessie: Transactional Catalog for Data Lakes with Git-like semantics

Read more
...

DataOps in a Manufacturing Company – Anomaly or Solution?

Read more
...

Unified Analytics and Workloads with Dremio and HPE Ezmeral

Read more
...

Building a Data Lake Applied Platform at Raiffeisenbank

Read more