6 minute read · July 24, 2019
Modern Data Platform and the Cloud

· Data Lake Platform Owner, Raiffeisenbank

Part 1
One of the very first decisions that you are going to face is whether to build your data platform on-prem or on the cloud. The cloud provides an infrastructure as a service, which allows to instantiate a piece of infrastructure on demand with a simple API call. It drastically reduces time to market and enables self-service, autonomous environment. It’s not a surprise then, that analytical agencies promote a “cloud first” approach. However, you probably heard that the cloud is very expensive and can become a ‘money pit’ that is difficult to manage. While it’s true that the cloud is not cheap, with the right approach, it can provide a better TCO (Total Cost of Ownership) and ROI than on-prem solutions, and it can be a real enabler for your data platform. So, what is the secret sauce of success in the cloud? It’s actually very simple. In a few words – decoupling data and compute will enable you to utilize the Cloud Elasticity efficiently. However, the execution of this principle may not be as easy. If you already have an on-prem data lake, most likely it’s a Hadoop cluster that hosts your data lake on HDFS (Hadoop Distributed File System) and executes various use cases – from production ETL processes to unpredictable ad-hoc SQL queries run by your BI and data science teams.





Ready to get started?

Dremio Test Drive
Experience Dremio with sample data
The simplest way to try out Dremio.
Dremio Cloud
Open & fully-managed data lakehouse
Best Option if your data is on AWS. Forever Free Usage.

Dremio Software
Software for any environment
Download Dremio’s Community Edition