Dremio Blog: Various Insights
-
Dremio Blog: Various Insights
What Is Apache Arrow?
Over the past few decades, databases and data analysis have changed dramatically. With these trends in mind, a clear opportunity emerged for a standard in-memory representation that every engine can use—one that’s modern, takes advantage of all the new performance strategies that are available, and makes sharing of data across platforms seamless and efficient. This […] -
Dremio Blog: Various Insights
Demystifying Cloud Data Lakes: A Comprehensive Guide
A cloud data lake is a cloud-hosted centralized repository that allows you to store all your structured and unstructured data at any scale, typically using an object store such as Amazon S3 or Microsoft Azure Data Lake Storage (ADLS). Its placement in the cloud means it can be interacted with as needed, whether it’s for […] -
Dremio Blog: Various Insights
Azure Storage Types and Use Cases
Azure Storage Types Azure Storage is a Microsoft-managed cloud service that provides storage that is highly available, secure, durable, scalable and redundant. Whether it is images, audio, video, logs, configuration files, or sensor data from an IoT array, data needs to be stored in a way that can be easily accessible for analysis purposes, and […] -
Dremio Blog: Various Insights
What Is Apache Iceberg?
Background on Data Within Data Lake Storage Data lakes are large repositories that store all structured and unstructured data at any scale. They are used to simplify data management by centralizing data and enabling all applications throughout an organization to interact on a shared data repository for all processing, analytics and reporting, significantly improving upon […] -
Dremio Blog: Various Insights
Nessie: Git for Data Lakes
The Rise of Data Lake Storage For decades organizations relied on relational databases, and later enterprise data warehouses, to organize and store corporate data. These systems provided a strong structural model to organize data as well as data consistency and reliability guarantees. However, these aspects were achieved by vertically integrated technology designs that were isolated […] -
Dremio Blog: Various Insights
What is a Data Lake?
A data lake is a centralized repository that allows you to store all of your structured and unstructured data at any scale. In the past, when disk storage was expensive, and data was costly and time-consuming to gather, enterprises needed to be discerning about what data to collect and store. Organizations would carefully design databases and data […] -
Dremio Blog: Various Insights
Data Lake vs Warehouse: Dremio Insights
While data lakes and data warehouses are conceptually different in terms of their design and implementation, they have at least a few things in common: However, this is usually where the similarities end. Before comparing data warehouses and data lakes, it is useful first to explain what we mean by data warehousing. What Is a Data Warehouse? Data warehouses […] -
Dremio Blog: Various Insights
Data Lakes
Data fuels the modern enterprise — today more than ever, businesses compete on their ability to turn big data into essential business insights. Increasingly, enterprises leverage data lakes as the platform used to store data for analytical purposes, combined with various compute engines for processing that data. What Is a Data Lake? A data lake […]