Alex Merced

Blog Post

Comparing Apache Iceberg to Other Data Lakehouse Solutions

GET A FREE COPY OF “Apache Iceberg: The Definitive Guide” ENROLL IN THE “Apache Iceberg Crash Course” The data lakehouse concept has emerged as a revolutionary solution, blending the best of data lakes and data warehouses. As organizations strive to harness the full potential of their data, choosing the right data lakehouse solution becomes crucial. […]

Read more ->

Gnarly Data Waves Episode

Apache Iceberg Lakehouse Crash Course – What is a Data Lakehouse and What is a Table Format?

"An Apache Iceberg Lakehouse Crash Course," a comprehensive webinar series designed to deepen your understanding of Apache Iceberg and its role in modern data lakehouse architectures. Over ten sessions, we'll cover everything from the basics of data lakehouses and table…

Read more ->

Gnarly Data Waves Episode

An Apache Iceberg Lakehouse Crash Course – Ingesting Data into Apache Iceberg with Apache Spark

Read more ->

Gnarly Data Waves Episode

An Apache Iceberg Lakehouse Crash Course – Ingesting Data into Apache Iceberg with Apache Spark

Read more ->

Gnarly Data Waves Episode

An Apache Iceberg Lakehouse Crash Course – Versioning with Apache Iceberg

Read more ->

Gnarly Data Waves Episode

An Apache Iceberg Lakehouse Crash Course – The Role of Apache Iceberg Catalogs

Read more ->

Gnarly Data Waves Episode

An Apache Iceberg Lakehouse Crash Course – Streaming with Apache Iceberg

Read more ->

Gnarly Data Waves Episode

An Apache Iceberg Lakehouse Crash Course – Optimizing Apache Iceberg Tables

Read more ->

Gnarly Data Waves Episode

An Apache Iceberg Lakehouse Crash Course – Understanding Apache Iceberg’s Partitioning Features

Read more ->

Gnarly Data Waves Episode

An Apache Iceberg Lakehouse Crash Course – The Read and Write Process for Apache Iceberg Tables

Read more ->

Gnarly Data Waves Episode

An Apache Iceberg Lakehouse Crash Course – The Architecture of Apache Iceberg, Apache Hudi, and Delta Lake

Dive into the architectural intricacies of leading table formats Apache Iceberg, Apache Hudi, and Delta Lake. This session will cover: - Core components and design principles of each table format - Comparison of features and use cases - How to…

Read more ->

Blog Post

Apache Iceberg Crash Course: What is a Data Lakehouse and a Table Format?

Apache Iceberg Education Resources: Welcome to the “Apache Iceberg Crash Course” blog series, which complements our webinar series of the same name. Each blog post in this series is designed to provide a comprehensive summary of the content covered in the corresponding webinar session. In this inaugural post, we will explore the fundamentals of Data […]

Read more ->

Gnarly Data Waves Episode

Build the next-generation Iceberg lakehouse with Dremio and NetApp

Transcript Note: This transcript was created using speech recognition software. While it has been reviewed by human transcribers, it may contain errors. Opening Alex Merced: Hey, everybody! This is Alex Merced, and welcome to another episode of Gnarly Data Waves presented to you by Dremio. As usual, here I am as your host, and today, […]

Join our upcoming webinar to explore the future of data lakes and discover how NetApp and Dremio can revolutionize your analytics by delivering the next-generation of lakehouse with Apache Iceberg.

Read more ->

Blog Post

The Unified Apache Iceberg Lakehouse: Self Service & Ease of Use

Data Mesh, Data Lakehouse, Data Fabric, Data Virtualization—there are many buzzwords describing ways to build your data platform. Regardless of the terminology, everyone seeks the same core features in their data platform: Many of these “Data X” concepts address different aspects of these goals. However, when you integrate solutions that cover all these needs, you […]

Read more ->

Blog Post

The Unified Lakehouse: Performant Data Access

Read more ->

Blog Post

The Unified Apache Iceberg Lakehouse: Unified Analytics

Read more ->

Blog Post

Enhancing your Snowflake Data Warehouse with the Dremio Lakehouse Platform

Snowflake is a popular data platform for its scalability, performance, and ease of use. It has revolutionized data warehousing by providing a fully managed service with built-in support for SQL and advanced analytics. Snowflake excels at handling large volumes of data, supporting complex queries, and integrating seamlessly with various data sources and tools. However, building […]

Read more ->

Blog Post

How Apache Iceberg is Built for Open Optimized Performance

Apache Iceberg is a table format designed for data lakehouses. While many people focus on how table formats enable database-like ACID transactions on data lakes—allowing them to function like data warehouses, or “data lakehouses”—there is another equally powerful aspect: the metadata provided by these formats. This metadata can be used to execute transactions with optimal […]

Read more ->

Blog Post

The Value of Dremio’s Semantic Layer and The Apache Iceberg Lakehouse to the Snowflake User

When it comes to processing data for analytics and AI, one of the most popular and ubiquitous platforms is Snowflake. It improves the lives of its users with a user-friendly platform that allows: Snowflake is widely popular for good reason. However, with the advent of the Apache Iceberg data lakehouse, even more possibilities emerge. Incorporating […]

Read more ->

Blog Post

What is Data Virtualization? What makes an Ideal Data Virtualization Platform?

Data virtualization allows you to interact with multiple data systems through a single interface, treating them as a unified system. This capability lets you view all datasets across various systems and execute queries that utilize data from multiple sources simultaneously. This reduces the traditional need to move data across multiple systems to eventually make use […]

Read more ->

Gnarly Data Waves Episode

Best of Subsurface 2024

The "Best of Subsurface 2024" webinar offers a comprehensive recap of the top moments from the Subsurface conference. Attendees will gain insights from industry leaders on data lakehouse implementations, open source advancements, and the future of data engineering.

Read more ->

Blog Post

The Nessie Ecosystem and the Reach of Git for Data for Apache Iceberg

Introduction The Data Lakehouse is rapidly emerging as the ideal data architecture, utilizing a single source of truth on your data lake. This is made possible by technologies like Apache Iceberg and Project Nessie. Apache Iceberg, a revolutionary table format, allows you to organize files on your data lake into database tables and execute efficient […]

Read more ->

Blog Post

The Who, What and Why of Data Reflections and Apache Iceberg for Query Acceleration

The quest for speed and efficiency is paramount, yet traditional approaches like materialized views, OLAP cubes, and BI extracts often fall short. While useful, these solutions can introduce complexities in maintenance, lead to increased storage costs, and suffer from latency issues that can hinder real-time decision-making. Recognizing these challenges, this article delves into an innovative […]

Read more ->

Blog Post

The Evolution of Apache Iceberg Catalogs

Apache Iceberg is a data lakehouse table format revolutionizing the data industry with unique features such as advanced partitioning, ACID guarantees, schema evolution, time travel, and more. Central to the functionality of Apache Iceberg tables is their catalog mechanism, which plays a crucial role in the evolution of how these tables are used and their […]

Read more ->

Subsurface Session

Demystifying Data Governance: How Dremio Enables Governed Data Sharing

Strong data governance policies and approaches are essential for data-driven organizations. Data governance helps ensure the quality and reliability of data to drive accurate decision-making. It establishes clear roles and responsibilities, reducing the risk of data misuse. And, data governance…

Read more ->

Subsurface Session

Unleashing Data Agility with Virtual Data Marts and ZeroETL: The End of ETL as We Know It

Traditional ETL processes bog down efficiency with complexity and cost. Learn how Dremio’s Virtual Data Marts simplify data pipeline creation and management. We’ll examine the challenges and costs associated with excessive data movement and talk about how Dremio’s approach to…

Read more ->

Subsurface Session

Best Practices for Building an Iceberg Data Lakehouse with Dremio

A data lakehouse combines the flexibility and scalability of the data lake with the data management, governance, and analytics of the data warehouse. Open table formats like Apache Iceberg make it possible to efficiently manage and leverage data while maintaining…

Read more ->

Subsurface Session

Next-Gen DataOps with Iceberg & Git for Data

Tomer Shiran, co-founder of Dremio, unveils a streamlined approach to DataOps that champions simplicity, data quality, and self-service, all essential for powering AI innovations. By integrating Apache Iceberg and Dremio’s open “git-for-data” model, the keynote showcases how Dremio transforms data…

Read more ->

Subsurface Session

Looking Forward-Apache Iceberg and the Iceberg Ecosystem

This panel discussion assembles experts from the Iceberg community and related projects to address the evolution and future directions of the Iceberg specification.”Looking Forward: Apache Iceberg and the Iceberg Ecosystem” aims to shed light on the latest developments, challenges, and…

Read more ->

Blog Post

Unifying Snowflake, Azure, AWS and Google Based Data Marketplaces and Data Sharing with Dremio

Data marketplaces have become invaluable resources for enriching internal data. These platforms offer a wealth of datasets that can enhance analytics and decision-making processes. However, a significant challenge arises as each marketplace typically requires you to access data from their specific storage solutions. This often necessitates moving data into their systems or transferring their data […]

Read more ->

Alex Merced's Articles and Resources

Blog Post

Comparing Apache Iceberg to Other Data Lakehouse Solutions

Gnarly Data Waves Episode

Apache Iceberg Lakehouse Crash Course – What is a Data Lakehouse and What is a Table Format?

Gnarly Data Waves Episode

An Apache Iceberg Lakehouse Crash Course – Ingesting Data into Apache Iceberg with Apache Spark

Gnarly Data Waves Episode

An Apache Iceberg Lakehouse Crash Course – Ingesting Data into Apache Iceberg with Apache Spark

Gnarly Data Waves Episode

An Apache Iceberg Lakehouse Crash Course – Versioning with Apache Iceberg

Gnarly Data Waves Episode

An Apache Iceberg Lakehouse Crash Course – The Role of Apache Iceberg Catalogs

Gnarly Data Waves Episode

An Apache Iceberg Lakehouse Crash Course – Streaming with Apache Iceberg

Gnarly Data Waves Episode

An Apache Iceberg Lakehouse Crash Course – Optimizing Apache Iceberg Tables

Gnarly Data Waves Episode

An Apache Iceberg Lakehouse Crash Course – Understanding Apache Iceberg’s Partitioning Features

Gnarly Data Waves Episode

An Apache Iceberg Lakehouse Crash Course – The Read and Write Process for Apache Iceberg Tables

Gnarly Data Waves Episode

An Apache Iceberg Lakehouse Crash Course – The Architecture of Apache Iceberg, Apache Hudi, and Delta Lake

Blog Post

Apache Iceberg Crash Course: What is a Data Lakehouse and a Table Format?

Gnarly Data Waves Episode

Build the next-generation Iceberg lakehouse with Dremio and NetApp

Blog Post

The Unified Apache Iceberg Lakehouse: Self Service & Ease of Use

Blog Post

The Unified Lakehouse: Performant Data Access

Blog Post

The Unified Apache Iceberg Lakehouse: Unified Analytics

Blog Post

Enhancing your Snowflake Data Warehouse with the Dremio Lakehouse Platform

Blog Post

How Apache Iceberg is Built for Open Optimized Performance

Blog Post

The Value of Dremio’s Semantic Layer and The Apache Iceberg Lakehouse to the Snowflake User

Blog Post

What is Data Virtualization? What makes an Ideal Data Virtualization Platform?

Gnarly Data Waves Episode

Best of Subsurface 2024

Blog Post

The Nessie Ecosystem and the Reach of Git for Data for Apache Iceberg

Blog Post

The Who, What and Why of Data Reflections and Apache Iceberg for Query Acceleration

Blog Post

The Evolution of Apache Iceberg Catalogs

Subsurface Session

Demystifying Data Governance: How Dremio Enables Governed Data Sharing

Subsurface Session

Unleashing Data Agility with Virtual Data Marts and ZeroETL: The End of ETL as We Know It

Subsurface Session

Best Practices for Building an Iceberg Data Lakehouse with Dremio

Subsurface Session

Next-Gen DataOps with Iceberg & Git for Data

Subsurface Session

Looking Forward-Apache Iceberg and the Iceberg Ecosystem

Blog Post

Unifying Snowflake, Azure, AWS and Google Based Data Marketplaces and Data Sharing with Dremio

Get Started Free

See Dremio in Action

Talk to an Expert

Ready to Get Started?