Alex Merced

Developer Advocate, Dremio

Alex Merced is a developer advocate for Dremio and has worked as a developer and instructor for companies like GenEd Systems, Crossfield Digital, CampusGuard and General Assembly. Alex is passionate about technology and has published tech content through blogs, videos and his podcasts Datanation and Web Dev 101. He has contributed a variety of libraries in the JavaScript and Python worlds, including SencilloDB, CoquitoJS, dremio-simple-query and more.

Alex Merced's Articles and Resources

Gnarly Data Waves Episode

Building a Data Science Platform on Apache Iceberg and Nessie

Discover the future of data science and machine learning pipelines with Jacopo Tagliabue of Bauplan Labs in this webinar. Learn why modern data platforms are embracing Apache Iceberg and Nessie, and explore the transformative benefits of Nessie's git-like features for…
Read more ->

Blog Post

Intro to Dremio, Nessie, and Apache Iceberg on Your Laptop

We’re always looking for ways to better handle and save money on our data. That’s why the “data lakehouse” is becoming so popular. It offers a mix of the flexibility of data lakes and the ease of use and performance of data warehouses. The goal? Make data handling easier and cheaper. So, how do we […]

Read more ->

Blog Post

Exploring the Architecture of Apache Iceberg, Delta Lake, and Apache Hudi

In the age of data-centric applications, storing, accessing, and managing data can significantly influence an organization’s ability to derive value from a data lakehouse. At the heart of this conversation are data lakehouse table formats, which are metadata layers that allow tools to interact with data lake storage like a traditional database. But why do […]

Read more ->

Blog Post

How to Create a Lakehouse with Airbyte, S3, Apache Iceberg, and Dremio

In the ever-evolving data landscape, the need for robust and scalable data storage solutions is growing exponentially. The essence of data-driven decisions lies in the capability to harness vast amounts of structured and unstructured data from various sources, process them, and prepare them for analysis. In this realm, the concept of the “lakehouse” has emerged […]

Read more ->

Gnarly Data Waves Episode

The Who, What and Why of Data Lakehouse Table Formats

Dive into the transformative world of Data Lakehouse table formats, exploring Apache Iceberg, Delta Lake, and Apache Hudi. Learn their pivotal roles in reshaping data storage, analytics, and the unparalleled advantages they offer.
Read more ->

Gnarly Data Waves Episode

ELT, ETL and the Dremio Data Lakehouse

Transcript Note: This transcript was created using speech recognition software. While it has been reviewed by human transcribers, it may contain errors. Opening Alex Merced: Hey, everybody. This is Alex Merced, developer advocate here at Dremio, and your host here every week on Gnarly Data Waves. This week, I will be presenting about ETL, ELT, and […]

Unlock the potential of data engineering in our "ELT, ETL & the Dremio Data Lakehouse" webinar! Discover how Dremio's no-copy architecture revolutionizes ETL & ELT patterns, optimizing data processing and cutting costs.
Read more ->

Blog Post

5 Use Cases for the Dremio Lakehouse

The Dremio Data Lakehouse has emerged as a game-changing solution in data analytics, combining the best of data lakes and data warehouses into a unified architecture. With its versatile capabilities, Dremio opens up a world of possibilities for organizations across various use cases in the realm of either modernizing or upgrading their current data systems, […]

Read more ->

Blog Post

10 Data Quality Checks in SQL, Pandas and Polars

In the era of big data and analytics, the quality of data plays a critical role in making informed decisions and extracting meaningful insights. However, ensuring data quality can be complex, requiring thorough checks and validations. In this blog article, we explore 10 essential data quality checks using three powerful tools: SQL, Pandas, and Polars. […]

Read more ->

Blog Post

Using Flink with Apache Iceberg and Nessie

Flink is a supercharged tool for processing data in real-time or in batches. It’s open source and has a unified programming model, so you can build some serious data processing pipelines. But here’s where things get interesting. When you bring Apache Iceberg and Project Nessie into the mix, Flink becomes even more awesome. Iceberg is […]

Read more ->

Gnarly Data Waves Episode

Unlock the Potential of Data Analytics with Dremio and DuckDB

Learn how to use cutting-edge technologies like Dremio and DuckDB together to push your data lakehouse to new heights.
Read more ->

Gnarly Data Waves Episode

Apache Iceberg Office Hours

Get all your Apache Iceberg questions answered at Apache Iceberg office hours. Questions on architecture, migration and anything else are welcome!
Read more ->

Gnarly Data Waves Episode

Versioning Data in the Data Lakehouse (File, Table and Catalog Versioning)

Learn about the benefits of versioning in integrating and deploying new data, and the differences between file, table and catalog versioning.
Read more ->

Blog Post

How to Convert JSON Files Into an Apache Iceberg Table with Dremio

Apache Iceberg is an open table format that enables robust, affordable, and quick analytics on the data lakehouse and is poised to change the data industry in ways we can only begin to imagine. Check out our Apache Iceberg 101 course to learn all the nuts and bolts about Iceberg. By storing your data in […]

Read more ->

Blog Post

Deep Dive Into Configuring Your Apache Iceberg Catalog with Apache Spark

Apache Iceberg is a data lakehouse table format that has been taking the data world by storm with robust support from tools like Dremio, Fivetran, Airbyte, AWS, Snowflake, Tabular, Presto, Apache Flink, Apache Spark, Trino, and so many more. Although one of the tools most data professionals use is Apache Spark and many introductory tutorials […]

Read more ->

Blog Post

Using Generative AI as a Data Engineer

Data engineering is an essential part of data science and analytics, as it involves transforming raw data into a usable form. With the rapid advancement of generative AI, it is becoming increasingly important for data engineers to know its capabilities and potential implications. Generative AI is a type of artificial intelligence (AI) used to create […]

Read more ->

Gnarly Data Waves Episode

Dremio and Data Lakehouse Table Formats (Apache Iceberg, Delta Lake and Apache Hudi & Dremio)

Learn how Dremio’s Lakehouse Platform can work with different lakehouse table formats in this presentation.
Read more ->

Blog Post

3 Ways to Use Python with Apache Iceberg

Apache Iceberg is a data lake table format that is quickly growing its adoption across the data space. If you want to become more familiar with Apache Iceberg, check out this Apache Iceberg 101 article with everything you need to go from zero to hero. If you are a data engineer, data analyst, or data […]

Read more ->

Blog Post

Using DuckDB with Your Dremio Data Lakehouse

Storage, compute, and regulatory costs can really add up when it comes to working with and managing your data. In traditional proprietary data warehouses, you must store your data in proprietary formats, organized in proprietary catalogs, to be queried with a proprietary engine. The result is vendor lock-in, which over time allows vendors to price […]

Read more ->

Blog Post

3 Ways to Convert a Delta Lake Table Into an Apache Iceberg Table

What’s a Table Format? One of the significant trends in data architecture is the idea of the data lakehouse, which combines the benefits of the data lake and the data warehouse, as exemplified by the following image: The centerpiece of this architecture is the table format, a metadata layer on top of your data lake […]

Read more ->

Blog Post

How to Convert CSV Files into an Apache Iceberg table with Dremio

Apache Iceberg is an open table format that enables robust, affordable, and quick analytics on the data lakehouse and is poised to change the data industry in ways we can only begin to imagine. Check out our Apache Iceberg 101 course to learn all the nuts and bolts about Iceberg. The bottom line: Converting your […]

Read more ->

Gnarly Data Waves Episode

How to Modernize Hive to the Data Lakehouse with Dremio and Apache Iceberg

We all want to overcome the many challenges we find with data drift, infrastructure costs and performance. In this talk we’ll discuss the path to taking your Hadoop-based data lake and using Dremio and Apache Iceberg to modernize it…
Read more ->

Gnarly Data Waves Episode

Apache Iceberg Office Hours

Get all your Apache Iceberg questions answered at Apache Iceberg office hours. Questions on architecture, migration and anything else are welcome!
Read more ->

Subsurface Session

How to Migrate Your Tables to Apache Iceberg

An Iceberg-based data lakehouse has several benefits, not just in query performance and cost but also in scalability, consistency and more. Migrating your existing data lake or data lakehouse to Iceberg doesn’t have to be difficult, but there are some…
Read more ->

Subsurface Session

CI/CD on the Lakehouse: Making Data Changes and Repair Safe and Easy

In software development, developers use Continuous Integration and Continuous Deployment to automate the integration of new code safely and quickly. A challenge in the data world is often integrating new data ingested from batches and streams with…
Read more ->

Gnarly Data Waves Episode

Apache Iceberg Office Hours

Get all your Apache Iceberg questions answered at Apache Iceberg office hours. Questions on architecture, migration and anything else are welcome!
Read more ->

Gnarly Data Waves Episode

Best Practices for Optimizing Tableau Dashboards with Dremio

Join Nick Brisoux, Senior Director of Product Management at Tableau, and Brett Roberts, Principal Alliances Solutions Architect at Dremio, to learn how Dremio helps Tableau users accelerate access to data, including cloud data lakes, and how Dremio can dramatically improve…
Read more ->

Gnarly Data Waves Episode

Migrating from Delta Lake to Iceberg

Iceberg has been gaining wide adoption in the industry as the de facto open standard for data lakehouse table formats. Join Dremio Developer Advocate Alex Merced as we help you learn the options and strategies you can employ when migrating…
Read more ->

Blog Post

Connecting Tableau to Apache Iceberg Tables with Dremio

The Apache Iceberg format has taken the data lakehouse world by storm, becoming the keystone pillar of many firms’ data infrastructure. This article shows you how you can connect your Apache Iceberg tables to tools like Tableau so you can generate BI dashboards directly from your Iceberg tables without the need for cubes or extracts. […]

Read more ->

Blog Post

Getting Started with Project Nessie, Apache Iceberg, and Apache Spark Using Docker

In today’s modern data lakes, you work with a separation of data and metadata with open table formats like Apache Iceberg giving you vastly improved query performance, the ability to time-travel, evolve your table’s partitions/schema, and much more. Open table formats rely on metadata catalogs to track where the metadata lives so engines can access […]

Read more ->

Blog Post

Apache Iceberg FAQ

Answers to many of the most common questions about Apache Iceberg.

Read more ->
