Dremio Blog: Open Data Insights
-
Dremio Blog: Open Data Insights
Apache Polaris 1.5.0: Deep-Dive Into the Future of Open Data Catalogs
The release of Apache Polaris 1.5.0 marks a significant step forward in the project's evolution. This release introduces enterprise-grade security integrations, expanded catalog federation, advanced credential vending, and key performance optimizations. -
Dremio Blog: Open Data InsightsAgentic Lakehouse Architecture: The Four Technical Layers
This composability is what makes the Agentic Lakehouse architecture viable long-term. As Iceberg V3 adoption grows and the Polaris REST Catalog becomes the universal standard for catalog interoperability, adding a new engine or a new AI framework to your stack becomes a configuration change, not a migration project. -
Dremio Blog: Open Data InsightsPerformance and Apache Iceberg’s Metadata
The single biggest performance advantage of Iceberg over raw data lakes is not a clever algorithm or a faster codec. It is metadata-driven data skipping. By the time a query engine begins scanning actual Parquet files, Iceberg's metadata has already eliminated 90-99% of the files from consideration. -
Dremio Blog: Open Data InsightsApache Iceberg V2 vs V3: What Changed and What It Means for Your Tables
Apache Iceberg V3 is a meaningful advancement over V2, not a version bump for its own sake. Deletion vectors address the fundamental I/O cost of merge-on-read that V2 delete file accumulation creates. The Variant type eliminates one of the most common workarounds in modern data pipelines: storing JSON as strings and parsing at query time. -
Dremio Blog: Open Data Insights
Migrate Delta Lake to Apache Iceberg: Step-by-Step Guide
The Iceberg ecosystem is consolidating fast. REST Catalog interoperability, growing AI tooling, and the Apache governance model mean that every month you stay on Delta Lake, you are working against the direction of the industry. The migration investment pays off in engine flexibility, catalog portability, and access to a growing set of tools that assume Iceberg as the standard. -
Dremio Blog: Open Data Insights
What’s New in Apache Iceberg 1.11.0
Apache Iceberg 1.11.0 delivers on two fronts. The File Format API is an architectural investment whose full payoff comes over the next year or two as new format plugins ship, but it also consolidates and cleans up the engine's internal format handling today. -
Dremio Blog: Open Data Insights
What is a model context protocol (MCP) server?
Learn what an MCP server is, how it works, and why it powers agentic AI, real-time data access, and scalable workflows for enterprises. -
Dremio Blog: Open Data Insights
Agentic Analytics vs Traditional BI Tools: What Do You Need for the Future?
From the original co-creators of Apache Polaris and Apache Arrow, Dremio is the only lakehouse that meets the needs of AI agents and humans through autonomous optimization, a unified semantic layer, and Zero-ETL federation. -
Dremio Blog: Open Data Insights
Definitive Guide to the Data Lakehouse
The data lakehouse resolves the core tradeoff that made the warehouse-vs-lake debate so frustrating. -
Dremio Blog: Open Data Insights
Semantic Layer 101
This guide explores what semantic layers are, their benefits and how they’re implemented within your enterprise data stack. -
Dremio Blog: Open Data Insights
The Metadata Structure of Modern Table Formats
The metadata structure of a table format determines everything: how fast queries start planning, how efficiently concurrent writes are handled, how schema changes propagate, and how much overhead accumulates over time. -
Dremio Blog: Open Data Insights
Apache Polaris: The Catalog Standard for Iceberg Lakehouses and Agentic Analytics
Polaris is production-ready today. Organizations are already using its RBAC, catalog federation, credential vending, Iceberg SQL views, and generic tables to govern multi-engine lakehouses at scale. -
Dremio Blog: Open Data Insights
What Are Table Formats and Why Were They Needed?
A table format is a specification that defines how to organize metadata about data files so that query engines can treat them as reliable, transactional tables. It sits between the query engine and the physical files. -
Dremio Blog: Open Data Insights
Semantic Layer: The Definitive Guide
The semantic layer is not a one-time project. It is a living system that grows with your organization's data needs. Start small, prove value on the metrics that matter most, and expand from there. -
Dremio Blog: Open Data Insights
What “Apache Iceberg Native” Actually Means
It is a great thing that so many platforms now support Apache Iceberg. More support means more flexibility for everyone. But if your intention is to make Iceberg your primary analytics format, then "supports Iceberg" and "built for Iceberg" lead to very different outcomes.
- « Previous Page
- 1
- 2
- 3
- 4
- …
- 14
- Next Page »