Mark Shainman

Mark Shainman Dremio Author & Contributor
Principal Product Marketing Manager

Mark Shainman is a Principal Product Marketing Manager for Dremio. He has  spent more than 20-years working in both the analytics as well as privacy, governance, and security space. He has worked with numerous data products and on numerous initiatives,  including database migrations, data warehousing, big data, SQL on Hadoop, data lakes, federated query access, data cataloging, privacy compliance and, now, the data lakehouse.

Mark Shainman's Articles and Resources

Blog Post

Dremio Advances the Modern Iceberg Lakehouse with Iceberg V3 Support

For years, the promise of the open lakehouse was simple: store your data once, query it with any tool, and never get locked into a single vendor’s ecosystem. Apache Iceberg made that promise real. It became the industry-standard table format because it worked, it was open, and it kept getting better. Iceberg version 3 (V3) […]

Read more ->

Blog Post

The Lakehouse Is the Modern Data Warehouse

The data warehouse is not a product. It never was. It is an architectural intent, a set of goals that organizations have pursued for nearly four decades. And like every architectural intent, the technology that delivers on it must evolve as the demands of the era change. For a generation of data leaders, the data […]

Read more ->

Blog Post

How Dremio Cloud Secures the Agentic Lakehouse: Capabilities and Certifications

Every time you move data between a lake, a warehouse, and a downstream application, you multiply your security risk. Leaving data in place is the safest approach, but historically, accessing it directly meant sacrificing performance and governance. The agentic lakehouse pattern changes that by allowing high-speed, governed analytics directly on your storage. Dremio Cloud is […]

Read more ->

Blog Post

One Click with Dremio’s Claude Connector Using MCP

Most AI assistants fail at data analysis because they lack business context. You can build the most complex retrieval pipelines imaginable, but without a semantic layer and direct access to your data, large language models guess metrics and hallucinate schemas. Today, we fix that. You can now connect Claude to Dremio as your dedicated data […]

Read more ->

Blog Post

Claude + Dremio: Instantly Connect and Start Getting Insights

Today we’re making it super simple to use Claude, Claude Cowork, and Claude Code with Dremio.  The new Claude connector for Dremio Cloud lets anyone instantly connect Anthropic’s Claude to their lakehouse.   Open Claude, connect to Dremio, and start asking questions in plain English. That’s it. No custom integrations, no middleware, no project setup. When […]

Read more ->

Blog Post

Federated Semantics: The Foundation Enterprise Agentic AI Actually Needs

Most enterprise AI projects fail at the same place. Not the model. Not the infrastructure. The data. Specifically: the AI doesn’t understand the data well enough to act on it accurately, and it can’t reach most of the data anyway. These two problems compound each other. A model that can only see 30% of your […]

Read more ->

Blog Post

How Apache Polaris Powers Dremio’s Open Catalog

Apache Polaris provides the open foundation, but enterprises running enterprise-wide, AI-driven platforms need more than a standards-compliant catalog service. They need enterprise governance, automation, and business context that AI systems can reason over. As original co-creator of Apache Polaris and one of the project’s largest contributors, Dremio uses Polaris as the foundation of Dremio’s Open […]

Read more ->

Blog Post

Apache Polaris Graduates to a Top-Level Apache Project

Apache Polaris is now a Top-Level Project at the Apache Software Foundation. For anyone building on Apache Iceberg, this is one of the most important catalog milestones since the REST Catalog spec itself. Graduation means Polaris has cleared Apache’s bar for contributor diversity, governance maturity, and long-term sustainability. It’s no longer incubating. It’s production-grade open […]

Read more ->

Blog Post

Driving Open Source and Open Standard Innovation at Dremio

Dremio is a commercial platform, and we’re straightforward about that. But the standards and projects that power it are genuinely open, and Dremio has been an active contributor to building them, not just consuming them. Apache Arrow, Apache Iceberg, and Apache Polaris all have Dremio fingerprints on their design, specification, and governance. That work matters […]

Read more ->

Blog Post

What Forrester’s 2026 Data Lakehouse Landscape Signals About the Market — And Where Dremio Fits

The data lakehouse market has moved past the “is this real?” phase. According to Forrester’s The Data Lakehouses Landscape, Q1 2026, lakehouses are now the default architectural choice for modern analytics and AI workloads, driven by the need to simplify data architectures, control costs, and support modern analytics and AI workloads. For technology leaders navigating […]

Read more ->

Blog Post

Dremio’s latest release delivers AI-Driven Intelligence with the Agentic Lakehouse

Companies are racing to operationalize agentic AI, yet the final process of getting from data to decision is extremely difficult,  requiring data integration, tuning, and governance management. With Dremio’s latest release, we remove these blockers by putting natural‑language intelligence, explainability, and self‑optimizing performance directly into the lakehouse experience. You get clarity and control without copies, […]

Read more ->

Blog Post

Transform Studio: Free, Open Source Pipelines and Data Quality for the Dremio Community

Key Takeaways Built for the Dremio Community Transform Studio for Dremio, is a free and open source pipeline builder and data quality tool built for the Dremio community. Whether you run Dremio Cloud or a self-hosted deployment, Transform Studio gives analysts and data engineers a visual workspace to build transformation pipelines, monitor data quality, and […]

Read more ->

Blog Post

Dremio vs. Redshift: The Cost Advantage of the Dremio Agentic Lakehouse

The New Economics of Data Cloud data warehouses like Amazon Redshift were built for a world that no longer exists. In that earlier era, organizations focused primarily on structured business intelligence, static dashboards, and predictable workloads. Data was tightly controlled, compute resources were fixed, and dynamic scalability for rapidly changing workloads was not a concern. […]

Read more ->

Blog Post

Why Companies Are Migrating from Redshift to Dremio

Companies today are under constant pressure to deliver faster insights, support advanced analytics, and enable AI-driven innovation. Many organizations chose  Amazon Redshift as their cloud data warehouse. However, as data volumes grow and workloads change, Redshift’s legacy warehouse architecture is not meeting their needs—driving many organizations to consider alternatives. Dremio’s intelligent lakehouse platform: a modern, […]

Read more ->

Blog Post

 Dremio’s Leading the Way in Active Data Architecture 

Modern data teams are under pressure to deliver faster insights, support AI initiatives, and reduce architectural complexity. To meet these demands, more organizations are adopting active data architectures—frameworks that unify access, governance, and real-time analytics across hybrid environments. In the newly released Dresner 2025 Active Data Architecture Report, Dremio was ranked #1—recognized as a top […]

Read more ->

Blog Post

Accelerate Insights While Reducing TCO with An Intelligent Lakehouse Platform

Enterprises today face increasing pressure to extract insights from data quickly while controlling spend. Yet, as data volumes explode across cloud and on-prem environments, traditional architectures often fall short—resulting in higher costs, rigid pipelines, and slower decision-making. The Dremio Intelligent Lakehouse Platform addresses these challenges by delivering faster insights and significant total cost of ownership […]

Read more ->

Blog Post

Enabling companies with AI-Ready Data: Dremio and the Intelligent Lakehouse Platform

Artificial Intelligence (AI) has become essential for modern enterprises, driving innovation across industries by transforming data into actionable insights. However, AI’s success depends heavily on having consistent, high-quality data readily available for experimentation and model development. It is estimated that data scientists spend 80+% of their time on data acquisition and preparation, compared to model […]

Read more ->
Thumbnail image for a Dremio blog post with graphics related to data analytics.Thumbnail image for a Dremio blog post with graphics related to data analytics.

Blog Post

Hadoop Modernization on AWS with Dremio: The Path to Faster, Scalable, and Cost-Efficient Data Analytics

As businesses generate increasing volumes of data, the need for efficient, flexible, and cost-effective data management solutions has never been greater. Legacy Hadoop environments, though groundbreaking when first introduced, often struggle to keep up with the demands of modern data and analytic workloads. From high costs associated with licensing to the complexity of managing Hadoop […]

Read more ->

Blog Post

Adopting a Hybrid Lakehouse Strategy

Enterprises have revolutionized analytics by leveraging the cloud’s scalability and flexibility. Yet, despite the promise of the cloud, many organizations find that a cloud-only strategy doesn’t always meet their performance, cost, or governance expectations. As the complexities of multi-cloud and hybrid data environments grow, it’s time to consider a hybrid lakehouse strategy that combines the […]

Read more ->

Gnarly Data Waves Episode

Moving Past Hadoop to a Modern Data Platform with Pure Storage & Dremio

Discover how Dremio’s Hybrid Iceberg Lakehouse, paired with Pure Storage’s data platform, empowers your teams to accelerate access to insights, simplify data management, and reduce operational costs. Learn best practices for moving from Hadoop to a modern object storage based…
Read more ->

Blog Post

Maximizing Value: Lowering TCO and Accelerating Time to Insight with a Hybrid Iceberg Lakehouse

Organizations are constantly pressured to unlock data insights quickly and efficiently while controlling costs. However, as businesses amass ever-increasing volumes of data across multiple environments—both on-premises and in the cloud—they encounter significant challenges with traditional data architectures. A Hybrid Iceberg Lakehouse, such as the one offered by Dremio, delivers substantial Total Cost of Ownership (TCO) […]

Read more ->

Blog Post

Enabling AI Teams with AI-Ready Data: Dremio and the Hybrid Iceberg Lakehouse

Artificial Intelligence (AI) has become essential for modern enterprises, driving innovation across industries by transforming data into actionable insights. However, AI’s success depends heavily on having consistent, high-quality data readily available for experimentation and model development. It is estimated that data scientists spend 80+% of their time on data acquisition and preparation, compared to model […]

Read more ->

Blog Post

Accelerating Analytical Insight – The NetApp & Dremio Hybrid Iceberg Lakehouse Reference Architecture

Organizations are constantly seeking ways to optimize data management and analytics. The Dremio and NetApp Hybrid Iceberg Lakehouse Reference Architecture brings together Dremio’s Unified Lakehouse Platform and NetApp’s advanced data storage solutions to create a high-performance, scalable, and cost-efficient data lakehouse platform. With this solution combining NetApp’s advanced storage technologies with Dremio’s high-performance lakehouse platform, […]

Read more ->

Gnarly Data Waves Episode

What’s New in Dremio: Improved Automation, Performance + Catalog for Iceberg Lakehouses

Discover the new Dremio capabilities designed to make your Apache Iceberg data lakehouse the most efficient, scalable, and manageable platform for analytics and AI.  We’ll cover enhancements in performance, data ingestion, data processing, and federated query capabilities, aimed at helping…
Read more ->

Blog Post

What’s New in Dremio, Enhanced Performance with Reflection improvements, Result Set Caching and Merge-on-Read.

Dremio’s latest version sets a new standard in the overall performance for lakehouse platforms. This release underscores Dremio’s commitment to providing the most high performance Iceberg lakehouse platform, positioning it as the market’s premier lakehouse analytics platform. Reflection Enhancements  A Reflection In Dremio, is an optimized relational cache that takes advantage of the platform’s advanced […]

Read more ->

Blog Post

What’s New in Dremio, Accelerating Cross-Database Access Control and Workload Management with User Impersonation

In today’s data-driven world, organizations are increasingly dealing with diverse data environments, encompassing cloud, multi-cloud, on-premises, and hybrid. Efficiently managing and querying data across these varied landscapes can be challenging, particularly when it comes to access control and workload management. Dremio has introduced significant improvements in query federation capabilities, simplifying data access and ensuring robust […]

Read more ->

Blog Post

What’s New in Dremio: Automatic Iceberg Data Ingestion with Auto Ingest Pipelines

Dremio continues to innovate and enhance the capabilities of Data Lakehouse environments with its latest feature, Auto Ingest Pipelines for Iceberg tables. This cutting-edge functionality for both Dremio Enterprise Software and Dremio Cloud changes the way organizations handle data ingestion from Amazon S3 into Iceberg tables in  Lakehouse environments. What is Automatic Iceberg Data Ingestion? […]

Read more ->

Blog Post

What’s New in Dremio 25.1: Improved Performance, Data Ingestion, and Federated Access for Apache Iceberg Lakehouses

In today’s data-driven world, businesses face the constant challenge of managing and analyzing data across various environments—cloud, on-premises, and hybrid. With our latest release of Dremio 25.1, we continue to innovate and deliver features that enhance performance, streamline data ingestion, and improve federated query access. This release introduces improvements that collectively drive better performance, efficiency, […]

Read more ->

Blog Post

Modernizing Your Hadoop Infrastructure with Dremio and NetApp

IntroductionIn the era of big data, organizations are increasingly recognizing the limitations of traditional Hadoop infrastructures. As data volumes grow and analytics requirements become more complex, the need for a more agile, scalable, and cost-effective solution has never been greater. Enter the data lakehouse architecture—a modern approach combining the best data lakes and data warehouses. […]

Read more ->
get started

Get Started Free

No time limit - totally free - just the way you like it.

Sign Up Now
demo on demand

See Dremio in Action

Not ready to get started today? See the platform in action.

Watch Demo
talk expert

Talk to an Expert

Not sure where to start? Get your questions answered fast.

Contact Us

Make data engineers and analysts 10x more productive

Boost efficiency with AI-powered agents, faster coding for engineers, instant insights for analysts.