Mark Shainman

Mark Shainman Dremio Author & Contributor
Principal Product Marketing Manager

Mark Shainman is a Principal Product Marketing Manager for Dremio. He has  spent more than 20-years working in both the analytics as well as privacy, governance, and security space. He has worked with numerous data products and on numerous initiatives,  including database migrations, data warehousing, big data, SQL on Hadoop, data lakes, federated query access, data cataloging, privacy compliance and, now, the data lakehouse.

Mark Shainman's Articles and Resources

Blog Post

Hadoop Modernization on AWS with Dremio: The Path to Faster, Scalable, and Cost-Efficient Data Analytics

As businesses generate increasing volumes of data, the need for efficient, flexible, and cost-effective data management solutions has never been greater. Legacy Hadoop environments, though groundbreaking when first introduced, often struggle to keep up with the demands of modern data and analytic workloads. From high costs associated with licensing to the complexity of managing Hadoop […]

Read more ->

Blog Post

Adopting a Hybrid Lakehouse Strategy

Enterprises have revolutionized analytics by leveraging the cloud’s scalability and flexibility. Yet, despite the promise of the cloud, many organizations find that a cloud-only strategy doesn’t always meet their performance, cost, or governance expectations. As the complexities of multi-cloud and hybrid data environments grow, it’s time to consider a hybrid lakehouse strategy that combines the […]

Read more ->

Gnarly Data Waves Episode

Moving Past Hadoop to a Modern Data Platform with Pure Storage & Dremio

Discover how Dremio’s Hybrid Iceberg Lakehouse, paired with Pure Storage’s data platform, empowers your teams to accelerate access to insights, simplify data management, and reduce operational costs. Learn best practices for moving from Hadoop to a modern object storage based…
Read more ->

Blog Post

Maximizing Value: Lowering TCO and Accelerating Time to Insight with a Hybrid Iceberg Lakehouse

Organizations are constantly pressured to unlock data insights quickly and efficiently while controlling costs. However, as businesses amass ever-increasing volumes of data across multiple environments—both on-premises and in the cloud—they encounter significant challenges with traditional data architectures. A Hybrid Iceberg Lakehouse, such as the one offered by Dremio, delivers substantial Total Cost of Ownership (TCO) […]

Read more ->

Blog Post

Enabling AI Teams with AI-Ready Data: Dremio and the Hybrid Iceberg Lakehouse

Artificial Intelligence (AI) has become essential for modern enterprises, driving innovation across industries by transforming data into actionable insights. However, AI’s success depends heavily on having consistent, high-quality data readily available for experimentation and model development. It is estimated that data scientists spend 80+% of their time on data acquisition and preparation, compared to model […]

Read more ->

Blog Post

Accelerating Analytical Insight – The NetApp & Dremio Hybrid Iceberg Lakehouse Reference Architecture

Organizations are constantly seeking ways to optimize data management and analytics. The Dremio and NetApp Hybrid Iceberg Lakehouse Reference Architecture brings together Dremio’s Unified Lakehouse Platform and NetApp’s advanced data storage solutions to create a high-performance, scalable, and cost-efficient data lakehouse platform. With this solution combining NetApp’s advanced storage technologies with Dremio’s high-performance lakehouse platform, […]

Read more ->

Gnarly Data Waves Episode

What’s New in Dremio: Improved Automation, Performance + Catalog for Iceberg Lakehouses

Discover the new Dremio capabilities designed to make your Apache Iceberg data lakehouse the most efficient, scalable, and manageable platform for analytics and AI.  We’ll cover enhancements in performance, data ingestion, data processing, and federated query capabilities, aimed at helping…
Read more ->

Blog Post

What’s New in Dremio,  Enhanced Performance with Reflection improvements, Result Set Caching and Merge-on-Read. 

Dremio’s latest version sets a new standard in the overall performance for lakehouse platforms. This release underscores Dremio’s commitment to providing the most high performance Iceberg lakehouse platform, positioning it as the market’s premier lakehouse analytics platform. Reflection Enhancements  A Reflection In Dremio, is an optimized relational cache that takes advantage of the platform’s advanced […]

Read more ->

Blog Post

What’s New in Dremio, Accelerating Cross-Database Access Control and Workload Management with User Impersonation 

In today’s data-driven world, organizations are increasingly dealing with diverse data environments, encompassing cloud, multi-cloud, on-premises, and hybrid. Efficiently managing and querying data across these varied landscapes can be challenging, particularly when it comes to access control and workload management. Dremio has introduced significant improvements in query federation capabilities, simplifying data access and ensuring robust […]

Read more ->

Blog Post

What’s New in Dremio: Automatic Iceberg Data Ingestion with Auto Ingest Pipelines 

Dremio continues to innovate and enhance the capabilities of Data Lakehouse environments with its latest feature, Auto Ingest Pipelines for Iceberg tables. This cutting-edge functionality for both Dremio Enterprise Software and Dremio Cloud changes the way organizations handle data ingestion from Amazon S3 into Iceberg tables in  Lakehouse environments. What is Automatic Iceberg Data Ingestion? […]

Read more ->

Blog Post

What’s New in Dremio 25.1: Improved Performance, Data Ingestion, and Federated Access for Apache Iceberg Lakehouses

In today’s data-driven world, businesses face the constant challenge of managing and analyzing data across various environments—cloud, on-premises, and hybrid. With our latest release of Dremio 25.1, we continue to innovate and deliver features that enhance performance, streamline data ingestion, and improve federated query access. This release introduces improvements that collectively drive better performance, efficiency, […]

Read more ->

Blog Post

Modernizing Your Hadoop Infrastructure with Dremio and NetApp

IntroductionIn the era of big data, organizations are increasingly recognizing the limitations of traditional Hadoop infrastructures. As data volumes grow and analytics requirements become more complex, the need for a more agile, scalable, and cost-effective solution has never been greater. Enter the data lakehouse architecture—a modern approach combining the best data lakes and data warehouses. […]

Read more ->

Blog Post

Why Modernize Your Hadoop Data Lake with Dremio and MinIO?

Hadoop, once celebrated as a groundbreaking framework for processing massive datasets, has become increasingly burdensome for most enterprises. Despite its initial advantages, Hadoop is now recognized as slow, costly, and time-consuming to manage. While it was once cheap to provision, the tight coupling of compute and storage, combined with high maintenance costs and inefficiencies, has […]

Read more ->

Blog Post

Hybrid Iceberg Lakehouse Infrastructure Solutions: VAST Data

The data lakehouse is an architectural pattern that leverages storage layers like Hadoop or object storage as the center of gravity for your data. Using tools like Dremio, you can create a decoupled, modular data warehouse. The key component connecting platforms like Dremio to your data lake is a data lakehouse table format such as […]

Read more ->

Blog Post

On-Prem and Cloud: The Why of a Hybrid Iceberg Lakehouse

Part 1: The Challenge for Organizations Organizations must enable data users to leverage and gain insights from their data seamlessly. The goal is to drive business value through comprehensive data analysis, regardless of where the data resides: on-premises, in the cloud, or hybrid cloud environments. While there is a significant push towards cloud adoption, many […]

Read more ->

Blog Post

Why a Cyber Lakehouse? | Dremio & VAST Data: Transforming Cybersecurity

Over the years, cybersecurity capabilities have evolved from single-point solutions to comprehensive cyber data platforms utilizing advanced analytic-based technologies. With the exponential growth in the volume, variety, and complexity of cyber-relevant data, cybersecurity professionals must leverage cutting-edge data platform technologies to address their needs effectively and economically. In today’s digital age, virtually all data holds […]

Read more ->

Blog Post

Dremio vs. Starburst Data: The Truth of Why Companies Choose Dremio

Two prominent solutions have emerged in the on-prem, cloud, and hybrid-cloud lakehouse space: Dremio and Starburst Data. Both platforms offer unique features and benefits. On the surface, the platforms look fairly similar, with federated query capability, object store connectivity, SQL on Hadoop functionality, Iceberg support, and support for hybrid cloud environments. A deeper dive reveals […]

Read more ->

Blog Post

Advancing the Capabilities of the  Premier Data Lakehouse Platform for Apache Iceberg 

With the latest release of Dremio, 25.0 we are helping accelerate the adoption and benefits of Apache Iceberg, while  bringing  your users closer to the data with lakehouse flexibility, scalability and performance at a fraction of the cost.  We are excited to announce some of the new features that improve scalability, manageability, ease of use […]

Read more ->

Gnarly Data Waves Episode

What’s New in Dremio: New Capabilities for the Best Apache Iceberg Lakehouse

Join our upcoming webinar to discover the new Dremio capabilities designed to make your Apache Iceberg data lakehouse the most efficient, scalable, and manageable platform for analytics and AI.  We’ll cover enhancements for data ingestion, data processing, and data optimization…
Read more ->
1200x628_Gnarly Data Waves ep 42.1200x628_Gnarly Data Waves ep 42.

Gnarly Data Waves Episode

What’s new in Dremio: New Gen-AI capabilities, advances for 100% query success, plus now on Azure

Transcript Note: This transcript was created using speech recognition software. While it has been reviewed by human transcribers, it may contain errors. Opening Alex Merced: Well, with no further ado, let’s begin with today’s adventure, and we’re gonna be talking about what’s new in Dremio, which includes new generative AI capabilities, advances for 100% query […]

Learn what’s new in Dremio - and how you can accelerate self-service analytics at scale - including new Gen AI capabilities, Dremio Cloud SaaS on Microsoft Azure, advances to ensure 100% query reliability, and expanded Apache Iceberg capabilities to streamline…
Read more ->
get started

Get Started Free

No time limit - totally free - just the way you like it.

Sign Up Now
demo on demand

See Dremio in Action

Not ready to get started today? See the platform in action.

Watch Demo
talk expert

Talk to an Expert

Not sure where to start? Get your questions answered fast.

Contact Us

Ready to Get Started?

Enable the business to create and consume data products powered by Apache Iceberg, accelerating AI and analytics initiatives and dramatically reducing costs.