oct 14 – nov 13, 2025
Paris / Nuremberg / London / San Francisco / New York City
Oct 14
Oct 16
Oct 29
Nov 6
Nov 13
Saved the date
Subsurface, proudly presented by Dremio, is back—and bigger than ever.
This year’s event shines a spotlight on the latest innovations in the open data lakehouse ecosystem, with real-world use cases that bring technical depth and practical insights together. Subsurface is the global stage for data pioneers, blending cutting-edge research with hybrid lakehouse strategies that are shaping the future of data.
Over 18,000 data engineers, architects, and scientists from around the world have joined us at past events. Our speaker lineup has included experts from industry leaders like Apple, Netflix, Lyft, LinkedIn, TransUnion, Uber, Marsh McLennan, Adobe, AWS, Microsoft, Shell, and Wayfair—as well as emerging players like SpiceAI and Forcemetrics. We've also featured the original creators behind Apache Arrow, Apache Iceberg, Apache Parquet, Project Nessie, Pandas, and more.
This year, we’re bringing even more Dremio-focused sessions—offering a closer look at its architecture, performance gains, and role in the modern data stack.
Best of all, it’s free—and coming to a city near you. Don’t miss your chance to connect with peers, learn from industry leaders, and be part of the future of data.
Tuesday, october 14
Thurday, october 16
Wednesday, october 29
Thursday, November 6
Wednesday, November 13
Check out some of our past speakers!
Learn
Master the fundamentals that are transforming enterprise data:
Dive deep into the technology that's making traditional data warehouses obsolete. At Subsurface, you'll uncover how Apache Arrow-powered columnar processing delivers 20X faster performance than legacy systems, while autonomous optimization eliminates manual tuning.
Perfect for:
Data architects, engineers, and technical leaders ready to future-proof their data infrastructure.
What you'll learn:
Grow
Transform your expertise and drive measurable business results:
Position yourself at the forefront of the data revolution. Subsurface isn't just about learning new technology—it's about gaining the strategic insights and hands-on skills that will accelerate your career and deliver transformational results for your organization.
Perfect for:
Ambitious data professionals, consultants, and business leaders looking to lead their organization's AI transformation.
How you'll grow:
Engage
Join an exclusive community of data innovators:
Subsurface brings together the most forward-thinking data leaders, customers, and innovators in an intimate setting designed for meaningful connections. Whether you're solving complex data challenges or exploring new AI opportunities, you'll find your tribe here.
Perfect for:
Existing customers, strategic partners, and select prospects ready to be part of shaping the intelligent lakehouse ecosystem.
How you'll engage:
Tuesday, October 14, 2025
Mandarin Oriental Lutetia, Paris, 47 Bd Raspail, 75006 Paris, France
Enterprises are moving beyond AI experiments and copilots for individual productivity toward the next stage: the Agentic Enterprise, where AI agents drive insights, decisions, and workflows at scale. In this session, Sendur Sellakumar, CEO of Dremio, will outline the enterprise AI maturity journey and show why most organizations are still stuck between prototypes and production outcomes. Through live demos and real-world use cases, you will see how Dremio’s open lakehouse platform provides the intelligent data foundation needed to advance. It delivers semantic consistency, zero ETL federation, autonomous optimization, and built-in governance. Learn how organizations can progress from isolated AI pilots to enterprise-wide impact, and why rethinking data architecture is essential for making AI deliver real business value.
Join Julien Deschamps, Data Product Manager, and Carlos Lopez Hernandez, Product Owner of Dremio at BNP Paribas Personal Finance, as they share how their team is transforming the way data is accessed and consolidated. In this session, you’ll hear how BNP Paribas Personal Finance leverages Dremio to simplify data integration, accelerate insights, and empower business stakeholders with trusted, scalable access to data—without the complexity of traditional consolidation approaches.
As digital sovereignty becomes a critical priority for public organizations, Urssaf Caisse Nationale has launched an ambitious initiative with the creation of its Datafabrique. This program is designed to build a sovereign data architecture that ensures security, quality, and control, while also enabling innovation and agility.
This session will share a concrete case study, highlighting the main challenges faced, the strategic decisions made, and the key lessons learned throughout the transformation.
Open source is at the heart of the modern data lakehouse, shaping how organisations achieve flexibility, interoperability, and performance at scale. This panel brings together leading community members from Apache Iceberg, Apache Arrow, and Apache Polaris to explore the present and future of Lakehouse OSS. Panelists will discuss how these projects are evolving, the challenges they’re addressing, and the opportunities they unlock for building truly open architectures. Join us for an engaging conversation on how open source innovation continues to push the boundaries of what’s possible in data platforms.
Join us for The Dremio Iceberg and Agentic AI Experience (Workshop)—a hands-on session designed to show how Dremio simplifies building and managing Apache Iceberg lakehouses while unlocking new frontiers in AI. Through guided exercises, you’ll learn how to quickly implement and operate Iceberg tables in Dremio’s open lakehouse platform, making your data more accessible, governed, and ready for analytics at scale. The workshop will also introduce Dremio’s MCP (Model Context Protocol) server and demonstrate how it enables agentic AI to seamlessly query, reason over, and act on your data. Whether you’re a data engineer, architect, or AI practitioner, this interactive experience will give you practical skills for unifying your data in Iceberg and empowering intelligent agents to put it to work.
Thursday, October 16, 2025
Everyman Broadgate Cinema. 1 Finsbury Ave, London EC2M 2PF
Enterprises are moving beyond AI experiments and copilots for individual productivity toward the next stage: the Agentic Enterprise, where AI agents drive insights, decisions, and workflows at scale. In this session, Sendur Sellakumar, CEO of Dremio, will outline the enterprise AI maturity journey and show why most organizations are still stuck between prototypes and production outcomes. Through live demos and real-world use cases, you will see how Dremio’s open lakehouse platform provides the intelligent data foundation needed to advance. It delivers semantic consistency, zero ETL federation, autonomous optimization, and built-in governance. Learn how organizations can progress from isolated AI pilots to enterprise-wide impact, and why rethinking data architecture is essential for making AI deliver real business value.
Merlin, the “fourth global major” in music rights, represents 15% of the recorded music market and manages over a petabyte of data from 40+ digital service providers. With fewer than 50 employees, the team processes billions of royalty transactions monthly and delivers timely insights to 600+ members worldwide.
In this session, Gary Watson, Director of Data Operations, will share how Merlin uses Dremio’s lakehouse to accelerate royalty verification, uncover anomalies, and power self-service analytics—all without costly data warehouse rebuilds. Learn how a lean team turns massive complexity into agility and scale.
Discover how Genomics England is leveraging Dremio to modernise structured data storage and provisioning for the National Genomic Research Library. We’ll share how Dremio supports the management of data products today, and explore our future plans for enhancing user management, complex access control and enabling scalable, secure research access.
Open source is at the heart of the modern data lakehouse, shaping how organisations achieve flexibility, interoperability, and performance at scale. This panel brings together leading community members from Apache Iceberg, Apache Arrow, and Apache Polaris to explore the present and future of Lakehouse OSS. Panelists will discuss how these projects are evolving, the challenges they’re addressing, and the opportunities they unlock for building truly open architectures. Join us for an engaging conversation on how open source innovation continues to push the boundaries of what’s possible in data platforms.
Join us for The Dremio Iceberg and Agentic AI Experience (Workshop)—a hands-on session designed to show how Dremio simplifies building and managing Apache Iceberg lakehouses while unlocking new frontiers in AI. Through guided exercises, you’ll learn how to quickly implement and operate Iceberg tables in Dremio’s open lakehouse platform, making your data more accessible, governed, and ready for analytics at scale. The workshop will also introduce Dremio’s MCP (Model Context Protocol) server and demonstrate how it enables agentic AI to seamlessly query, reason over, and act on your data. Whether you’re a data engineer, architect, or AI practitioner, this interactive experience will give you practical skills for unifying your data in Iceberg and empowering intelligent agents to put it to work.
Wednesday, October 29, 2025
DATEV IT Campus - Fürther Str. 111, 90429 Nürnberg, Germany
Enterprises are moving beyond AI experiments and copilots for individual productivity toward the next stage: the Agentic Enterprise, where AI agents drive insights, decisions, and workflows at scale. In this session, Rahim Bohjani, CTO of Dremio, will outline the enterprise AI maturity journey and show why most organizations are still stuck between prototypes and production outcomes. Through live demos and real-world use cases, you will see how Dremio’s open lakehouse platform provides the intelligent data foundation needed to advance. It delivers semantic consistency, zero ETL federation, autonomous optimization, and built-in governance. Learn how organizations can progress from isolated AI pilots to enterprise-wide impact, and why rethinking data architecture is essential for making AI deliver real business value.
Trusted Data Solutions in sensitive industries such as healthcare and the public sector must combine sovereignty, compliance, and scalability with the ability to integrate diverse data sources.
Danyel Gros and Marcel Just from STACKIT will share insights into how the sovereign cloud foundation of STACKIT and the open lakehouse platform of Dremio work together to provide consistent access, built-in governance, and efficient data management without heavy integration overhead.
This combination will be illustrated through the example of a Medical Data Platform, showing how sensitive information can be managed securely while enabling a scalable and future-proof data foundation for research and healthcare services.
The lakehouse has become the dominant paradigm for modern analytics and AI, but its inherent complexity often frustrates enterprises. In this session, Thomas Zeutschler from BARC takes a neutral analyst’s perspective to explore how the lakehouse has evolved from buzzword to production reality, where the true challenges lie, and how innovations are making life easier for both data engineers and business users. With a special focus on data sovereignty and sovereign cloud platforms such as StackIT, the talk will highlight when a single-platform strategy makes sense and when a multi-vendor approach delivers the best of both worlds. Expect praise, critique, and a few laughs along the way.
In this session, DATEV outlines their data landscape and discuss pre-Dremio challenges. They showcase their current self-service analytics process using Dremio, detailing their technical infrastructure.
Ever felt like accessing production data requires a secret handshake with IT – and maybe some cookies? At Schaeffler, our data scientists decided it’s time to break the cycle. In this session, we’ll share how we’re using Dremio to make large-scale production data AVAILABLE TO ALL – no magic spells or endless helpdesk tickets required.
We’ll take you through:
The not-always-easy journey from “Can I have that dataset, please?” to true data democratization
Two real-world use cases that sparked our transformation, including our motivations and guiding principles
What it looks like behind the curtain
Where we’re heading next – AI, chatbots, and whatever disruptions come our way!
This talk takes you on a journey through the evolution of data transformation—from traditional rollout processes rooted in data warehouses, to modern, automated workflows designed for the lakehouse. We’ll explore practical approaches using Dremio and declarative frameworks like dbt, highlighting how teams can transition toward scalable, maintainable solutions. The session concludes with a look at Agentic AI: what’s already possible today, and where intelligent automation might take us next.
Thursday, November 6, 2025
Club Sportivia - 521 Charcot Ave, San Jose, CA
Join us for The Dremio Iceberg and Agentic AI Experience (Workshop)—a hands-on session designed to show how Dremio simplifies building and managing Apache Iceberg lakehouses while unlocking new frontiers in AI. Through guided exercises, you’ll learn how to quickly implement and operate Iceberg tables in Dremio’s open lakehouse platform, making your data more accessible, governed, and ready for analytics at scale. The workshop will also introduce Dremio’s MCP (Model Context Protocol) server and demonstrate how it enables agentic AI to seamlessly query, reason over, and act on your data. Whether you’re a data engineer, architect, or AI practitioner, this interactive experience will give you practical skills for unifying your data in Iceberg and empowering intelligent agents to put it to work.
Enterprises are moving beyond AI experiments and copilots for individual productivity toward the next stage: the Agentic Enterprise, where AI agents drive insights, decisions, and workflows at scale. In this session, Sendur Sellakumar, CEO of Dremio, will outline the enterprise AI maturity journey and show why most organizations are still stuck between prototypes and production outcomes. Through live demos and real-world use cases, you will see how Dremio’s open lakehouse platform provides the intelligent data foundation needed to advance. It delivers semantic consistency, zero ETL federation, autonomous optimization, and built-in governance. Learn how organizations can progress from isolated AI pilots to enterprise-wide impact, and why rethinking data architecture is essential for making AI deliver real business value.
The Challenge
Granicus (14 acquisitions, 5 countries) struggled with fragmented government data across 14+ sources: Salesforce CRM, EHQ MySQL, Matomo streams, GovD Oracle, Open Cities scraping, and more. Needed unified citizen analytics with regulatory compliance.
Technical Solution
– Nessie Catalog + Generative AI
– Iceberg metadata format with AWS S3 auto-reflections
– OpenSearch knowledge base for context-aware AI responses
Advanced Semantic Layers
– Complex multi-domain models with automated reflection optimization
– Query times: minutes → milliseconds through intelligent materialization
Dremio MCP NLP-to-SQL
– “Show housing meetings with >100 attendees last quarter” → Instant SQL
– Government-trained language models with optimal query generation
Dashboard Architecture
– Highcharts integration via Dremio REST APIs
– Intelligent caching for sub-second UI responses
– Real-time citizen engagement metrics
Zero-Copy Pipeline
– 14+ sources unified without data movement
Live Demos:
– NLP query: “Find legislation about housing with citizen feedback >8”
– Real-time reflections optimization
– Highcharts dashboard with live government KPIs
– REST API caching performance comparison
Results:
– 95% faster time-to-insight
– 60% cost reduction through reflections
– 300% increase in self-service adoption
Takeaways:
– Nessie + Iceberg configuration guide
– NLP-to-SQL with Dremio MCPs
– Highcharts-REST API integration patterns
– Reflections optimization playbook
Experience instant citizen insights through plain English queries powered by Dremio’s zero-copy architecture and intelligent semantic optimization.
As organizations adopt AI at scale, managing data governance becomes crucial to ensure trust, compliance, and security. In this session, we will explore how Dremio enables governance in the AI era by providing a unified, secure, and high performance data platform. Learn how to implement policies that ensure data quality, maintain lineage, and enforce access controls while empowering data teams to deliver AI insights faster. Whether you are building AI models or deploying analytics across the enterprise, discover practical strategies for governing your data without slowing innovation.
Unlock the power of open data! Explore how Iceberg’s REST spec and Polaris enable seamless interoperability across your data ecosystem. Connect tools, share insights, and keep your analytics flowing—open, flexible, and friction-free.
Modern cybersecurity systems rely heavily on complex data infrastructures to detect threats, analyze risks, and enforce policies at scale. However, these infrastructures are often fragmented, expensive to maintain, and difficult to evolve. This survey examines the common practices and architectural patterns in cybersecurity data infrastructure—focusing on log pipelines, SIEM platforms, data lakes, and threat detection engines—and identifies their limitations in handling real-time, interconnected data. We highlight the challenges in achieving high performance, explainability, and scalability in traditional setups. To address these challenges, we propose a graph-based approach built on the Data Lakehouse architecture, integrating technologies such as Nessie, Dremio, Apache Iceberg, and PuppyGraph, a graph query engine optimized for Iceberg tables. By modeling cybersecurity data as a connected graph rather than isolated logs or events, PuppyGraph enables more intuitive threat detection, faster investigation, and a streamlined architecture with reduced ETL complexity. We present real-world case studies from industry adopters to illustrate the simplification and performance improvements enabled by this paradigm shift.
Open source is at the heart of the modern data lakehouse, shaping how organizations achieve flexibility, interoperability, and performance at scale. This panel brings together leading community members from Apache Iceberg, Apache Arrow, and Apache Polaris to explore the present and future of Lakehouse OSS. Panelists will discuss how these projects are evolving, the challenges they’re addressing, and the opportunities they unlock for building truly open architectures. Join us for an engaging conversation on how open source innovation continues to push the boundaries of what’s possible in data platforms.
Thursday, November 13, 2025
Convene - 360 Madison Avenue
Tuesday, October 14, 2025
Mandarin Oriental Lutetia, Paris, 47 Bd Raspail, 75006 Paris, France
9:00 AM - 9:10 AM
9:10 AM - 10:30 AM
Enterprises are moving beyond AI experiments and copilots for individual productivity toward the next stage: the Agentic Enterprise, where AI agents drive insights, decisions, and workflows at scale. In this session, Sendur Sellakumar, CEO of Dremio, will outline the enterprise AI maturity journey and show why most organizations are still stuck between prototypes and production outcomes. Through live demos and real-world use cases, you will see how Dremio’s open lakehouse platform provides the intelligent data foundation needed to advance. It delivers semantic consistency, zero ETL federation, autonomous optimization, and built-in governance. Learn how organizations can progress from isolated AI pilots to enterprise-wide impact, and why rethinking data architecture is essential for making AI deliver real business value.
10:30 AM - 10:50 AM
10:50 AM - 11:20 AM
Join Julien Deschamps, Data Product Manager, and Carlos Lopez Hernandez, Product Owner of Dremio at BNP Paribas Personal Finance, as they share how their team is transforming the way data is accessed and consolidated. In this session, you’ll hear how BNP Paribas Personal Finance leverages Dremio to simplify data integration, accelerate insights, and empower business stakeholders with trusted, scalable access to data—without the complexity of traditional consolidation approaches.
11:20 AM - 11:50 AM
As digital sovereignty becomes a critical priority for public organizations, Urssaf Caisse Nationale has launched an ambitious initiative with the creation of its Datafabrique. This program is designed to build a sovereign data architecture that ensures security, quality, and control, while also enabling innovation and agility.
This session will share a concrete case study, highlighting the main challenges faced, the strategic decisions made, and the key lessons learned throughout the transformation.
11:50 AM - 12:10 PM
12:10 PM - 12:55 PM
Open source is at the heart of the modern data lakehouse, shaping how organisations achieve flexibility, interoperability, and performance at scale. This panel brings together leading community members from Apache Iceberg, Apache Arrow, and Apache Polaris to explore the present and future of Lakehouse OSS. Panelists will discuss how these projects are evolving, the challenges they’re addressing, and the opportunities they unlock for building truly open architectures. Join us for an engaging conversation on how open source innovation continues to push the boundaries of what’s possible in data platforms.
12:55 PM - 1:00 PM
1:00 PM - 2:00 PM
2PM - 4PM
Join us for The Dremio Iceberg and Agentic AI Experience (Workshop)—a hands-on session designed to show how Dremio simplifies building and managing Apache Iceberg lakehouses while unlocking new frontiers in AI. Through guided exercises, you’ll learn how to quickly implement and operate Iceberg tables in Dremio’s open lakehouse platform, making your data more accessible, governed, and ready for analytics at scale. The workshop will also introduce Dremio’s MCP (Model Context Protocol) server and demonstrate how it enables agentic AI to seamlessly query, reason over, and act on your data. Whether you’re a data engineer, architect, or AI practitioner, this interactive experience will give you practical skills for unifying your data in Iceberg and empowering intelligent agents to put it to work.
Thursday, October 16, 2025
Everyman Broadgate Cinema. 1 Finsbury Ave, London EC2M 2PF
9:00 AM - 10:00 AM
10:00 AM - 10:10 AM
10:10 AM - 11:30 AM
Enterprises are moving beyond AI experiments and copilots for individual productivity toward the next stage: the Agentic Enterprise, where AI agents drive insights, decisions, and workflows at scale. In this session, Sendur Sellakumar, CEO of Dremio, will outline the enterprise AI maturity journey and show why most organizations are still stuck between prototypes and production outcomes. Through live demos and real-world use cases, you will see how Dremio’s open lakehouse platform provides the intelligent data foundation needed to advance. It delivers semantic consistency, zero ETL federation, autonomous optimization, and built-in governance. Learn how organizations can progress from isolated AI pilots to enterprise-wide impact, and why rethinking data architecture is essential for making AI deliver real business value.
11:30 AM - 11:50 AM
11:50 AM - 12:20 PM
Merlin, the “fourth global major” in music rights, represents 15% of the recorded music market and manages over a petabyte of data from 40+ digital service providers. With fewer than 50 employees, the team processes billions of royalty transactions monthly and delivers timely insights to 600+ members worldwide.
In this session, Gary Watson, Director of Data Operations, will share how Merlin uses Dremio’s lakehouse to accelerate royalty verification, uncover anomalies, and power self-service analytics—all without costly data warehouse rebuilds. Learn how a lean team turns massive complexity into agility and scale.
12:20 PM - 12:50 PM
Discover how Genomics England is leveraging Dremio to modernise structured data storage and provisioning for the National Genomic Research Library. We’ll share how Dremio supports the management of data products today, and explore our future plans for enhancing user management, complex access control and enabling scalable, secure research access.
12:50 PM - 1:35 PM
1:35 PM - 2:25 PM
Open source is at the heart of the modern data lakehouse, shaping how organisations achieve flexibility, interoperability, and performance at scale. This panel brings together leading community members from Apache Iceberg, Apache Arrow, and Apache Polaris to explore the present and future of Lakehouse OSS. Panelists will discuss how these projects are evolving, the challenges they’re addressing, and the opportunities they unlock for building truly open architectures. Join us for an engaging conversation on how open source innovation continues to push the boundaries of what’s possible in data platforms.
2:25 PM - 2:30 PM
2:30 PM - 3:30 PM
3:30 PM - 5:30 PM
Join us for The Dremio Iceberg and Agentic AI Experience (Workshop)—a hands-on session designed to show how Dremio simplifies building and managing Apache Iceberg lakehouses while unlocking new frontiers in AI. Through guided exercises, you’ll learn how to quickly implement and operate Iceberg tables in Dremio’s open lakehouse platform, making your data more accessible, governed, and ready for analytics at scale. The workshop will also introduce Dremio’s MCP (Model Context Protocol) server and demonstrate how it enables agentic AI to seamlessly query, reason over, and act on your data. Whether you’re a data engineer, architect, or AI practitioner, this interactive experience will give you practical skills for unifying your data in Iceberg and empowering intelligent agents to put it to work.
Wednesday, October 29, 2025
DATEV IT Campus - Fürther Str. 111, 90429 Nürnberg, Germany
12:30 PM - 1:15 PM
1:15 PM - 1:25 PM
1:25 PM - 2:40 PM
Enterprises are moving beyond AI experiments and copilots for individual productivity toward the next stage: the Agentic Enterprise, where AI agents drive insights, decisions, and workflows at scale. In this session, Rahim Bohjani, CTO of Dremio, will outline the enterprise AI maturity journey and show why most organizations are still stuck between prototypes and production outcomes. Through live demos and real-world use cases, you will see how Dremio’s open lakehouse platform provides the intelligent data foundation needed to advance. It delivers semantic consistency, zero ETL federation, autonomous optimization, and built-in governance. Learn how organizations can progress from isolated AI pilots to enterprise-wide impact, and why rethinking data architecture is essential for making AI deliver real business value.
2:40 PM - 2:55 PM
Trusted Data Solutions in sensitive industries such as healthcare and the public sector must combine sovereignty, compliance, and scalability with the ability to integrate diverse data sources.
Danyel Gros and Marcel Just from STACKIT will share insights into how the sovereign cloud foundation of STACKIT and the open lakehouse platform of Dremio work together to provide consistent access, built-in governance, and efficient data management without heavy integration overhead.
This combination will be illustrated through the example of a Medical Data Platform, showing how sensitive information can be managed securely while enabling a scalable and future-proof data foundation for research and healthcare services.
2:55 PM - 3:15 PM
3:15 PM - 3:35 PM
The lakehouse has become the dominant paradigm for modern analytics and AI, but its inherent complexity often frustrates enterprises. In this session, Thomas Zeutschler from BARC takes a neutral analyst’s perspective to explore how the lakehouse has evolved from buzzword to production reality, where the true challenges lie, and how innovations are making life easier for both data engineers and business users. With a special focus on data sovereignty and sovereign cloud platforms such as StackIT, the talk will highlight when a single-platform strategy makes sense and when a multi-vendor approach delivers the best of both worlds. Expect praise, critique, and a few laughs along the way.
3:35 PM - 3:55 PM
In this session, DATEV outlines their data landscape and discuss pre-Dremio challenges. They showcase their current self-service analytics process using Dremio, detailing their technical infrastructure.
3:55 PM - 4:15 PM
Ever felt like accessing production data requires a secret handshake with IT – and maybe some cookies? At Schaeffler, our data scientists decided it’s time to break the cycle. In this session, we’ll share how we’re using Dremio to make large-scale production data AVAILABLE TO ALL – no magic spells or endless helpdesk tickets required.
We’ll take you through:
The not-always-easy journey from “Can I have that dataset, please?” to true data democratization
Two real-world use cases that sparked our transformation, including our motivations and guiding principles
What it looks like behind the curtain
Where we’re heading next – AI, chatbots, and whatever disruptions come our way!
4:15 PM - 4:35 PM
4:35 PM - 4:55 PM
This talk takes you on a journey through the evolution of data transformation—from traditional rollout processes rooted in data warehouses, to modern, automated workflows designed for the lakehouse. We’ll explore practical approaches using Dremio and declarative frameworks like dbt, highlighting how teams can transition toward scalable, maintainable solutions. The session concludes with a look at Agentic AI: what’s already possible today, and where intelligent automation might take us next.
4:55 PM - 5:15 PM
5:15 PM - 5:20 PM
5:20 PM - 5:50 PM
Thursday, November 6, 2025
Club Sportivia - 521 Charcot Ave, San Jose, CA
10:00 AM - 12:00 PM
Join us for The Dremio Iceberg and Agentic AI Experience (Workshop)—a hands-on session designed to show how Dremio simplifies building and managing Apache Iceberg lakehouses while unlocking new frontiers in AI. Through guided exercises, you’ll learn how to quickly implement and operate Iceberg tables in Dremio’s open lakehouse platform, making your data more accessible, governed, and ready for analytics at scale. The workshop will also introduce Dremio’s MCP (Model Context Protocol) server and demonstrate how it enables agentic AI to seamlessly query, reason over, and act on your data. Whether you’re a data engineer, architect, or AI practitioner, this interactive experience will give you practical skills for unifying your data in Iceberg and empowering intelligent agents to put it to work.
12:00 PM - 1:00 PM
1:00 PM - 2:15 PM
Enterprises are moving beyond AI experiments and copilots for individual productivity toward the next stage: the Agentic Enterprise, where AI agents drive insights, decisions, and workflows at scale. In this session, Sendur Sellakumar, CEO of Dremio, will outline the enterprise AI maturity journey and show why most organizations are still stuck between prototypes and production outcomes. Through live demos and real-world use cases, you will see how Dremio’s open lakehouse platform provides the intelligent data foundation needed to advance. It delivers semantic consistency, zero ETL federation, autonomous optimization, and built-in governance. Learn how organizations can progress from isolated AI pilots to enterprise-wide impact, and why rethinking data architecture is essential for making AI deliver real business value.
2:15 PM - 2:45 PM
The Challenge
Granicus (14 acquisitions, 5 countries) struggled with fragmented government data across 14+ sources: Salesforce CRM, EHQ MySQL, Matomo streams, GovD Oracle, Open Cities scraping, and more. Needed unified citizen analytics with regulatory compliance.
Technical Solution
– Nessie Catalog + Generative AI
– Iceberg metadata format with AWS S3 auto-reflections
– OpenSearch knowledge base for context-aware AI responses
Advanced Semantic Layers
– Complex multi-domain models with automated reflection optimization
– Query times: minutes → milliseconds through intelligent materialization
Dremio MCP NLP-to-SQL
– “Show housing meetings with >100 attendees last quarter” → Instant SQL
– Government-trained language models with optimal query generation
Dashboard Architecture
– Highcharts integration via Dremio REST APIs
– Intelligent caching for sub-second UI responses
– Real-time citizen engagement metrics
Zero-Copy Pipeline
– 14+ sources unified without data movement
Live Demos:
– NLP query: “Find legislation about housing with citizen feedback >8”
– Real-time reflections optimization
– Highcharts dashboard with live government KPIs
– REST API caching performance comparison
Results:
– 95% faster time-to-insight
– 60% cost reduction through reflections
– 300% increase in self-service adoption
Takeaways:
– Nessie + Iceberg configuration guide
– NLP-to-SQL with Dremio MCPs
– Highcharts-REST API integration patterns
– Reflections optimization playbook
Experience instant citizen insights through plain English queries powered by Dremio’s zero-copy architecture and intelligent semantic optimization.
2:45 PM - 3:00 PM
3:00 PM - 3:30 PM
As organizations adopt AI at scale, managing data governance becomes crucial to ensure trust, compliance, and security. In this session, we will explore how Dremio enables governance in the AI era by providing a unified, secure, and high performance data platform. Learn how to implement policies that ensure data quality, maintain lineage, and enforce access controls while empowering data teams to deliver AI insights faster. Whether you are building AI models or deploying analytics across the enterprise, discover practical strategies for governing your data without slowing innovation.
3:30 PM - 4:00 PM
Unlock the power of open data! Explore how Iceberg’s REST spec and Polaris enable seamless interoperability across your data ecosystem. Connect tools, share insights, and keep your analytics flowing—open, flexible, and friction-free.
4:00 PM - 4:20 PM
Modern cybersecurity systems rely heavily on complex data infrastructures to detect threats, analyze risks, and enforce policies at scale. However, these infrastructures are often fragmented, expensive to maintain, and difficult to evolve. This survey examines the common practices and architectural patterns in cybersecurity data infrastructure—focusing on log pipelines, SIEM platforms, data lakes, and threat detection engines—and identifies their limitations in handling real-time, interconnected data. We highlight the challenges in achieving high performance, explainability, and scalability in traditional setups. To address these challenges, we propose a graph-based approach built on the Data Lakehouse architecture, integrating technologies such as Nessie, Dremio, Apache Iceberg, and PuppyGraph, a graph query engine optimized for Iceberg tables. By modeling cybersecurity data as a connected graph rather than isolated logs or events, PuppyGraph enables more intuitive threat detection, faster investigation, and a streamlined architecture with reduced ETL complexity. We present real-world case studies from industry adopters to illustrate the simplification and performance improvements enabled by this paradigm shift.
4:20 PM - 5:05 PM
Open source is at the heart of the modern data lakehouse, shaping how organizations achieve flexibility, interoperability, and performance at scale. This panel brings together leading community members from Apache Iceberg, Apache Arrow, and Apache Polaris to explore the present and future of Lakehouse OSS. Panelists will discuss how these projects are evolving, the challenges they’re addressing, and the opportunities they unlock for building truly open architectures. Join us for an engaging conversation on how open source innovation continues to push the boundaries of what’s possible in data platforms.
5:05 PM -6:00 PM
Thursday, November 13, 2025
Convene - 360 Madison Avenue
8:00 AM - 10:00 AM
8:00 AM - 9:45 AM
8:00 AM - 10:00 AM
10:00 AM - 11:30 AM
11:30 AM - 12:30 PM
12:30 PM - 1:15 PM
1:30 PM - 2:15 PM
2:30 PM - 3:15 PM
3:15 PM - 3:30 PM
3:30 PM - 4:15 PM
4:20 PM - 5:00 PM
5:00 PM - 6:00 PM
For more information on sponsorship at Subsurface LIVE 2025, please contact our Sponsorship Management Team.