November 6, 2025

San Jose, USA
Thursday, November 6, 2025
Venue Address
Club Sportivia
521 Charcot Ave, San Jose, CA
Getting There
From the North Bay (San Francisco, Oakland, Berkeley):
Take US-101 South or I-880 South toward San Jose
On US-101
- Take the exit for CA-87 N / Guadalupe Parkway
- Merge onto CA-87 N, then take the exit for Brokaw Road / Charcot Avenue
- Turn right onto Charcot Avenue
Club Sportiva will be on your left shortly after passing Junction Avenue
On I-880
- Take the exit for Montague Expressway
- Turn right onto Montague Expressway
- Turn left onto Junction Avenue
- Turn right onto Charcot Avenue
Club Sportiva will be on your left
From the South Bay (Santa Clara, Cupertino, Sunnyvale, South San Jose):
I-880 North / US-101 North
- Exit at Montague Expressway
- Turn left onto Montague Expressway
- Turn left onto Junction Avenue
- Turn right onto Charcot Avenue
Club Sportiva will be on your left
Public Transit
- Take Caltrain or VTA Light Rail to Milpitas or Santa Clara
- Transfer to a local bus or rideshare for the final leg to Charcot Avenue
- The venue is a short drive from the VTA Component Light Rail Station
Nearby Hotels
Coming from out of town? Here are nearby hotels:
- Hyatt House San Jose Airport6 min
- Sonesta Silicon Valley6 min
Agenda at a Glance: San Jose, USA
San Jose, USA
Thursday, November 6, 2025
Club Sportivia - 521 Charcot Ave, San Jose, CA
The Dremio Iceberg and Agentic AI Experience
Join us for The Dremio Iceberg and Agentic AI Experience (Workshop)—a hands-on session designed to show how Dremio simplifies building and managing Apache Iceberg lakehouses while unlocking new frontiers in AI. Through guided exercises, you’ll learn how to quickly implement and operate Iceberg tables in Dremio’s open lakehouse platform, making your data more accessible, governed, and ready for analytics at scale. The workshop will also introduce Dremio’s MCP (Model Context Protocol) server and demonstrate how it enables agentic AI to seamlessly query, reason over, and act on your data. Whether you’re a data engineer, architect, or AI practitioner, this interactive experience will give you practical skills for unifying your data in Iceberg and empowering intelligent agents to put it to work.
Networking Lunch & Registration
NetworkingEnabling the Agentic Enterprise
Enterprises are moving beyond AI experiments and copilots for individual productivity toward the next stage: the Agentic Enterprise, where AI agents drive insights, decisions, and workflows at scale. In this session, Sendur Sellakumar, CEO of Dremio, will outline the enterprise AI maturity journey and show why most organizations are still stuck between prototypes and production outcomes. Through live demos and real-world use cases, you will see how Dremio’s open lakehouse platform provides the intelligent data foundation needed to advance. It delivers semantic consistency, zero ETL federation, autonomous optimization, and built-in governance. Learn how organizations can progress from isolated AI pilots to enterprise-wide impact, and why rethinking data architecture is essential for making AI deliver real business value.
Granicus Case Study: Building a Zero-Copy Data Mesh with Dremio's Intelligent Semantic Optimization
The Challenge
Granicus (14 acquisitions, 5 countries) struggled with fragmented government data across 14+ sources: Salesforce CRM, EHQ MySQL, Matomo streams, GovD Oracle, Open Cities scraping, and more. Needed unified citizen analytics with regulatory compliance.
Technical Solution
– Nessie Catalog + Generative AI
– Iceberg metadata format with AWS S3 auto-reflections
– OpenSearch knowledge base for context-aware AI responses
Advanced Semantic Layers
– Complex multi-domain models with automated reflection optimization
– Query times: minutes → milliseconds through intelligent materialization
Dremio MCP NLP-to-SQL
– “Show housing meetings with >100 attendees last quarter” → Instant SQL
– Government-trained language models with optimal query generation
Dashboard Architecture
– Highcharts integration via Dremio REST APIs
– Intelligent caching for sub-second UI responses
– Real-time citizen engagement metrics
Zero-Copy Pipeline
– 14+ sources unified without data movement
Live Demos:
– NLP query: “Find legislation about housing with citizen feedback >8”
– Real-time reflections optimization
– Highcharts dashboard with live government KPIs
– REST API caching performance comparison
Results:
– 95% faster time-to-insight
– 60% cost reduction through reflections
– 300% increase in self-service adoption
Takeaways:
– Nessie + Iceberg configuration guide
– NLP-to-SQL with Dremio MCPs
– Highcharts-REST API integration patterns
– Reflections optimization playbook
Experience instant citizen insights through plain English queries powered by Dremio’s zero-copy architecture and intelligent semantic optimization.
Networking Break
BreakGovernance in the AI Era
As organizations adopt AI at scale, managing data governance becomes crucial to ensure trust, compliance, and security. In this session, we will explore how Dremio enables governance in the AI era by providing a unified, secure, and high performance data platform. Learn how to implement policies that ensure data quality, maintain lineage, and enforce access controls while empowering data teams to deliver AI insights faster. Whether you are building AI models or deploying analytics across the enterprise, discover practical strategies for governing your data without slowing innovation.
Iceberg REST Spec & Polaris: Interoperability in the Open Ecosystem
Unlock the power of open data! Explore how Iceberg’s REST spec and Polaris enable seamless interoperability across your data ecosystem. Connect tools, share insights, and keep your analytics flowing—open, flexible, and friction-free.
A Survey of Cybersecurity Data Infra and How to Simplify It with Graph Lakehouse
Modern cybersecurity systems rely heavily on complex data infrastructures to detect threats, analyze risks, and enforce policies at scale. However, these infrastructures are often fragmented, expensive to maintain, and difficult to evolve. This survey examines the common practices and architectural patterns in cybersecurity data infrastructure—focusing on log pipelines, SIEM platforms, data lakes, and threat detection engines—and identifies their limitations in handling real-time, interconnected data. We highlight the challenges in achieving high performance, explainability, and scalability in traditional setups. To address these challenges, we propose a graph-based approach built on the Data Lakehouse architecture, integrating technologies such as Nessie, Dremio, Apache Iceberg, and PuppyGraph, a graph query engine optimized for Iceberg tables. By modeling cybersecurity data as a connected graph rather than isolated logs or events, PuppyGraph enables more intuitive threat detection, faster investigation, and a streamlined architecture with reduced ETL complexity. We present real-world case studies from industry adopters to illustrate the simplification and performance improvements enabled by this paradigm shift.
Lakehouse OSS (Iceberg, Arrow, Polaris) Panel
Open source is at the heart of the modern data lakehouse, shaping how organizations achieve flexibility, interoperability, and performance at scale. This panel brings together leading community members from Apache Iceberg, Apache Arrow, and Apache Polaris to explore the present and future of Lakehouse OSS. Panelists will discuss how these projects are evolving, the challenges they’re addressing, and the opportunities they unlock for building truly open architectures. Join us for an engaging conversation on how open source innovation continues to push the boundaries of what’s possible in data platforms.
Networking Reception
NetworkingSan Jose, USA
Thursday, November 6, 2025
Club Sportivia - 521 Charcot Ave, San Jose, CA
10:00 AM - 12:00 PM
The Dremio Iceberg and Agentic AI Experience
Join us for The Dremio Iceberg and Agentic AI Experience (Workshop)—a hands-on session designed to show how Dremio simplifies building and managing Apache Iceberg lakehouses while unlocking new frontiers in AI. Through guided exercises, you’ll learn how to quickly implement and operate Iceberg tables in Dremio’s open lakehouse platform, making your data more accessible, governed, and ready for analytics at scale. The workshop will also introduce Dremio’s MCP (Model Context Protocol) server and demonstrate how it enables agentic AI to seamlessly query, reason over, and act on your data. Whether you’re a data engineer, architect, or AI practitioner, this interactive experience will give you practical skills for unifying your data in Iceberg and empowering intelligent agents to put it to work.
12:00 PM - 1:00 PM
Networking Lunch & Registration
Networking1:00 PM - 2:15 PM
Enabling the Agentic Enterprise
Enterprises are moving beyond AI experiments and copilots for individual productivity toward the next stage: the Agentic Enterprise, where AI agents drive insights, decisions, and workflows at scale. In this session, Sendur Sellakumar, CEO of Dremio, will outline the enterprise AI maturity journey and show why most organizations are still stuck between prototypes and production outcomes. Through live demos and real-world use cases, you will see how Dremio’s open lakehouse platform provides the intelligent data foundation needed to advance. It delivers semantic consistency, zero ETL federation, autonomous optimization, and built-in governance. Learn how organizations can progress from isolated AI pilots to enterprise-wide impact, and why rethinking data architecture is essential for making AI deliver real business value.
2:15 PM - 2:45 PM
Granicus Case Study: Building a Zero-Copy Data Mesh with Dremio's Intelligent Semantic Optimization
The Challenge
Granicus (14 acquisitions, 5 countries) struggled with fragmented government data across 14+ sources: Salesforce CRM, EHQ MySQL, Matomo streams, GovD Oracle, Open Cities scraping, and more. Needed unified citizen analytics with regulatory compliance.
Technical Solution
– Nessie Catalog + Generative AI
– Iceberg metadata format with AWS S3 auto-reflections
– OpenSearch knowledge base for context-aware AI responses
Advanced Semantic Layers
– Complex multi-domain models with automated reflection optimization
– Query times: minutes → milliseconds through intelligent materialization
Dremio MCP NLP-to-SQL
– “Show housing meetings with >100 attendees last quarter” → Instant SQL
– Government-trained language models with optimal query generation
Dashboard Architecture
– Highcharts integration via Dremio REST APIs
– Intelligent caching for sub-second UI responses
– Real-time citizen engagement metrics
Zero-Copy Pipeline
– 14+ sources unified without data movement
Live Demos:
– NLP query: “Find legislation about housing with citizen feedback >8”
– Real-time reflections optimization
– Highcharts dashboard with live government KPIs
– REST API caching performance comparison
Results:
– 95% faster time-to-insight
– 60% cost reduction through reflections
– 300% increase in self-service adoption
Takeaways:
– Nessie + Iceberg configuration guide
– NLP-to-SQL with Dremio MCPs
– Highcharts-REST API integration patterns
– Reflections optimization playbook
Experience instant citizen insights through plain English queries powered by Dremio’s zero-copy architecture and intelligent semantic optimization.
2:45 PM - 3:00 PM
Networking Break
Break3:00 PM - 3:30 PM
Governance in the AI Era
As organizations adopt AI at scale, managing data governance becomes crucial to ensure trust, compliance, and security. In this session, we will explore how Dremio enables governance in the AI era by providing a unified, secure, and high performance data platform. Learn how to implement policies that ensure data quality, maintain lineage, and enforce access controls while empowering data teams to deliver AI insights faster. Whether you are building AI models or deploying analytics across the enterprise, discover practical strategies for governing your data without slowing innovation.
3:30 PM - 4:00 PM
Iceberg REST Spec & Polaris: Interoperability in the Open Ecosystem
Unlock the power of open data! Explore how Iceberg’s REST spec and Polaris enable seamless interoperability across your data ecosystem. Connect tools, share insights, and keep your analytics flowing—open, flexible, and friction-free.
4:00 PM - 4:20 PM
A Survey of Cybersecurity Data Infra and How to Simplify It with Graph Lakehouse
Modern cybersecurity systems rely heavily on complex data infrastructures to detect threats, analyze risks, and enforce policies at scale. However, these infrastructures are often fragmented, expensive to maintain, and difficult to evolve. This survey examines the common practices and architectural patterns in cybersecurity data infrastructure—focusing on log pipelines, SIEM platforms, data lakes, and threat detection engines—and identifies their limitations in handling real-time, interconnected data. We highlight the challenges in achieving high performance, explainability, and scalability in traditional setups. To address these challenges, we propose a graph-based approach built on the Data Lakehouse architecture, integrating technologies such as Nessie, Dremio, Apache Iceberg, and PuppyGraph, a graph query engine optimized for Iceberg tables. By modeling cybersecurity data as a connected graph rather than isolated logs or events, PuppyGraph enables more intuitive threat detection, faster investigation, and a streamlined architecture with reduced ETL complexity. We present real-world case studies from industry adopters to illustrate the simplification and performance improvements enabled by this paradigm shift.
4:20 PM - 5:05 PM
Lakehouse OSS (Iceberg, Arrow, Polaris) Panel
Open source is at the heart of the modern data lakehouse, shaping how organizations achieve flexibility, interoperability, and performance at scale. This panel brings together leading community members from Apache Iceberg, Apache Arrow, and Apache Polaris to explore the present and future of Lakehouse OSS. Panelists will discuss how these projects are evolving, the challenges they’re addressing, and the opportunities they unlock for building truly open architectures. Join us for an engaging conversation on how open source innovation continues to push the boundaries of what’s possible in data platforms.
5:05 PM -6:00 PM
Networking Reception
NetworkingOct 14
Paris

Oct 16
London

Oct 29
Nuremberg

Nov 6
San Jose

Nov 13
NYC






Sponsorships Available
For more information on sponsorship at Subsurface LIVE 2025, please contact our Sponsorship Management Team.