November 6, 2025

SAN JOSE, USA

San Jose, USA

Thursday, November 6, 2025

Club Sportivia - 521 Charcot Ave, San Jose, CA

10:00 AM - 12:00 PM

The Dremio Iceberg and Agentic AI Experience

Join us for The Dremio Iceberg and Agentic AI Experience (Workshop)—a hands-on session designed to show how Dremio simplifies building and managing Apache Iceberg lakehouses while unlocking new frontiers in AI. Through guided exercises, you’ll learn how to quickly implement and operate Iceberg tables in Dremio’s open lakehouse platform, making your data more accessible, governed, and ready for analytics at scale. The workshop will also introduce Dremio’s MCP (Model Context Protocol) server and demonstrate how it enables agentic AI to seamlessly query, reason over, and act on your data. Whether you’re a data engineer, architect, or AI practitioner, this interactive experience will give you practical skills for unifying your data in Iceberg and empowering intelligent agents to put it to work.

Workshop
12:00 PM - 1:00 PM

Networking Lunch & Registration

Networking
1:00 PM - 2:15 PM

Enabling the Agentic Enterprise

Enterprises are moving beyond AI experiments and copilots for individual productivity toward the next stage: the Agentic Enterprise, where AI agents drive insights, decisions, and workflows at scale. In this session, Sendur Sellakumar, CEO of Dremio, will outline the enterprise AI maturity journey and show why most organizations are still stuck between prototypes and production outcomes. Through live demos and real-world use cases, you will see how Dremio’s open lakehouse platform provides the intelligent data foundation needed to advance. It delivers semantic consistency, zero ETL federation, autonomous optimization, and built-in governance. Learn how organizations can progress from isolated AI pilots to enterprise-wide impact, and why rethinking data architecture is essential for making AI deliver real business value.

Keynote
2:15 PM - 2:45 PM

Granicus Case Study: Building a Zero-Copy Data Mesh with Dremio's Intelligent Semantic Optimization

The Challenge

Granicus (14 acquisitions, 5 countries) struggled with fragmented government data across 14+ sources: Salesforce CRM, EHQ MySQL, Matomo streams, GovD Oracle, Open Cities scraping, and more. Needed unified citizen analytics with regulatory compliance.

Technical Solution

– Nessie Catalog + Generative AI
– Iceberg metadata format with AWS S3 auto-reflections
– OpenSearch knowledge base for context-aware AI responses
Advanced Semantic Layers

– Complex multi-domain models with automated reflection optimization
– Query times: minutes → milliseconds through intelligent materialization
Dremio MCP NLP-to-SQL

– “Show housing meetings with >100 attendees last quarter” → Instant SQL
– Government-trained language models with optimal query generation
Dashboard Architecture

– Highcharts integration via Dremio REST APIs
– Intelligent caching for sub-second UI responses
– Real-time citizen engagement metrics
Zero-Copy Pipeline

– 14+ sources unified without data movement
Live Demos:

– NLP query: “Find legislation about housing with citizen feedback >8”
– Real-time reflections optimization
– Highcharts dashboard with live government KPIs
– REST API caching performance comparison
Results:

– 95% faster time-to-insight
– 60% cost reduction through reflections
– 300% increase in self-service adoption
Takeaways:

– Nessie + Iceberg configuration guide
– NLP-to-SQL with Dremio MCPs
– Highcharts-REST API integration patterns
– Reflections optimization playbook
Experience instant citizen insights through plain English queries powered by Dremio’s zero-copy architecture and intelligent semantic optimization.

Presentation
2:45 PM - 3:00 PM

Networking Break

Break
3:00 PM - 3:30 PM

Governance in the AI Era

As organizations adopt AI at scale, managing data governance becomes crucial to ensure trust, compliance, and security. In this session, we will explore how Dremio enables governance in the AI era by providing a unified, secure, and high performance data platform. Learn how to implement policies that ensure data quality, maintain lineage, and enforce access controls while empowering data teams to deliver AI insights faster. Whether you are building AI models or deploying analytics across the enterprise, discover practical strategies for governing your data without slowing innovation.

Presentation
3:30 PM - 4:00 PM

Iceberg REST Spec & Polaris: Interoperability in the Open Ecosystem

Unlock the power of open data! Explore how Iceberg’s REST spec and Polaris enable seamless interoperability across your data ecosystem. Connect tools, share insights, and keep your analytics flowing—open, flexible, and friction-free.

Presentation
4:00 PM - 4:20 PM

A Survey of Cybersecurity Data Infra and How to Simplify It with Graph Lakehouse

Modern cybersecurity systems rely heavily on complex data infrastructures to detect threats, analyze risks, and enforce policies at scale. However, these infrastructures are often fragmented, expensive to maintain, and difficult to evolve. This survey examines the common practices and architectural patterns in cybersecurity data infrastructure—focusing on log pipelines, SIEM platforms, data lakes, and threat detection engines—and identifies their limitations in handling real-time, interconnected data. We highlight the challenges in achieving high performance, explainability, and scalability in traditional setups. To address these challenges, we propose a graph-based approach built on the Data Lakehouse architecture, integrating technologies such as Nessie, Dremio, Apache Iceberg, and PuppyGraph, a graph query engine optimized for Iceberg tables. By modeling cybersecurity data as a connected graph rather than isolated logs or events, PuppyGraph enables more intuitive threat detection, faster investigation, and a streamlined architecture with reduced ETL complexity. We present real-world case studies from industry adopters to illustrate the simplification and performance improvements enabled by this paradigm shift.

Keynote
4:20 PM - 5:05 PM

Lakehouse OSS (Iceberg, Arrow, Polaris) Panel

Open source is at the heart of the modern data lakehouse, shaping how organizations achieve flexibility, interoperability, and performance at scale. This panel brings together leading community members from Apache Iceberg, Apache Arrow, and Apache Polaris to explore the present and future of Lakehouse OSS. Panelists will discuss how these projects are evolving, the challenges they’re addressing, and the opportunities they unlock for building truly open architectures. Join us for an engaging conversation on how open source innovation continues to push the boundaries of what’s possible in data platforms.

Panel
5:05 PM -6:00 PM

Networking Reception

Networking

San Jose, USA

Thursday, November 6, 2025

Club Sportivia - 521 Charcot Ave, San Jose, CA

10:00 AM - 12:00 PM

The Dremio Iceberg and Agentic AI Experience

Join us for The Dremio Iceberg and Agentic AI Experience (Workshop)—a hands-on session designed to show how Dremio simplifies building and managing Apache Iceberg lakehouses while unlocking new frontiers in AI. Through guided exercises, you’ll learn how to quickly implement and operate Iceberg tables in Dremio’s open lakehouse platform, making your data more accessible, governed, and ready for analytics at scale. The workshop will also introduce Dremio’s MCP (Model Context Protocol) server and demonstrate how it enables agentic AI to seamlessly query, reason over, and act on your data. Whether you’re a data engineer, architect, or AI practitioner, this interactive experience will give you practical skills for unifying your data in Iceberg and empowering intelligent agents to put it to work.

Workshop

12:00 PM - 1:00 PM

Networking Lunch & Registration

Networking

1:00 PM - 2:15 PM

Enabling the Agentic Enterprise

Enterprises are moving beyond AI experiments and copilots for individual productivity toward the next stage: the Agentic Enterprise, where AI agents drive insights, decisions, and workflows at scale. In this session, Sendur Sellakumar, CEO of Dremio, will outline the enterprise AI maturity journey and show why most organizations are still stuck between prototypes and production outcomes. Through live demos and real-world use cases, you will see how Dremio’s open lakehouse platform provides the intelligent data foundation needed to advance. It delivers semantic consistency, zero ETL federation, autonomous optimization, and built-in governance. Learn how organizations can progress from isolated AI pilots to enterprise-wide impact, and why rethinking data architecture is essential for making AI deliver real business value.

Keynote

2:15 PM - 2:45 PM

Granicus Case Study: Building a Zero-Copy Data Mesh with Dremio's Intelligent Semantic Optimization

The Challenge

Granicus (14 acquisitions, 5 countries) struggled with fragmented government data across 14+ sources: Salesforce CRM, EHQ MySQL, Matomo streams, GovD Oracle, Open Cities scraping, and more. Needed unified citizen analytics with regulatory compliance.

Technical Solution

– Nessie Catalog + Generative AI
– Iceberg metadata format with AWS S3 auto-reflections
– OpenSearch knowledge base for context-aware AI responses
Advanced Semantic Layers

– Complex multi-domain models with automated reflection optimization
– Query times: minutes → milliseconds through intelligent materialization
Dremio MCP NLP-to-SQL

– “Show housing meetings with >100 attendees last quarter” → Instant SQL
– Government-trained language models with optimal query generation
Dashboard Architecture

– Highcharts integration via Dremio REST APIs
– Intelligent caching for sub-second UI responses
– Real-time citizen engagement metrics
Zero-Copy Pipeline

– 14+ sources unified without data movement
Live Demos:

– NLP query: “Find legislation about housing with citizen feedback >8”
– Real-time reflections optimization
– Highcharts dashboard with live government KPIs
– REST API caching performance comparison
Results:

– 95% faster time-to-insight
– 60% cost reduction through reflections
– 300% increase in self-service adoption
Takeaways:

– Nessie + Iceberg configuration guide
– NLP-to-SQL with Dremio MCPs
– Highcharts-REST API integration patterns
– Reflections optimization playbook
Experience instant citizen insights through plain English queries powered by Dremio’s zero-copy architecture and intelligent semantic optimization.

Presentation

2:45 PM - 3:00 PM

Networking Break

Break

3:00 PM - 3:30 PM

Governance in the AI Era

As organizations adopt AI at scale, managing data governance becomes crucial to ensure trust, compliance, and security. In this session, we will explore how Dremio enables governance in the AI era by providing a unified, secure, and high performance data platform. Learn how to implement policies that ensure data quality, maintain lineage, and enforce access controls while empowering data teams to deliver AI insights faster. Whether you are building AI models or deploying analytics across the enterprise, discover practical strategies for governing your data without slowing innovation.

Presentation

3:30 PM - 4:00 PM

Iceberg REST Spec & Polaris: Interoperability in the Open Ecosystem

Unlock the power of open data! Explore how Iceberg’s REST spec and Polaris enable seamless interoperability across your data ecosystem. Connect tools, share insights, and keep your analytics flowing—open, flexible, and friction-free.

Presentation

4:00 PM - 4:20 PM

A Survey of Cybersecurity Data Infra and How to Simplify It with Graph Lakehouse

Modern cybersecurity systems rely heavily on complex data infrastructures to detect threats, analyze risks, and enforce policies at scale. However, these infrastructures are often fragmented, expensive to maintain, and difficult to evolve. This survey examines the common practices and architectural patterns in cybersecurity data infrastructure—focusing on log pipelines, SIEM platforms, data lakes, and threat detection engines—and identifies their limitations in handling real-time, interconnected data. We highlight the challenges in achieving high performance, explainability, and scalability in traditional setups. To address these challenges, we propose a graph-based approach built on the Data Lakehouse architecture, integrating technologies such as Nessie, Dremio, Apache Iceberg, and PuppyGraph, a graph query engine optimized for Iceberg tables. By modeling cybersecurity data as a connected graph rather than isolated logs or events, PuppyGraph enables more intuitive threat detection, faster investigation, and a streamlined architecture with reduced ETL complexity. We present real-world case studies from industry adopters to illustrate the simplification and performance improvements enabled by this paradigm shift.

Keynote

4:20 PM - 5:05 PM

Lakehouse OSS (Iceberg, Arrow, Polaris) Panel

Open source is at the heart of the modern data lakehouse, shaping how organizations achieve flexibility, interoperability, and performance at scale. This panel brings together leading community members from Apache Iceberg, Apache Arrow, and Apache Polaris to explore the present and future of Lakehouse OSS. Panelists will discuss how these projects are evolving, the challenges they’re addressing, and the opportunities they unlock for building truly open architectures. Join us for an engaging conversation on how open source innovation continues to push the boundaries of what’s possible in data platforms.

Panel

5:05 PM -6:00 PM

Networking Reception

Networking