DREMIO CAPABILITIES
Enterprise Catalog
Unified Metadata Management for a Governed Iceberg Lakehouse - Deployable Anywhere
The Enterprise Catalog powered by Apache Polaris (Incubating) is an Iceberg REST-compatible catalog that provides unified metadata management, fine-grained access controls, and data lineage, enabling a fully governed and high-performance Iceberg lakehouse—which is deployable anywhere.
Challenge
The Complexity of Modern Data Governance
As organizations build data lakehouses with Apache Iceberg, they face significant challenges in managing metadata and governance.
Fragmented Metadata
Multiple engines creates inconsistent security policies and broken data lineage, forcing teams to manually maintain governance across platforms.
Growing AI and ML Demands
Require data infrastructures that can rapidly prepare high-quality data while maintaining proper governance across all access points.
Costly deployment and management
Open-source catalog solutions drain resources as teams troubleshoot infrastructure rather than driving innovation with data.
Solution
Unified Governance, Multi-Engine Freedom
The Enterprise Catalog powered by Apache Polaris delivers an Iceberg REST Spec compliant metastore that unifies governance and works with all compatible engines.
Centralized Metadata
Provides enterprise-grade security, governance, and data discovery across your entire data ecosystem.
Seamless Integration
With any Iceberg-compatible engine eliminates vendor lock-in while maintaining consistent governance policies.
Flexible Deployment Options
For cloud, on-premises, or hybrid environments address data sovereignty needs and complex infrastructure requirements.
benefits
Complete Control, Enhanced Collaboration, Lower Risk
The Enterprise Catalog transforms how organizations manage and govern their data lakehouse, delivering enterprise-grade capabilities with open-source flexibility.
- Multi-Engine Interoperability: Read and write from any Apache Iceberg REST-compatible engine including Spark, Flink, Dremio, and more - with seamless concurrent access
- Cross-Engine Compatibility: Leverage standardized REST API implementation to work with your preferred query engines while maintaining consistent governance policies
- Hybrid Deployment Support: Run your catalog in any environment - cloud, on-premises, or hybrid - addressing data sovereignty requirements and complex infrastructure needs
- AI-Ready Data Discovery & Infrastructure: Rapidly prepare, access, and utilize data for AI and ML initiatives with properly governed and easily discoverable datasets
open-source
Powered by Apache Polaris (incubating)
The Open Standard for Lakehouse Catalogs
Apache Polaris (incubating) is an open-source, fully-featured catalog for Apache Iceberg™ that revolutionizes how organizations manage their data lakehouse environments. As a community-driven project under the Apache Software Foundation, Polaris implements Iceberg's REST API to enable seamless multi-engine interoperability across platforms including Apache Spark, Apache Flink, Dremio, and more.
Why Apache Polaris Matters
Unlike vendor-controlled catalogs, Polaris provides a truly open alternative that puts control back in your hands:
- True Multi-Engine Support: Access and manage your Iceberg tables from any REST-compatible query engine with consistent governance across all of them
- Centralized Metadata Management: Organize and track all metadata for your Iceberg tables in one secure location
- Enterprise-Grade Security: Implement robust role-based access controls (RBAC) for catalogs, namespaces, and tables
- Cloud-Agnostic Flexibility: Works seamlessly with AWS S3, Google Cloud Storage, Azure, and more
- Atomic Operations: Ensures data consistency and reliability across your entire data estate

CAPABILITIES
How Enterprise Catalog Works
Iceberg Clustering works natively with Apache Iceberg and seamlessly integrates with Dremio's table maintenance commands for simple, effective data organization.
- Foundation & Compatibility: Built on Apache Polaris, the catalog implements the industry-standard Iceberg REST API for seamless integration with Spark, Flink, Dremio, and other engines, ensuring maximum interoperability and eliminating vendor lock-in.
- Comprehensive Security & Governance: Employs enterprise-grade Role-Based Access Control (RBAC) with fine-grained permissions at catalog, namespace, and table levels, alongside column masks and row filters to protect sensitive data while maintaining appropriate access.
- Flexible Deployment & Storage: Supports hybrid, multi-cloud, and on-premises deployment models with integrations for AWS S3, Azure Storage, GCS, and S3-compatible storage options like MinIO and Ceph, addressing data sovereignty requirements.
- Advanced Management & Discovery: Features an intuitive UI for catalog management, automated table maintenance and optimization, intelligent data discovery, and catalog federation capabilities - functioning as a "Catalog of Catalogs" that supports Hive, Glue, and other external catalogs.
Learn More About Dremio