Dremio Unveils Groundbreaking Apache Iceberg Book and Enhanced Lakehouse Capabilities

May 20, 2024

SANTA CLARA, Calif., May 20, 2024 – Dremio, the unified lakehouse platform for self-service analytics, is thrilled to announce the release of the first-ever book on Apache Iceberg, authored by industry experts Tomer Shiran and Alex Merced. This landmark publication provides an in-depth guide to Apache Iceberg, offering valuable insights and practical knowledge for data professionals.

As the first comprehensive resource on the subject, the book demystifies Apache Iceberg, an open table format for huge analytic datasets. Readers will gain a thorough understanding of its architecture, capabilities, and best practices for implementation. This essential guide is poised to become the go-to reference for anyone looking to leverage the power of Apache Iceberg in modern data architectures.

In addition to this exciting publication, Dremio has also made several key announcements around Apache Iceberg:

  1. Support for Apache Iceberg Kafka Connect Sink for Real-Time Ingestion: Dremio now supports the Apache Iceberg Kafka Connect sink with its Lakehouse Catalog, powered by the open-source Nessie transactional catalog. The sink has been contributed to the Apache Iceberg project and is pending PMC approval; it enhances real-time data ingestion capabilities. Comprehensive documentation and tutorials will be made available once the contribution is finalized.
  2. Expanded Deployment Options: Dremio’s Apache Iceberg lakehouse now supports deployment in any environment: cloud, on-premises, or hybrid. This expansion includes highly regulated, air-gapped, and data-sovereignty-governed settings, ensuring that organizations can leverage Iceberg’s capabilities while meeting stringent compliance and security requirements.
  3. Commitment to Open Source with Nessie Integration: Dremio has reaffirmed its dedication to open source by incorporating Nessie into Dremio Software, in addition to the managed Nessie capabilities already offered in Dremio Cloud. These capabilities simplify data engineering with Git-like workflows on lakehouse data, enabling users to run production workloads with end-to-end Dremio support for a Nessie-native Apache Iceberg catalog.

"We're excited to not only provide the first-ever book on Apache Iceberg but also to enhance our platform with cutting-edge features that empower data professionals," said Alex Merced, Senior Technical Evangelist at Dremio. "Our expanded deployment options, real-time ingestion support, and deepened commitment to open source underscore our mission to simplify and accelerate data analytics for organizations worldwide."

For more information and to purchase the book, visit www.dremio.com.

About Dremio

Dremio is the unified lakehouse platform for self-service analytics and AI, serving hundreds of global enterprises, including Maersk, Amazon, Regeneron, NetApp, and S&P Global. Customers rely on Dremio for cloud, hybrid, and on-premises lakehouses to power their data mesh, data warehouse migration, data virtualization, and unified data access use cases. Based on open source technologies, including Apache Iceberg and Apache Arrow, Dremio provides an open lakehouse architecture enabling the fastest time to insight and platform flexibility at a fraction of the cost. To learn more, visit www.dremio.com or follow the company on LinkedIn.
