Today, we’re excited to announce our Dremio October 2021 (19.0) release!
This month’s release expands our support for analyzing Apache Iceberg tables, broadens our support for analytical SQL functions, and establishes Dremio as the first query engine outside of AWS to integrate with AWS Lake Formation.
AWS Lake Formation Integration
Companies can configure Dremio to verify data access privileges with various authorization clients. For example, Dremio supports Ranger-based authorization for Hive data sources, where Dremio checks Ranger permissions at runtime to allow or disallow access to Hive data.
Within the AWS ecosystem, AWS Lake Formation provides access controls for datasets in AWS Glue, and is used by customers as a central location to define security policies that are shared across multiple tools. With this release, we’re excited to announce that customers can now leverage access permissions defined in Lake Formation to provide secure access to Glue datasets.
Once configured, Dremio verifies with Lake Formation at runtime if a user can access a Glue dataset.
We’re particularly excited about our new Lake Formation integration, as it establishes Dremio as the first query engine outside of AWS to integrate with Lake Formation. This is just one of many areas where we partner closely with Amazon to deliver integration between our services, collaborate on new features, and ultimately enable AWS customers to build next-generation data architectures.
For more details and instructions on how to enforce Lake Formation permissions on new and existing Glue sources, check out the Lake Formation page in the Dremio docs.
Try Dremio’s Interactive Demo
Explore this interactive demo and see how Dremio's Intelligent Lakehouse enables Agentic AI
Broader Coverage for Analytical SQL Functions
Aside from developing first-class integration with leading ecosystem tools, we’re always working to empower companies to run analytics on their data as fast and as easy as possible. With this release, customers can now run a broader range of analytical SQL queries, such as filtered aggregates and PIVOT/UNPIVOT.
Extending Dremio’s SQL support makes analytics easier for users. For example, consider the following aggregation query, which calculates average selling prices for sales regions, where sales regions are defined on column values that meet a certain criteria:
SELECT
SUM(CASE WHEN region IN 'NE', 'SE' THEN selling_price ELSE 0.0 END) / SUM(CASE WHEN region IN 'NE', 'SE' THEN 1.0 ELSE 0.0 END) AS EastASP,
SUM(CASE WHEN region IN 'NW', 'SW' THEN selling_price ELSE 0.0 END) / SUM(CASE WHEN region IN 'NW', 'SW' THEN 1.0 ELSE 0.0 END) AS WestASP,
SUM(CASE WHEN product_type = 'installed' THEN selling_price ELSE 0.0 END) / SUM(CASE WHEN product_type = 'installed' THEN 1.0 ELSE 0.0 END) AS InstalledASP
FROM source.customer_transactions WHERE ...
Customers can now greatly simplify the above query to:
SELECT
AVG(selling_price) FILTER (WHERE region IN ('NE', 'SE')) AS EastASP,
AVG(selling_price) FILTER (WHERE region IN ('NW', 'SW')) AS WestASP,
AVG(selling_price) FILTER (WHERE product_type = 'installed') AS InstalledASP
FROM source.customer_transactions WHERE ...
For more information on Dremio’s SQL coverage, check out the SQL Functions page in the Dremio docs.
Apache Iceberg Support
Apache Iceberg is an open source table format for analytical datasets. Iceberg enables multiple applications to work together on the same data in a transactionally consistent manner, and defines additional information on the state of datasets as they evolve over time.
With this release, customers can now leverage Dremio’s native Iceberg support to analyze Iceberg tables which leverage a Hive Metastore. Dremio’s high-performance reader automatically identifies which datasets are saved in the Iceberg format, and utilizes the table information from Iceberg manifest files. We recommend using Hive Metastore as a metadata catalog when working with Iceberg tables in production environments.
This feature is generally available. If you’d like to learn more about how Dremio supports Apache Iceberg, visit the Apache Iceberg page in the Dremio docs.
We’re excited about the features and improvements we’ve made this month! For a complete list of additional new features, enhancements, changes, and fixes, check out the Release Notes. And, as always, we look forward to your feedback — please post any questions or comments on the Dremio Community!
Ingesting Data Into Apache Iceberg Tables with Dremio: A Unified Path to Iceberg
By unifying data from diverse sources, simplifying data operations, and providing powerful tools for data management, Dremio stands out as a comprehensive solution for modern data needs. Whether you are a data engineer, business analyst, or data scientist, harnessing the combined power of Dremio and Apache Iceberg will undoubtedly be a valuable asset in your data management toolkit.
Oct 12, 2023·Product Insights from the Dremio Blog
Table-Driven Access Policies Using Subqueries
This blog helps you learn about table-driven access policies in Dremio Cloud and Dremio Software v24.1+.
Aug 31, 2023·Dremio Blog: News Highlights
Dremio Arctic is Now Your Data Lakehouse Catalog in Dremio Cloud
Dremio Arctic bring new features to Dremio Cloud, including Apache Iceberg table optimization and Data as Code.