Too Many Roundtrips: Metadata Overhead in the Modern Lakehouse
Motivation One of the challenges that comes with operating a high performance lakehouse query engine is metadata management. A query planner needs certain information about tables when planning a query, such as schema, statistics, location, etc. However, unlike a traditional data warehouse that has full ownership of all aspects of the data, modern data lakehouses […]