
Why and How Netflix Created and Migrated to a New Table Format: Iceberg
Netflix created Apache Iceberg to address the performance and the inherent issues as well as usability challenges of using Apache Hive tables in large and demanding data lake environments. In this session, join Ted Gooch, Database Architect at Netflix, as he explains:· How Iceberg was developed by Netflix to solve some of the inherent issues in the Hive table formatrn· How increased expressive partitioning capability allows for highly selective filters over large datarn· How data marts were required to use data directly in a data lake and how migrating to Iceberg allows workloads with low latencyrn· How the flexible format of Iceberg allows for evolution of both schema and partitioningrn· How the separation of logical table from physical layout allows for background optimization of storage
Topics Covered
Speakers

Ted Gooch
Ted works on the big data compute team at Netflix which manages a petabyte-scale Data-warehouse Platform that services analytic and reporting needs for the entire Netflix organization. He has a passion for solving problems in the big data space. At Netflix, Ted partners with data engineers, data scientists and analytic engineers to leverage the newest platform features for large scale, high performance advanced analytics. Ted holds a Masters in Computer Science from the University of Southern California and is an Apache Iceberg committer.