November 13, 2025

Apache Arrow, the Hostage Negotiator: Revisiting the case for Client Protocol Redesign

In 2017, Mark Raasveldt and Hannes Mühleisen (who went on to create DuckDB) presented a VLDB paper entitled “Don’t Hold My Data Hostage – A Case For Client Protocol Redesign.” Their paper proposed the use of columnar serialization to achieve order-of-magnitude improvements in query result transfer performance. Eight years later, this talk revisits Raasveldt and Mühleisen’s argument and describes the central role that the Apache Arrow project has played in realizing this vision—through the dissemination of Arrow IPC, Arrow Flight, Arrow Flight SQL, Arrow over HTTP, and ADBC across numerous open source and commercial query systems. The talk concludes with a call to action to introduce Arrow-based transport to the systems that continue to “hold data hostage.”

Topics Covered

Business Intelligence
Data Analytics
ELT/ETL
In-Memory Formats
Modernization and Migration
Open Source
Use Cases

Sign up to watch all Subsurface 2025 sessions