Apache Arrow: A New Gold Standard for Dataset Transport
This talk will discuss the role that Apache Arrow and Arrow Flight play in disrupting previous approaches to creating data services that transport large datasets. We’ll look at the technical details of why the Arrow protocol is an attractive choice and share specific examples of where Arrow has been employed for better performance and resource efficiency. We’ll also discuss the implications for the upcoming generation of data systems.
Wes McKinney is a software developer and entrepreneur focusing on analytical computing. He created the Python pandas project and is a co-creator of Apache Arrow. He authored two editions of the reference book, Python for Data Analysis. Wes is a member of The Apache Software Foundation and also a PMC member for Apache Parquet. He is now the CTO and co-founder of Voltron Data, a new startup working on accelerated computing technologies powered by Apache Arrow.