July 29, 2020

Apache Arrow: A New Gold Standard for Dataset Transport

This talk will discuss the role that Apache Arrow and Arrow Flight play in disrupting previous approaches to creating data services that transport large datasets. We’ll look at the technical details of why the Arrow protocol is an attractive choice and share specific examples of where Arrow has been employed for better performance and resource efficiency. We’ll also discuss the implications for the upcoming generation of data systems.

Topics Covered

Apache Arrow Flight

Dremio Subsurface for Apache Arrow

In-Memory Formats

Speakers

Wes McKinney

Wes McKinney is a software developer and entrepreneur focusing on analytical computing. He created the Python pandas project and is a co-creator of Apache Arrow. He authored two editions of the reference book, Python for Data Analysis. Wes is a member of The Apache Software Foundation and also a PMC member for Apache Parquet. He is now the CTO and co-founder of Voltron Data, a new startup working on accelerated computing technologies powered by Apache Arrow.

Apache Arrow: A New Gold Standard for Dataset Transport

Speakers

Discover How Apache Arrow Accelerates AI and Analytics with Unified, AI-Ready Data Products

Get Started Free

See Dremio in Action

Talk to an Expert

Ready to Get Started?