March 2, 2023

9:35 am - 10:05 am PST

Fast Data Processing with Apache Arrow

Using Rust, Apache Arrow, and table formats, data can be efficiently processed closer to the hardware and without any pauses. This session will explain the pros and cons of Apache Arrow for data processing and compare the performance with Apache Spark — the “standard” in terms of distributed processing of big data. We will discuss the advantages of the Rust language, including Rust Arrow and the tools available, the missing pieces, and performance comparisons.

Topics Covered

Open Source

Sign up to Watch On-Demand