March 2, 2023
9:35 am - 10:05 am PST
Fast Data Processing with Apache Arrow
Using Rust, Apache Arrow, and table formats, data can be efficiently processed closer to the hardware and without any pauses. This session will explain the pros and cons of Apache Arrow for data processing and compare the performance with Apache Spark — the “standard” in terms of distributed processing of big data. We will discuss the advantages of the Rust language, including Rust Arrow and the tools available, the missing pieces, and performance comparisons.
Topics Covered
Open Source