In this great article by ZDNet, Tony Baer explains how Arrow addresses the age-old problem of getting the compute-storage balance right for in-memory big data processing.
We liked this bit:
“Apache Arrow was conceived to solve a balance of system problem for data scientists: making sure that they didn’t run out of memory when running their models or run out of budget because they overallocated memory. “