Increasing Performance with Arrow and Gandiva

In this talk, Vivek will start with an overview of how Arrow represents columnar data; and how it is more efficient on modern processors. Then he will introduce Gandiva and explain: 1) How it uses LLVM and generates optimized compiled code for expressions; and 2) How it leverages SIMD instructions to gain performance. For demonstration, Vivek will use Dremioi, to show how a Data Lake engine, can use Gandiva for improved SQL query processing power. To wrap things up, Vivek will give a glimpse of on-going work in Gandiva, such as a project to improve code generation.

Topics Covered

Apache Arrow
In-Memory Formats


Vivekanand Vellanki

Vivekanand Vellanki

Vivek Vellanki comes from a systems programming background, having worked at Microsoft and MapR prior to joining Dremio. His expertise is in the areas of distributed systems, performance, Hadoop and SQL query engines.

Ready to Get Started? Here Are Some Resources to Help

Case Study

When E-Commerce Explodes – The More Data the More Dremio

read more
On demand webinar graphic


Real-World Strategies to Optimize Data Platform Cost

read more
On-Demand webinar graphic


Centralize Data Security Governance on your Open Data Lakehouse with Dremio & Privacera

read more

Get Started Free

No time limit - totally free - just the way you like it.

Sign Up Now

See Dremio in Action

Not ready to get started today? See the platform in action.

Watch Demo

Talk to an Expert

Not sure where to start? Get your questions answered fast.

Contact Us