Increasing Performance with Arrow and Gandiva

In this talk, Vivek will start with an overview of how Arrow represents columnar data; and how it is more efficient on modern processors. Then he will introduce Gandiva and explain: 1) How it uses LLVM and generates optimized compiled code for expressions; and 2) How it leverages SIMD instructions to gain performance. For demonstration, Vivek will use Dremioi, to show how a Data Lake engine, can use Gandiva for improved SQL query processing power. To wrap things up, Vivek will give a glimpse of on-going work in Gandiva, such as a project to improve code generation.

Topics Covered

Apache Arrow
In-Memory Formats

Ready to Get Started? Here Are Some Resources to Help


What Is a Data Lakehouse?

The data lakehouse is a new architecture that combines the best parts of data lakes and data warehouses. Learn more about the data lakehouse and its key advantages.

read more
Simplifying Data Mesh Featured Image


Simplifying Data Mesh for Self-Service Analytics on an Open Data Lakehouse

The adoption of data mesh as a decentralized data management approach has become popular in recent years, helping teams overcome challenges associated with centralized data architecture.

read more

Data Mesh in Practice: How MSK Accelerates Cancer Research with Dremio’s Open Data Lakehouse

Learn how MSK accelerates cancer research by enabling faster access to data using Dremio's open data lakehouse

read more

Get Started Free

No time limit - totally free - just the way you like it.

Sign Up Now

See Dremio in Action

Not ready to get started today? See the platform in action.

Watch Demo

Talk to an Expert

Not sure where to start? Get your questions answered fast.

Contact Us