Fugue SQL: Extending SQL Interface for End-to-End Data Pipelines
Fugue SQL is an open source SQL interface for Python compute frameworks such as Pandas, Spark, Dask and Blazing SQL.As Dremio has made data engineering easier, FugueSQL makes data analysis with Python easier for SQL lovers. With Fugue SQL, users can utilize SQL as a grammar for end-to-end data workflows. In this demo we’ll go over how to get started in leveraging distributed computing to process big data with Fugue SQL and various backends.
Han Wang is the Tech Lead of Lyft’s Machine Learning Platform, focusing on distributed computing solutions. Before joining Lyft, he worked at Microsoft, Hudson River Trading, Amazon and Quantlab. Han is the founder of the Fugue project, aimed at democratizing distributed computing and machine learning.