Dremio Jekyll

Python on HDFS

Dremio makes it easy to connect HDFS to your favorite BI and data science tools, including Python. And Dremio makes queries against HDFS up to 1,000x faster. Dremio:

  • Makes your data easy, approachable, and interactive – gigabytes, terabytes or petabytes, no matter where it's stored. Dremio optimizes your data so you don't have to.
  • Reduces the need for ETL and data warehouses, and replaces cubes and extracts.
  • Helps your teams help themselves, while extending your governance and security controls.

Dremio is open source, and free to use.

Deploy today learn more about Dremio

Python

Python is an interpreted, high-level, general-purpose programming language – widely used for analysis of large-scale datasets.

Python

The Hadoop Distributed File System (HDFS) is the primary data storage system used by Hadoop applications..

Dremio lets you do more with Python and with HDFS

Connect Python to more sources

Connect HDFS to more tools