Dremio Jekyll

Dremio is self service data. Make your tools better and your teams more productive.

Request a Demo

Dremio provides a quantum leap in performance, based on four areas of innovation.

  • Apache Arrow Execution

    From 1 to 1000+ nodes, run on dedicated infrastructure or in your Data Lake, in the cloud or on-prem.

  • Dremio Reflections™

    Optimized physical data structures that accelerate data and queries automatically, up to 1000x faster.

  • Native Push-Downs

    Optimized query semantics for each data source – relational, NoSQL, HDFS, Amazon S3, and more.

  • Universal Relational Algebra

    Cost-based query planner automatically substitutes query plans to make optimal use of Data Reflections™.


Analyze all your data from one place. With any tool, instantly.

  • Native integrations to Relational, NoSQL, Hadoop, S3, and more.

  • Optimized query push downs for all sources.

  • Live connect with any BI tool, Python, R, or SQL. No extracts, no cubes.


A new perspective on Data Lineage, across all your data and tools.

  • Visualize how your data is queried, transformed, and joined across sources.

  • Analyze data lineage across your data lake, other sources, and data pipelines.

  • Understand the impact of security threats and error remediation downstream.


Curate your data and share with your team. Build together.

  • Search and discover data from all your sources.

  • Filter, transform, and join any source to curate data for your needs.

  • Share your work with your team to build together.


Dremio runs in your data lake, or clustered natively on dedicated infrastructure, in the cloud or on-prem.

On Hadoop or Dedicated Infrastructure