Deploy Dremio On Prem

Dremio’s Data Lake Engine delivers lightning-fast query speed and a self-service semantic layer operating directly against data lake storage.

Deploy via YARN
Deploy via Kubernetes
Deploy via Docker
Evaluate via Linux RPM
Evaluate via TAR
Evaluate via Dremio University

Lightning-fast queries directly on data lake storage

Apache Arrow, Data Reflections, and other Dremio technologies work together to speed up queries by up to 1,000x.

  1. Arrow Flight

    Parallel zero-copy RPC between client & Dremio

  2. Columnar Execution

    Elastic Apache Arrow-based vectorized execution

  3. Data Reflections

    Patent-pending indexing & aggregation technology

  4. Columnar Cloud Cache (C3)

    Real-time, distributed NVMe caching & prefetching
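
As a rough illustration of why a columnar layout helps analytical queries, the sketch below sums one column in a row-oriented and a column-oriented layout. It is plain Python, not Dremio or Apache Arrow code, and the sample table, column names, and function names are all hypothetical.

```python
from array import array

# Hypothetical sample data: a three-row table in row-oriented form.
rows = [
    {"region": "east", "units": 3, "price": 2.5},
    {"region": "west", "units": 5, "price": 4.0},
    {"region": "east", "units": 2, "price": 1.5},
]

# The same table in columnar form: one contiguous, typed buffer per
# numeric column (an assumption made for illustration only).
columns = {
    "region": [r["region"] for r in rows],
    "units": array("q", (r["units"] for r in rows)),
    "price": array("d", (r["price"] for r in rows)),
}


def total_units_row_wise(table):
    """Visit every record, even though only one field is needed."""
    total = 0
    for record in table:
        total += record["units"]
    return total


def total_units_columnar(cols):
    """Scan only the 'units' buffer; the other columns are never read."""
    return sum(cols["units"])


def revenue_columnar(cols):
    """Column-at-a-time arithmetic over two aligned buffers."""
    return sum(u * p for u, p in zip(cols["units"], cols["price"]))
```

Both totals agree, but the columnar version reads a single typed buffer, which is the access pattern that vectorized engines are built to exploit.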

Self-service semantic layer

Analysts and data scientists can discover, explore, and curate data using Dremio’s intuitive UI, while IT maintains governance and security.

Join with anything

Powerful joining abilities mean that your data is always accessible without ETL. Join across clouds, including any mix of private cloud, public cloud, and on-premises storage. Dremio ships with over a dozen connectors, and Dremio Hub includes many other community-developed connectors.

Flexible and open

Dremio accesses your existing data where it’s stored now. You don’t have to send your data to Dremio or store it in proprietary formats that lock you in.
