The Dremio SQL Lakehouse Platform

Enable high-performing BI dashboards and interactive analytics directly on the data lake, and eliminate the need for data warehouses.

Interactive Performance, Directly on the Lakehouse

Dremio is the only SQL lakehouse platform built from the ground up to deliver high-performing BI dashboards and interactive analytics directly on the data lake.
Deploy and run Dremio anywhere: on AWS, on Azure, or in your own environment.
  • Columnar Cloud Cache (C3)
  • Data Reflections
  • Apache Arrow

Columnar Cloud Cache (C3)

Columnar Cloud Cache (C3) enables NVMe-level I/O performance on S3/ADLS/GCS by using the NVMe/SSD storage built into cloud compute instances such as Amazon EC2 and Azure Virtual Machines.

By selectively caching the data required to satisfy client workloads, C3 also eliminates over 90% of S3/ADLS/GCS I/O costs, which can make up 10-15% of the cost of each query.
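
As a rough back-of-the-envelope check, the two figures above can be combined into an overall per-query saving. This is only a sketch: it assumes the I/O share and the reduction apply uniformly to a query, it uses exactly 90% where the text says "over 90%", and the function name `query_cost_saving` is made up for illustration.

```python
def query_cost_saving(io_share: float, io_reduction: float = 0.90) -> float:
    """Fraction of total per-query cost saved when object-store I/O cost is reduced.

    io_share:     fraction of query cost spent on S3/ADLS/GCS I/O (0.10 to 0.15 per the text)
    io_reduction: fraction of that I/O cost removed by caching (assumed 0.90 here)
    """
    return io_share * io_reduction


for share in (0.10, 0.15):
    saving = query_cost_saving(share)
    print(f"I/O share {share:.0%}: saves about {saving:.1%} of query cost")
```

Under these assumptions, the saving works out to roughly 9% to 13.5% of the cost of each query.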

Data Reflections

Data Reflections are data structures that intelligently precompute aggregations and other optimizations on data, so you don’t have to do complex aggregations on the fly.

Reflections are completely transparent to end users — analysts query their tables and views directly, and the Dremio optimizer picks the best Reflections to satisfy the query.

Reflections are also easy to create and are refreshed automatically, so you don’t have to manage their lifecycle.
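
The general idea of answering a query from a precomputed aggregation can be sketched in a few lines. The example below is a toy illustration only, not Dremio's optimizer or its Reflection format: the data, the column names, and the helpers `build_reflection`, `pick_reflection` and `sum_by` are all hypothetical.

```python
from collections import defaultdict

# Toy fact table of (region, product, amount) rows. Hypothetical data.
SALES = [
    ("EU", "widget", 10.0),
    ("EU", "gadget", 5.0),
    ("US", "widget", 7.5),
    ("US", "widget", 2.5),
]
COLUMNS = ("region", "product")


def build_reflection(rows, dims):
    """Precompute SUM(amount) grouped by the given dimension columns."""
    idx = [COLUMNS.index(d) for d in dims]
    totals = defaultdict(float)
    for row in rows:
        totals[tuple(row[i] for i in idx)] += row[2]
    return {"dims": tuple(dims), "totals": dict(totals)}


def pick_reflection(reflections, query_dims):
    """Choose the smallest precomputed aggregate that covers the query's dimensions."""
    usable = [r for r in reflections if set(query_dims) <= set(r["dims"])]
    return min(usable, key=lambda r: len(r["totals"]), default=None)


def sum_by(rows, reflections, query_dims):
    """Answer SUM(amount) GROUP BY query_dims, using a precomputed aggregate if one fits."""
    chosen = pick_reflection(reflections, query_dims)
    if chosen is None:
        # Nothing precomputed covers this query: aggregate the raw rows on the fly.
        return build_reflection(rows, query_dims)["totals"]
    # Roll the precomputed totals up to the requested dimensions.
    idx = [chosen["dims"].index(d) for d in query_dims]
    result = defaultdict(float)
    for key, total in chosen["totals"].items():
        result[tuple(key[i] for i in idx)] += total
    return dict(result)


reflections = [build_reflection(SALES, ("region", "product"))]
print(sum_by(SALES, reflections, ("region",)))
```

The caller of `sum_by` only names the table and the grouping columns. Whether the answer comes from raw rows or from a precomputed aggregate is decided inside the function, which is the sense in which the text calls Reflections transparent to end users.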

Apache Arrow

Dremio is a columnar engine powered by Apache Arrow, the open source standard for columnar, in-memory computing (which we co-created).

Dremio leverages Arrow Gandiva to compile queries to hardware-optimized native code that maximizes CPU utilization with vectorized execution directly on Arrow buffers.

In addition, Dremio leverages Arrow Flight to deliver 20x faster performance than ODBC and JDBC, so you can analyze data at scale faster than ever.

Governed, Self-Service Data Access

Empower self-service data access for users while centralizing security and governance through a shared semantic layer.

Unlimited Concurrency

Tackle multiple concurrent workloads and support the needs of thousands of analysts with an elastic, multi-engine architecture that scales infinitely.
  • 60% lower compute costs
    Eliminate the need to over-provision infrastructure with right-sized engines that automatically start, stop and scale based on current workload demands.
  • 0 noisy neighbors
    Engines are physically isolated, so workloads can run independently without bottlenecks and resource contention.
  • 100% control of resources
    Manage resource allocation by using workload management rules to route queries to engines.
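
Rule-based routing of queries to engines can be pictured with a small sketch. It is a generic first-match-wins router under assumed names, not Dremio's rule syntax or API: the `Query` and `Rule` types, the user groups, and the engine names are all hypothetical.

```python
from dataclasses import dataclass
from typing import Callable


@dataclass
class Query:
    user_group: str  # hypothetical attribute used for routing
    sql: str


@dataclass
class Rule:
    name: str
    matches: Callable[[Query], bool]
    engine: str


# Hypothetical rules, evaluated top to bottom; the first match wins.
RULES = [
    Rule("dashboards", lambda q: q.user_group == "bi", "bi-engine"),
    Rule("data-science", lambda q: q.user_group == "ds", "adhoc-engine"),
]
DEFAULT_ENGINE = "default-engine"


def route(query: Query) -> str:
    """Return the name of the engine that should run this query."""
    for rule in RULES:
        if rule.matches(query):
            return rule.engine
    # No rule matched: fall back to a shared default engine.
    return DEFAULT_ENGINE


print(route(Query("bi", "SELECT region, SUM(amount) FROM sales GROUP BY region")))
print(route(Query("etl", "SELECT 1")))
```

Because each rule maps to a separate engine, a heavy workload from one group is kept off the engine that serves another group's dashboards.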

Built for the Modern Enterprise

  • Advanced enterprise security

    Deliver insights safely with security built in at every layer, from end-to-end data encryption to passwordless authentication.
  • Seamless integration with BI tools

    Connect your favorite BI tools like Tableau and Power BI to create interactive dashboards that bring your data lake to life.
  • Manage everything via SQL or REST API

    Use SQL to administer Dremio objects, and use APIs to create scripts to automate Dremio processes.
  • ANSI SQL

    Run your existing queries and reports without rewrites, thanks to comprehensive support for ANSI SQL.

Build the Foundation for an Open Data Architecture

Build an open architecture that gives you the flexibility to use your favorite tools and easily adopt future innovation.

Enjoy Dremio Anywhere

  • Dremio Cloud

    Use a fully managed SQL lakehouse platform.
  • Dremio Software

    Deploy and run Dremio in your environment.
