The Dremio API is designed around RESTful principles, so next we will define some wrapper functions for HTTP GET, POST, PUT, and DELETE. Generally, POST and PUT requests will take parameters. For example, creating a source will take a source input. Input configurations are detailed within the Models subheading of each endpoint. Make sure to […]
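The wrapper functions described in this excerpt could be sketched as follows, using only the Python standard library. This is a minimal sketch, not the tutorial's own code: the `BASE_URL` value, the helper names (`build_request`, `send`, `api_get`, and so on), and the way the authorization value is passed are all assumptions made for illustration. Check the Dremio API documentation for the exact URL prefix and header format your deployment expects.

```python
import json
import urllib.request

# Assumption: a local Dremio instance; adjust host, port, and path prefix as needed.
BASE_URL = "http://localhost:9047/api/v3"


def build_request(method, endpoint, auth_value, body=None):
    """Build (but do not send) an HTTP request with a JSON body if one is given."""
    data = json.dumps(body).encode("utf-8") if body is not None else None
    headers = {
        "Content-Type": "application/json",
        # Assumption: the caller supplies the full Authorization header value.
        "Authorization": auth_value,
    }
    url = BASE_URL + "/" + endpoint.lstrip("/")
    return urllib.request.Request(url, data=data, headers=headers, method=method)


def send(request):
    """Send the request and decode a JSON response; return None for an empty body."""
    with urllib.request.urlopen(request) as response:
        text = response.read().decode("utf-8")
    return json.loads(text) if text else None


def api_get(endpoint, auth_value):
    return send(build_request("GET", endpoint, auth_value))


def api_post(endpoint, auth_value, body=None):
    return send(build_request("POST", endpoint, auth_value, body))


def api_put(endpoint, auth_value, body=None):
    return send(build_request("PUT", endpoint, auth_value, body))


def api_delete(endpoint, auth_value):
    return send(build_request("DELETE", endpoint, auth_value))
```

Separating `build_request` from `send` keeps the request construction easy to inspect without a running server. POST and PUT accept an optional `body`, which matches the point above that those requests generally take parameters.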
Tutorials
Looker is a popular platform for Business Intelligence and embedded analytics. Its web-based interface makes it easy for users to build powerful reports and visualizations on a wide range of data sources. Dremio is a self-service data platform. It runs between your data sources and your analytical tools, like Looker, to simplify and accelerate how […]
Amazon Simple Storage Service (S3) is a storage service that lets you store and access files of any size up to 5TB anywhere and at any time. Companies use S3 to store their data because it is highly scalable, reliable, and fast. In S3, the files you create and upload are stored in separate buckets […]
Amazon Simple Storage Service (S3) is a data lake service for storing files of any type and volume. It is valued for high availability and reliability, easy scaling, and fault-tolerance. S3 provides an unlimited space for storing files ranging in size from 1 byte to 5 terabytes. The files are stored in separate buckets, in […]
Data is nothing without analytics, but built-in features for analysis in BI tools like Qlik are often not robust enough to handle the scale of big data. Therefore, in a continuation of the tutorial series on how to use Dremio with Qlik, in the spotlight today is Hadoop, one of the basic technologies in the […]
Now you’ll see the S3 bucket called samples.dremio.com. Click on that folder to see a few files provided as samples. We’ll use the SF_Incidents2016.json data source for this tutorial, so click on that file. You’ll see a sample of the JSON; click OK to confirm the format. Now you’ll see a preview of this physical […]
What’s great is that most of these options are supported in Dremio. Let’s take a closer look.

| Relational  | Elasticsearch  |
|-------------|----------------|
| Schema      | Mapping        |
| Database    | Index          |
| Table       | Type           |
| Row         | Document       |
| Column      | Field          |
| SQL         | DSL+Painless   |
| Aggregation | Aggregation    |
| Projection  | Projection     |
| Boolean     | Boolean        |
| Primary Key | _id Field      |
| Join        | Does not exist |
| Foreign Key | Does not exist |

Following the hierarchical namespace model, […]
The amount of information that industries must keep safe is ever-increasing because it is so easy to collect data from customers, patients, employees, etc. Now more than ever, it is critical that data security and privacy remain a priority to protect against costly threats. Dremio provides a powerful and flexible […]
The Elasticsearch Query DSL is a powerful and simple way to express queries in Elasticsearch using JSON. Painless is a simple, secure scripting language for inline and stored scripts. When considered together, it is possible to map most SQL queries to Elasticsearch efficiently and with high performance. In this tutorial we will look at how […]
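To make the SQL-to-DSL idea concrete, here is a minimal sketch of how a simple filtered `GROUP BY` count might be expressed as an Elasticsearch search body. The function name and the `state` and `city` field names are hypothetical, and this is only an illustration of the mapping. It is not a description of the queries Dremio actually generates. It also assumes the fields are indexed so that exact-match `term` queries and `terms` aggregations apply to them.

```python
import json


def sql_group_count_to_dsl(filter_field, filter_value, group_field):
    """Build a search body roughly equivalent to:

    SELECT <group_field>, COUNT(*)
    FROM <index>
    WHERE <filter_field> = <filter_value>
    GROUP BY <group_field>
    """
    return {
        # size 0: return only aggregation results, no individual documents
        "size": 0,
        # WHERE clause: an exact-match filter inside a bool query
        "query": {"bool": {"filter": [{"term": {filter_field: filter_value}}]}},
        # GROUP BY + COUNT(*): a terms aggregation reports a doc_count per bucket
        "aggs": {"by_" + group_field: {"terms": {"field": group_field}}},
    }


# Hypothetical field names, used only for illustration.
body = sql_group_count_to_dsl("state", "NV", "city")
print(json.dumps(body, indent=2))
```

The printed JSON is what would be sent as the body of a search request. Expressions that have no direct DSL equivalent are where a scripting language such as Painless comes in.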
| Field      | Type    | Example                                 |
|------------|---------|-----------------------------------------|
| IncidentNum | String  | 170512983                               |
| Category   | String  | VEHICLE THEFT                           |
| Descript   | String  | STOLEN AUTOMOBILE                       |
| DayofWeek  | String  | Saturday                                |
| Date       | String  | 06/24/2017                              |
| Time       | String  | 00:30                                   |
| PdDistrict | String  | SOUTHERN                                |
| Resolution | String  | NONE                                    |
| Address    | String  | 9TH ST / MISSION ST                     |
| X          | String  | -122.414714295579                       |
| Y          | String  | 37.7762310404758                        |
| Location   | String  | (37.7762310404758°, -122.414714295579°) |
| PdId       | Integer | 17051298307021                          |

To generate zip codes, we’ll need […]
Elasticsearch is a popular open source datastore that enables developers to query data using a JSON-style domain-specific language, known as the Query DSL. Elasticsearch’s scale-out architecture, JSON data model, and text search capabilities make it an attractive datastore for many applications. Dremio makes it easy to connect your favorite BI tools to Elasticsearch, including Tableau. […]
In this tutorial we’ll work with data provided by Yelp to explore the powerful data preparation features of Dremio. Unlike ETL or Data Prep tools, Dremio does not make copies of the data. Instead, users create virtual datasets, and all data transformation is performed on the fly as it is being accessed through Dremio’s Apache […]
In this tutorial we’ll show you how to share a Dremio query profile. Query profiles store important metadata about queries that you run in Dremio, and can make it easier for Dremio’s engineers to debug any issues you encounter. You will need access to a Dremio deployment with at least one data source connected. […]
In this tutorial we’ll show you how to use Pandas with Dremio by working through a quantitative model for sports betting. We assume you are familiar with Python, and we assume you have access to a Dremio instance and are familiar with the basics. If you’re just getting started with Dremio we suggest you first […]
In this tutorial you will learn how to add users to Dremio so others can benefit from the ease with which Dremio lets you work with data. We’ll add and edit a user, and show you how you can administer users in the Dremio UI. To follow this tutorial you should have access to a […]