Data Mastery Hub: Term Resource for Data Professionals

Whether you're a newcomer to the world of big data and data lakes or an experienced pro looking to expand your knowledge, the Dremio Wiki provides insights and guidance for all your data-related needs. Dive in and unlock the power of your data today!

Data Lake

Elasticsearch

Elasticsearch is a distributed, open-source search and analytics engine built on Apache Lucene.

Data Search and Indexing

Elasticsearch Document

An Elasticsearch Document is the basic unit of data stored in Elasticsearch: a JSON object containing one or more fields. Documents are what Elasticsearch-based applications index, search, and retrieve.
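
As a minimal sketch of what "a JSON object containing one or more fields" can look like, the snippet below builds a document as a Python dictionary and round-trips it through JSON text. The field names and values are hypothetical and chosen only for illustration.

```python
import json

# Hypothetical document: field names and values are illustrative only.
document = {
    "title": "Quarterly sales report",
    "author": "A. Analyst",
    "published": "2024-01-15",
    "tags": ["sales", "finance"],
    "views": 42,
}

# A document is exchanged as JSON text, so serialize it and parse it back.
payload = json.dumps(document)
restored = json.loads(payload)
```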

Data Search and Indexing

Elasticsearch Indexes

An Elasticsearch Index is a logical collection of documents, organized so that large volumes of data can be searched and analyzed quickly.

Data Search and Indexing

Elasticsearch Mapping

Elasticsearch Mapping defines how documents and their fields are indexed and stored in Elasticsearch, including the data type of each field, so that search and analytics operate on structured data.
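
A mapping is usually written as a JSON body that lists each field under "properties" with its type. The sketch below expresses such a body as a Python dictionary; the field names are hypothetical, and field_types is an illustrative helper, not part of any Elasticsearch client.

```python
# Hypothetical mapping body: field names are illustrative only.
mapping = {
    "mappings": {
        "properties": {
            "title": {"type": "text"},        # analyzed for full-text search
            "author": {"type": "keyword"},    # matched as an exact value
            "published": {"type": "date"},
            "views": {"type": "integer"},
        }
    }
}


def field_types(mapping_body):
    """Illustrative helper: summarize a mapping body as {field: type}."""
    properties = mapping_body["mappings"]["properties"]
    return {name: spec["type"] for name, spec in properties.items()}


summary = field_types(mapping)
```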

Data Search and Indexing

Elasticsearch Painless

Elasticsearch Painless is the scripting language built for Elasticsearch. It is used to write inline and stored scripts for data processing and analytics tasks such as computing custom field values.

Data Search and Indexing

Elasticsearch Scope

Elasticsearch Scope is a feature in Dremio that allows users to optimize, update, or migrate data from Elasticsearch to a data lakehouse environment efficiently.

Data Lake

Elasticsearch Sharding

Elasticsearch Sharding is a technique that splits an index into smaller pieces, called shards, and distributes them across the nodes of an Elasticsearch cluster.
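
The idea of spreading data across shards can be sketched with hash-based routing: hash a routing key (such as a document ID) and take the remainder modulo the number of primary shards. This is a simplified illustration, not Elasticsearch's implementation; zlib.crc32 is a stand-in hash, and pick_shard is a hypothetical helper name.

```python
import zlib


def pick_shard(routing_key: str, num_primary_shards: int) -> int:
    """Map a routing key to a shard number in [0, num_primary_shards).

    Simplified sketch: crc32 stands in for the hash a real cluster uses.
    """
    if num_primary_shards < 1:
        raise ValueError("num_primary_shards must be at least 1")
    digest = zlib.crc32(routing_key.encode("utf-8"))
    return digest % num_primary_shards


# Assign ten hypothetical document IDs to three shards.
doc_ids = [f"doc-{i}" for i in range(10)]
placement = {doc_id: pick_shard(doc_id, 3) for doc_id in doc_ids}
```

Because the shard number depends on the shard count, changing the count would send the same key to a different shard, which is one reason the number of primary shards is typically chosen when an index is created.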

Data Search and Indexing

Elasticsearch Type

Elasticsearch Type is a logical category used to organize and store documents within an Elasticsearch index. Mapping types were deprecated in later releases and removed in Elasticsearch 8.0.

Data Integration

ELT

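ELT (Extract, Load, Transform) is a data processing approach in which data is extracted from source systems, loaded into a target system in raw form, and then transformed inside that target, using the target's own compute for processing and analytics.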
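
The ordering of the three steps can be sketched with an in-memory SQLite database standing in for the target system. The table names, column names, and sample rows are hypothetical; the point is only that the raw data is loaded first and cleaned afterwards with the target's SQL engine.

```python
import sqlite3

# Extract: hypothetical raw rows as they might arrive from a source system.
raw_rows = [
    ("2024-01-01", "  widget ", "19.99"),
    ("2024-01-02", "gadget", "5.00"),
    ("2024-01-02", "WIDGET", "20.01"),
]

target = sqlite3.connect(":memory:")  # stand-in for the target system

# Load: store the data untouched, every column as text.
target.execute("CREATE TABLE raw_sales (sold_on TEXT, product TEXT, amount TEXT)")
target.executemany("INSERT INTO raw_sales VALUES (?, ?, ?)", raw_rows)

# Transform: clean and aggregate inside the target, after loading.
target.execute(
    """
    CREATE TABLE sales_by_product AS
    SELECT lower(trim(product)) AS product,
           round(sum(CAST(amount AS REAL)), 2) AS total
    FROM raw_sales
    GROUP BY lower(trim(product))
    """
)

totals = dict(target.execute("SELECT product, total FROM sales_by_product"))
```

In an ETL flow, by contrast, the trimming, casting, and aggregation would happen before the insert, and the raw rows would never reach the target.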

ETL

ELT Pipelines

ELT Pipelines are data workflows that extract data from source systems, load it into a data lakehouse environment, and transform it there, enabling efficient analytics and data management.

Data Storage

Embedded Database

Embedded Database is a software component that is integrated within an application, providing local data storage and management capabilities.

AI

Embedding Layer

Embedding Layer is a neural network layer used in machine learning to represent categorical variables as dense vectors of continuous values, with the vectors learned during training.
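
At its core, an embedding layer behaves like a lookup table: each category index selects one row of a weight matrix. The sketch below shows only that lookup, using randomly initialized weights and no training; the class name, vocabulary, and dimensions are hypothetical.

```python
import random


class EmbeddingLayer:
    """Lookup table mapping category indices to dense vectors (sketch only)."""

    def __init__(self, num_categories, dim, seed=0):
        rng = random.Random(seed)
        # In a real model these weights are learned; here they stay random.
        self.weights = [
            [rng.uniform(-0.1, 0.1) for _ in range(dim)]
            for _ in range(num_categories)
        ]

    def __call__(self, indices):
        # Each index selects its row of the weight table.
        return [self.weights[i] for i in indices]


# Hypothetical categorical variable with three possible values.
vocab = {"red": 0, "green": 1, "blue": 2}
layer = EmbeddingLayer(num_categories=len(vocab), dim=4)
vectors = layer([vocab["blue"], vocab["red"], vocab["blue"]])
```

The same category always maps to the same vector, so the first and third results above are identical.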

Data Management

Encapsulation

Encapsulation is a data management technique that combines data storage and data processing capabilities in a single unified system.

Data Storage

End-user Database

End-user Database is a centralized data storage system that allows end-users to directly access and manipulate their data without the need for IT intervention.

Data Engineering

Enrichment

Enrichment is the process of augmenting raw data with additional information, such as reference data from another source, to improve data processing and analytics.
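
One common form of enrichment is joining raw records against a reference table. The sketch below adds country and region fields to order records by looking up a country code; all record fields, the lookup table, and the enrich helper are hypothetical.

```python
# Hypothetical raw records and reference data.
raw_orders = [
    {"order_id": 1, "country_code": "DE", "amount": 120.0},
    {"order_id": 2, "country_code": "US", "amount": 80.5},
    {"order_id": 3, "country_code": "ZZ", "amount": 10.0},  # unknown code
]

country_lookup = {
    "DE": {"country": "Germany", "region": "EMEA"},
    "US": {"country": "United States", "region": "AMER"},
}


def enrich(order, lookup):
    """Return a copy of the order with country and region fields added."""
    missing = {"country": None, "region": None}
    extra = lookup.get(order["country_code"], missing)
    return {**order, **extra}


enriched = [enrich(order, country_lookup) for order in raw_orders]
```

Records with no match keep their original fields and receive empty values for the added ones, so no raw data is lost during enrichment.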
