Data Mastery Hub: Term Resource for Data Professionals
Whether you're a newcomer to the world of big data and data lakes or an experienced pro looking to expand your knowledge, the Dremio Wiki provides insights and guidance for all your data-related needs. Dive in and unlock the power of your data today!
Data Lake
Elasticsearch
Elasticsearch is a distributed, open-source search and analytics engine built on Apache Lucene.
Data Search and Indexing
Elasticsearch Document
Elasticsearch Document is a unit of data stored in Elasticsearch that consists of a JSON object containing one or more fields. It is used for data indexing, searching, and retrieval in Elasticsearch-based applications.
Data Search and Indexing
Elasticsearch Indexes
Elasticsearch Indexes are logical collections of documents that organize and store large volumes of data for quick and efficient search and analysis.
Data Search and Indexing
Elasticsearch Mapping
Elasticsearch Mapping is a feature that defines how documents and their fields are indexed and stored in Elasticsearch, including the data type of each field, so that data can be searched and analyzed efficiently.
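As a rough illustration, a mapping can be written as a JSON body that assigns a type to each field. The sketch below builds one as a Python dictionary and checks a sample document against it. The field names and the sample document are assumptions made for this example, and nothing is sent to a cluster.

```python
import json

# Hypothetical mapping for an "articles" index; the field names are assumptions.
mapping = {
    "mappings": {
        "properties": {
            "title": {"type": "text"},       # analyzed for full-text search
            "author": {"type": "keyword"},   # exact-match filtering and aggregations
            "published": {"type": "date"},
            "views": {"type": "integer"},
        }
    }
}

# A sample document whose fields line up with the mapping above.
document = {
    "title": "Getting started with data lakes",
    "author": "jdoe",
    "published": "2023-04-01",
    "views": 128,
}

# List any document fields that the mapping does not declare.
declared = mapping["mappings"]["properties"]
undeclared = [field for field in document if field not in declared]

print(json.dumps(mapping, indent=2))
print("undeclared fields:", undeclared)
```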
Data Search and Indexing
Elasticsearch Painless
Elasticsearch Painless is the default scripting language in Elasticsearch, designed to be secure and fast for inline and stored scripts used in tasks such as document updates, computed fields, and custom scoring.
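To give a sense of what a Painless script looks like, the sketch below builds a request body in the shape used for a scripted document update. The `views` field and the `count` parameter are hypothetical names chosen for this example. The body is only constructed and printed here, not sent to a cluster.

```python
import json

# Request body for a scripted update; "views" and "count" are hypothetical names.
update_body = {
    "script": {
        "lang": "painless",
        # The script text itself is Painless: add a parameter to a document field.
        "source": "ctx._source.views += params.count",
        "params": {"count": 1},
    }
}

print(json.dumps(update_body, indent=2))
```

Passing values through `params` rather than writing them into the script text keeps the script source constant between calls.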
Data Search and Indexing
Elasticsearch Scope
Elasticsearch Scope is a feature in Dremio that allows users to optimize, update, or migrate data from Elasticsearch to a data lakehouse environment efficiently.
Data Lake
Elasticsearch Sharding
Elasticsearch Sharding is a technique that splits an index into smaller pieces, called shards, and distributes them across multiple nodes or servers in an Elasticsearch cluster.
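The core idea can be sketched as hashing a routing value (by default the document ID) and taking the result modulo the number of primary shards. The example below is a simplified model under stated assumptions: `pick_shard` is a hypothetical helper, and `zlib.crc32` is a stand-in hash chosen for this sketch, not the hash function Elasticsearch itself uses.

```python
import zlib

def pick_shard(routing_key: str, num_primary_shards: int) -> int:
    """Map a routing key to a shard number in [0, num_primary_shards)."""
    # crc32 is a stand-in hash for this sketch, not Elasticsearch's own.
    return zlib.crc32(routing_key.encode("utf-8")) % num_primary_shards

# Hypothetical document IDs placed across three primary shards.
doc_ids = ["order-1", "order-2", "order-3", "order-4", "order-5", "order-6"]
placement = {doc_id: pick_shard(doc_id, 3) for doc_id in doc_ids}

for doc_id, shard in placement.items():
    print(f"{doc_id} -> shard {shard}")
```

Because the shard number depends on the shard count, a scheme like this only finds existing documents again if the number of primary shards stays the same.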
Data Search and Indexing
Elasticsearch Type
Elasticsearch Type is a logical category used to group documents within an index in older versions of Elasticsearch. Mapping types have since been deprecated and removed in newer releases.
Data Integration
ELT
ELT (Extract, Load, Transform) is a data processing approach in which data is extracted from source systems, loaded into the target system in raw form, and then transformed inside that target system.
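The ordering is what sets ELT apart: the raw data lands in the target first, and the transformation runs there afterwards. Below is a minimal sketch of that ordering. It assumes an in-memory SQLite database as a stand-in for the target system, and made-up sales rows and table names.

```python
import sqlite3

# Extract: rows as they might arrive from a source system (made-up sample data).
source_rows = [
    ("2024-01-05", "widget", "19.99"),
    ("2024-01-05", "gadget", "5.00"),
    ("2024-01-06", "widget", "19.99"),
]

target = sqlite3.connect(":memory:")  # stand-in for the target system

# Load: store the data unchanged, with amounts still held as text.
target.execute("CREATE TABLE raw_sales (sale_date TEXT, product TEXT, amount TEXT)")
target.executemany("INSERT INTO raw_sales VALUES (?, ?, ?)", source_rows)

# Transform: runs inside the target, after loading, using its own SQL engine.
target.execute(
    """
    CREATE TABLE daily_sales AS
    SELECT sale_date, ROUND(SUM(CAST(amount AS REAL)), 2) AS total
    FROM raw_sales
    GROUP BY sale_date
    """
)

daily_totals = target.execute(
    "SELECT sale_date, total FROM daily_sales ORDER BY sale_date"
).fetchall()
print(daily_totals)
```

The raw table stays available after the transform, so other transformations can be derived from it later without extracting the data again.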
ETL
ELT Pipelines
ELT Pipelines are data workflows that extract data from source systems, load it into a data lakehouse environment, and transform it there, enabling efficient analytics and data management.
Data Storage
Embedded Database
Embedded Database is a database engine that is integrated within an application and runs as part of it, providing local data storage and management capabilities without a separate database server.
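SQLite, available in Python through the standard `sqlite3` module, is a widely used example of this kind of database. The sketch below runs the engine inside the application process. The `settings` table and its contents are assumptions made for illustration.

```python
import sqlite3

# The database engine runs inside this process; there is no server to connect to.
# ":memory:" keeps the data in RAM; a file path would persist it to local disk.
conn = sqlite3.connect(":memory:")

conn.execute("CREATE TABLE settings (key TEXT PRIMARY KEY, value TEXT)")
conn.execute("INSERT INTO settings VALUES (?, ?)", ("theme", "dark"))
conn.commit()

theme = conn.execute(
    "SELECT value FROM settings WHERE key = ?", ("theme",)
).fetchone()[0]
print(theme)
conn.close()
```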
AI
Embedding Layer
Embedding Layer is a neural network layer used in machine learning to represent categorical variables, such as words or IDs, as dense vectors of continuous values that are learned during training.
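At lookup time, an embedding layer behaves like a table with one row of numbers per category. The sketch below shows only that lookup step, using randomly initialized values. The vocabulary, the dimension, and the `embed` helper are assumptions, and no training takes place.

```python
import random

random.seed(0)  # fixed seed so the sketch is repeatable

vocabulary = ["red", "green", "blue"]  # hypothetical categories
embedding_dim = 4                      # length of each dense vector

# One row of continuous values per category; training would adjust these.
index_of = {category: i for i, category in enumerate(vocabulary)}
embedding_table = [
    [random.uniform(-1.0, 1.0) for _ in range(embedding_dim)]
    for _ in vocabulary
]

def embed(category: str) -> list:
    """Look up the dense vector for a category."""
    return embedding_table[index_of[category]]

print(embed("green"))
```

In a real model the table values start out like this and are then updated by gradient descent, so that categories used in similar ways end up with similar vectors.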
Data Management
Encapsulation
Encapsulation is a technique that bundles data together with the operations that process it in a single unit, exposing the data only through a defined interface rather than allowing direct access to its internal state.
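As a rough illustration of keeping data and the logic that processes it in one unit, the sketch below uses a small Python class. The `Counter` class is hypothetical. It holds its state internally and allows changes only through a method that validates its input.

```python
class Counter:
    """Keeps its count internal and changes it only through methods."""

    def __init__(self) -> None:
        self._count = 0  # leading underscore marks this as internal by convention

    def increment(self, step: int = 1) -> None:
        # The only way to change the count, so the rule below always holds.
        if step < 1:
            raise ValueError("step must be a positive integer")
        self._count += step

    @property
    def value(self) -> int:
        # Read-only view of the internal state.
        return self._count

counter = Counter()
counter.increment()
counter.increment(4)
print(counter.value)  # prints 5
```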
Data Storage
End-user Database
End-user Database is a centralized data storage system that allows end-users to directly access and manipulate their data without the need for IT intervention.
Data Engineering
Enrichment
Enrichment is the process of enhancing and augmenting raw data with additional information to improve data processing and analytics.
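One common form of enrichment is joining raw records against a reference table to attach extra attributes. The sketch below does this with plain dictionaries. The events, the `user_regions` lookup, and the `enrich` helper are all made up for this example.

```python
# Raw events and a reference table; all values are made-up sample data.
raw_events = [
    {"user_id": 1, "action": "login"},
    {"user_id": 2, "action": "purchase"},
    {"user_id": 3, "action": "login"},
]
user_regions = {1: "EMEA", 2: "APAC"}

def enrich(event: dict, regions: dict) -> dict:
    """Return a copy of the event with a region field added."""
    return {**event, "region": regions.get(event["user_id"], "unknown")}

enriched_events = [enrich(event, user_regions) for event in raw_events]

for event in enriched_events:
    print(event)
```

Records with no match in the reference table are kept and marked `unknown` rather than dropped, so the enriched output has the same number of rows as the raw input.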