Data Mastery Hub: Term Resource for Data Professionals
Whether you're a newcomer to the world of big data and data lakes or an experienced pro looking to expand your knowledge, the Dremio Wiki provides insights and guidance for all your data-related needs. Dive in and unlock the power of your data today!
Data Analysis
Roll-up Analysis
Explore Roll-up Analysis, its benefits, limitations, and integration into a data lakehouse environment for data science professionals.
Data Lake
Row-Based Databases
Row-Based Databases is a data storage and management approach that organizes data in rows, providing benefits for data processing and analytics.
Data Management
Run-Length Encoding
Run-Length Encoding is a simple data compression technique that reduces the size of data by replacing consecutive repeated values with a count and the value itself.
Data Management
Saga Pattern
Learn about the Saga Pattern, its advantages, and how it fits into a data lakehouse environment.
Data Management
Sampling
Sampling is a technique used to analyze a subset of data in order to make inferences about the entire dataset.
Data Architecture
Scalability
Explore the role of Scalability in data processing and analytics and how it integrates with a data lakehouse environment.
Data Analysis
Scalar Functions
Explore Scalar Functions, their functionality, advantages, and integration within a data lakehouse environment.
Data Engineering
Schema
Schema is a way to organize and define the structure of data in a database or data lakehouse.
Data Engineering
Schema Evolution
Schema Evolution is the process of modifying a database schema over time to adapt to changing data requirements.
Machine Learning
Schema Learning Engine
Schema Learning Engine is an automated technology that analyzes and understands the structure and relationships within data, enabling efficient data processing and analytics.
Data Management
Schema Registry
Explore Schema Registry, its benefits, and integration with Data Lakehouse environments for efficient data processing and analytics.
Data Management
Schema-on-Read
Schema-on-Read is a data processing approach that allows for flexibility in storing and analyzing data without predefined schema constraints.
Data Management
Schema-on-Read vs Schema-on-Write
Schema-on-Read vs Schema-on-Write is a comparison of two data processing approaches that impact how data is structured and processed for analytics.
Data Management
Schema-on-Write
Schema-on-Write is a data management approach that involves defining the structure and format of data before it is stored.
Data Management
Search Engine
Search Engine is a software or tool designed to retrieve and present information from a database or collection of indexed web pages in response to a user's query.