Gabriel Jakobson

Senior Solutions Architect, Dremio

Gabriel Jakobson's Articles and Resources

Blog Post

Exploring Cloud Data Lake Data Processing Options – Spark, EMR, Glue

Data processing is a critical part of the data pipeline. This article explores Apache Spark, Amazon EMR, and AWS Glue, and how each helps with data processing workloads in the data lake.



Data Preprocessing in Amazon Kinesis

Now take a look at the generate_metrics() function, which generates random data. The goal is to generate metrics and send them to Kinesis: requests, newly registered users, new orders, and user churn over some period of time. The metrics are generated randomly, but they are all sent to Kinesis as comma-separated […]

