Webinars

Flexible Data Lake Architectures for Seamless Real-time Data and Machine Learning Integrations

This talk was born from some of our greatest victories won and worst losses suffered while designing and implementing data lakes, with a focus on real-time processing and machine learning pipeline integration. We will go through the various design problems spawned from the specific integrations and solutions we have used—from caching to avert the Slowly Changing Dimension problem through operational and analytical cluster separation to the fully-fledged MLOps process. We will showcase, using real examples, how those use cases are reflected in the data lake architecture, both when building from scratch and evolving an existing solution.For the data architect, this session will provide a greater understanding of available design patterns. To a data scientist, it will provide a better understanding of the soon-to-be working environment.

Topics Covered

Data Lake Storage

Speakers

Kamil Owczarek

Kamil Owczarek

Kamil Owczarek is the Head of Big Data at GFT Poland and a data engineer at heart. Kamil specializes in projects connected to stream processing and machine learning. His personal motto: “Data is always more important than you recognize it to be”.

Piotr Kosmowski

Piotr Kosmowski

Piotr Kosmowski is a solution architect at GFT. He is experienced with the design and delivery of multi-tier solutions for the financial sector including microservices, public APIs and data lake in both on-premises and cloud environments. Working closely with the client, business and development team, Piotr helps improve organization delivery processes with an agile mindset.

Ready to Get Started? Here Are Some Resources to Help

Case Study

Case Study

Dremio Supports Moonfare’s High-Performance Culture with a High-Performance Lakehouse

Moonfare replaced a PostgreSQL-based data warehouse on Amazon Web Services (AWS) with a Dremio data lakehouse to offer data engineers, analysts and business users a high performance platform for business intelligence and predictive analytics empowering them to make better data-driven decisions.

read more

Case Study

Case Study: DB Cargo Gives Users the Green Light to All Data with Dremio

Deutsche Bahn Group (DB) is one of the world's leading mobility and logistics companies. The DB Cargo business unit manages DB's rail freight business.

read more
Case Study

Case Study

Case Study: Amazon Accelerates Supply Chain Decision Making with Dremio

Amazon's Supply Chain Finance Analytics team developed a new analytics architecture with Dremio to simplify ETL processes, accelerate queries, and provide analytics on a unified view of the data.

read more

Get Started Free

No time limit - totally free - just the way you like it.

Sign Up Now

See Dremio in Action

Not ready to get started today? See the platform in action.

Watch Demo

Talk to an Expert

Not sure where to start? Get your questions answered fast.

Contact Us