Implementing a Data Lakehouse Architecture in AWS — Part 1 of 4
The Data Lakehouse is a new architecture that combines the flexibility, low cost, and scale of data lakes with the data warehouses' power.

Search for a command to run...

Series
In this series, I will talk about some options to implement the Data Lakehouse architecture in AWS, we will use a wide range of resources including some Open Source leading products.
The Data Lakehouse is a new architecture that combines the flexibility, low cost, and scale of data lakes with the data warehouses' power.

Introduction In part 1 of this article series, we walked through how to feed a Data Lake built on top of Amazon S3, based on streaming data, using Amazon Kinesis. In part 2, we will cover all of the steps needed to build a Data Lakehouse, using trip ...

Introduction In our previous article, part 2 of the series, we walked through the extraction, processing, and creation of some data mart, using the New York City taxi trip data which is publicly available to do consumption. We used some of the princi...
