The two key components of any data pipeline are data lakes and warehouses. This course highlights use cases for each type of storage and dives into the available data lake and warehouse solutions on Google Cloud Platform in technical detail. The course also describes the role of a data engineer, explains the benefits of a successful data pipeline to business operations, and examines why data engineering should be done in a cloud environment. Learners will get hands-on experience with data lakes and warehouses on Google Cloud Platform using QwikLabs.
This module introduces the Data Engineering specialization and this course.
Introduction to Data Engineering
This module describes the role of a data engineer and explains why data engineering should be done in the cloud.
Building a Data Lake
In this module, we describe what a data lake is and how to use Google Cloud Storage as your data lake on GCP.
Building a Data Warehouse
In this module, we discuss BigQuery as a data warehousing option on GCP.
This module reviews all the topics covered in the course.