Search icon
Arrow left icon
All Products
Best Sellers
New Releases
Books
Videos
Audiobooks
Learning Hub
Newsletters
Free Learning
Arrow right icon
Data Engineering with Google Cloud Platform

You're reading from  Data Engineering with Google Cloud Platform

Product type Book
Published in Mar 2022
Publisher Packt
ISBN-13 9781800561328
Pages 440 pages
Edition 1st Edition
Languages
Author (1):
Adi Wijaya Adi Wijaya
Profile icon Adi Wijaya

Table of Contents (17) Chapters

Preface Section 1: Getting Started with Data Engineering with GCP
Chapter 1: Fundamentals of Data Engineering Chapter 2: Big Data Capabilities on GCP Section 2: Building Solutions with GCP Components
Chapter 3: Building a Data Warehouse in BigQuery Chapter 4: Building Orchestration for Batch Data Loading Using Cloud Composer Chapter 5: Building a Data Lake Using Dataproc Chapter 6: Processing Streaming Data with Pub/Sub and Dataflow Chapter 7: Visualizing Data for Making Data-Driven Decisions with Data Studio Chapter 8: Building Machine Learning Solutions on Google Cloud Platform Section 3: Key Strategies for Architecting Top-Notch Data Pipelines
Chapter 9: User and Project Management in GCP Chapter 10: Cost Strategy in GCP Chapter 11: CI/CD on Google Cloud Platform for Data Engineers Chapter 12: Boosting Your Confidence as a Data Engineer Other Books You May Enjoy

Practicing developing a data warehouse

Now we are set and ready to build our first data warehouse. We will proceed with the help of two scenarios. Each scenario will have different learning purposes.

In the first scenario, we are going to focus on how to use the tools. After understanding the tools, in the second scenario, we will focus on the principles. Principles here mean that even though we know how to use the tools, there are many possibilities for using them. If we are talking about principles, there is no right or wrong answer. What we can do is to learn from common patterns and a number of theories. 

We will use the San Francisco bike-sharing dataset. The dataset relates to a bike-sharing company. The company records the trip data of its members. Each bike trip contains information about the stations, and lastly, each station is located in certain regions. This dataset is very simple compared to a real-world data warehouse, but for practice purposes, this is a good...

lock icon The rest of the chapter is locked
Register for a free Packt account to unlock a world of extra content!
A free Packt account unlocks extra newsletters, articles, discounted offers, and much more. Start advancing your knowledge today.
Unlock this book and the full library FREE for 7 days
Get unlimited access to 7000+ expert-authored eBooks and videos courses covering every tech area you can think of
Renews at $15.99/month. Cancel anytime}