Search icon
Arrow left icon
All Products
Best Sellers
New Releases
Books
Videos
Audiobooks
Learning Hub
Newsletters
Free Learning
Arrow right icon
Data Engineering with Google Cloud Platform - Second Edition

You're reading from  Data Engineering with Google Cloud Platform - Second Edition

Product type Book
Published in Apr 2024
Publisher Packt
ISBN-13 9781835080115
Pages 476 pages
Edition 2nd Edition
Languages
Author (1):
Adi Wijaya Adi Wijaya
Profile icon Adi Wijaya

Table of Contents (19) Chapters

Preface 1. Part 1: Getting Started with Data Engineering with GCP
2. Chapter 1: Fundamentals of Data Engineering 3. Chapter 2: Big Data Capabilities on GCP 4. Part 2: Build Solutions with GCP Components
5. Chapter 3: Building a Data Warehouse in BigQuery 6. Chapter 4: Building Workflows for Batch Data Loading Using Cloud Composer 7. Chapter 5: Building a Data Lake Using Dataproc 8. Chapter 6: Processing Streaming Data with Pub/Sub and Dataflow 9. Chapter 7: Visualizing Data to Make Data-Driven Decisions with Looker Studio 10. Chapter 8: Building Machine Learning Solutions on GCP 11. Part 3: Key Strategies for Architecting Top-Notch Solutions
12. Chapter 9: User and Project Management in GCP 13. Chapter 10: Data Governance in GCP 14. Chapter 11: Cost Strategy in GCP 15. Chapter 12: CI/CD on GCP for Data Engineers 16. Chapter 13: Boosting Your Confidence as a Data Engineer 17. Index 18. Other Books You May Enjoy

Exercise – publishing event streams to Pub/Sub

In this exercise, we will try to stream data from Pub/Sub publishers. The goal is to create a data pipeline that can stream the data to a BigQuery table, but instead of using a scheduler (as we did in Chapter 4, Building Workflows for Batch Data Loading Using Cloud Composer), we will submit a Dataflow job that will run as an application to flow data from Pub/Sub to a BigQuery table. In the exercise, we will use the bike-sharing dataset we used in Chapter 3, Building a Data Warehouse in BigQuery. Here are the overall steps we will cover:

  1. Creating a Pub/Sub topic.
  2. Creating and running a Pub/Sub publisher using Python.
  3. Creating a Pub/Sub subscription.

We’ll start by creating a Pub/Sub topic.

Creating a Pub/Sub topic

We can create Pub/Sub topics using many approaches – for example, using the GCP console, the gcloud command, or through code. As a starter, let’s use the GCP console. Proceed...

lock icon The rest of the chapter is locked
Register for a free Packt account to unlock a world of extra content!
A free Packt account unlocks extra newsletters, articles, discounted offers, and much more. Start advancing your knowledge today.
Unlock this book and the full library FREE for 7 days
Get unlimited access to 7000+ expert-authored eBooks and videos courses covering every tech area you can think of
Renews at €14.99/month. Cancel anytime}