Search icon
Arrow left icon
All Products
Best Sellers
New Releases
Books
Videos
Audiobooks
Learning Hub
Newsletters
Free Learning
Arrow right icon
Data Engineering with Google Cloud Platform

You're reading from  Data Engineering with Google Cloud Platform

Product type Book
Published in Mar 2022
Publisher Packt
ISBN-13 9781800561328
Pages 440 pages
Edition 1st Edition
Languages
Author (1):
Adi Wijaya Adi Wijaya
Profile icon Adi Wijaya

Table of Contents (17) Chapters

Preface Section 1: Getting Started with Data Engineering with GCP
Chapter 1: Fundamentals of Data Engineering Chapter 2: Big Data Capabilities on GCP Section 2: Building Solutions with GCP Components
Chapter 3: Building a Data Warehouse in BigQuery Chapter 4: Building Orchestration for Batch Data Loading Using Cloud Composer Chapter 5: Building a Data Lake Using Dataproc Chapter 6: Processing Streaming Data with Pub/Sub and Dataflow Chapter 7: Visualizing Data for Making Data-Driven Decisions with Data Studio Chapter 8: Building Machine Learning Solutions on Google Cloud Platform Section 3: Key Strategies for Architecting Top-Notch Data Pipelines
Chapter 9: User and Project Management in GCP Chapter 10: Cost Strategy in GCP Chapter 11: CI/CD on Google Cloud Platform for Data Engineers Chapter 12: Boosting Your Confidence as a Data Engineer Other Books You May Enjoy

Exercise – using GCP in AutoML to train an ML model

As we learned earlier in this chapter, AutoML is an automated way for you to build an ML model. It will handle the model selection, hyperparameter tuning, and various data preparation steps. Specifically for the data preparation part, it will not be smart enough to transform data from very raw tables and automatically create clean features. What AutoML will do, however, is perform simple data preparation tasks, such as detecting numeric, binary, categorical, and text features, and then apply the required transformation to be used in the ML training process. Let's learn how to do this. Here are the steps that you will complete in this exercise:

  • Create Vertex AI datasets.
  • Set up the AutoML training.

For the use case and dataset, we will use the credit card default dataset from our previous exercise. First, go to your GCP console and find and click Vertex AI. If you haven't enabled the API, there will...

lock icon The rest of the chapter is locked
Register for a free Packt account to unlock a world of extra content!
A free Packt account unlocks extra newsletters, articles, discounted offers, and much more. Start advancing your knowledge today.
Unlock this book and the full library FREE for 7 days
Get unlimited access to 7000+ expert-authored eBooks and videos courses covering every tech area you can think of
Renews at $15.99/month. Cancel anytime}