About this book
Knowing how to architect and implement complex data pipelines is a highly sought-after skill. Data engineers are responsible for building these pipelines and transforming data from one format to another so that it can be processed by a data analyst or data scientist to further work on. Amazon Web Services offers a range of tools to ease the job of a data engineer, making it the preferred platform for performing data engineering tasks.
This data engineering book will take you through the services and the skills you need to architect and implement data pipelines on AWS. You'll begin by understanding data engineering concepts and some of the core AWS tools that form a part of the data engineers toolkit. You'll then architect a data pipeline, review raw data sources, identify varied data consumers, and transform raw datasets to meet their needs. The book will show you how to populate data marts or data warehouses and how a data lakehouse fits into the picture. Next, you'll be introduced to some AWS tools for analyzing your data, including tools for ad-hoc SQL queries, and creating data visualizations and dashboards. In the final chapters, you'll perform predictive analytics using Amazon AI and machine learning tools.
By the end of this book, you'll be able to carry out data engineering tasks and implement a complex data pipeline on AWS independently.
- Publication date:
- November 2021