Search icon
Arrow left icon
All Products
Best Sellers
New Releases
Books
Videos
Audiobooks
Learning Hub
Newsletters
Free Learning
Arrow right icon
Fundamentals of Analytics Engineering

You're reading from  Fundamentals of Analytics Engineering

Product type Book
Published in Mar 2024
Publisher Packt
ISBN-13 9781837636457
Pages 332 pages
Edition 1st Edition
Languages
Authors (7):
Dumky De Wilde Dumky De Wilde
Profile icon Dumky De Wilde
Fanny Kassapian Fanny Kassapian
Profile icon Fanny Kassapian
Jovan Gligorevic Jovan Gligorevic
Profile icon Jovan Gligorevic
Juan Manuel Perafan Juan Manuel Perafan
Profile icon Juan Manuel Perafan
Lasse Benninga Lasse Benninga
Profile icon Lasse Benninga
Ricardo Angel Granados Lopez Ricardo Angel Granados Lopez
Profile icon Ricardo Angel Granados Lopez
Taís Laurindo Pereira Taís Laurindo Pereira
Profile icon Taís Laurindo Pereira
View More author details

Table of Contents (23) Chapters

Preface 1. Prologue
2. Part 1:Introduction to Analytics Engineering
3. Chapter 1: What Is Analytics Engineering? 4. Chapter 2: The Modern Data Stack 5. Part 2: Building Data Pipelines
6. Chapter 3: Data Ingestion 7. Chapter 4: Data Warehousing 8. Chapter 5: Data Modeling 9. Chapter 6: Transforming Data 10. Chapter 7: Serving Data 11. Part 3: Hands-On Guide to Building a Data Platform
12. Chapter 8: Hands-On Analytics Engineering 13. Part 4: DataOps
14. Chapter 9: Data Quality and Observability 15. Chapter 10: Writing Code in a Team 16. Chapter 11: Automating Workflows 17. Part 5: Data Strategy
18. Chapter 12: Driving Business Adoption 19. Chapter 13: Data Governance 20. Chapter 14: Epilogue 21. Index
22. Other Books You May Enjoy

Summary

In this chapter, we have looked at both the problems of data quality and the potential solutions to improve it.

We saw that data quality issues come from three key areas. First, from the source system, where data might be incomplete, unreliable, or inconsistent. Second, from the infrastructure and pipelines that process and transform data as it is ingested. In that case, you might have issues with data quality because the data is too late, corrupted, or missing. It might also be that a mistake is made in the transformations, where, for example, we get the granularity or precision of the data wrong. Finally, data quality issues can arise from problems with data governance, most notably from inconsistencies in the definitions and documentation or a lack of access management, cost management, or metadata management. This all leads to a misunderstanding of the data and the data’s lineage and dependencies, which, in turn, leads to suboptimal decision-making and increased...

lock icon The rest of the chapter is locked
Register for a free Packt account to unlock a world of extra content!
A free Packt account unlocks extra newsletters, articles, discounted offers, and much more. Start advancing your knowledge today.
Unlock this book and the full library FREE for 7 days
Get unlimited access to 7000+ expert-authored eBooks and videos courses covering every tech area you can think of
Renews at €14.99/month. Cancel anytime}