Search icon
Arrow left icon
All Products
Best Sellers
New Releases
Books
Videos
Audiobooks
Learning Hub
Newsletters
Free Learning
Arrow right icon
Practical Data Quality

You're reading from  Practical Data Quality

Product type Book
Published in Sep 2023
Publisher Packt
ISBN-13 9781804610787
Pages 318 pages
Edition 1st Edition
Languages
Author (1):
Robert Hawker Robert Hawker
Profile icon Robert Hawker

Table of Contents (16) Chapters

Preface Part 1 – Getting Started
Chapter 1: The Impact of Data Quality on Organizations Chapter 2: The Principles of Data Quality Chapter 3: The Business Case for Data Quality Chapter 4: Getting Started with a Data Quality Initiative Part 2 – Understanding and Monitoring the Data That Matters
Chapter 5: Data Discovery Chapter 6: Data Quality Rules Chapter 7: Monitoring Data Against Rules Part 3 – Improving Data Quality for the Long Term
Chapter 8: Data Quality Remediation Chapter 9: Embedding Data Quality in Organizations Chapter 10: Best Practices and Common Mistakes Index Other Books You May Enjoy

Preface

Practical Data Quality is about how to take your organization from a basic awareness of a data quality problem to a position of having data good enough to truly underpin success.

The book begins by explaining how bad data can affect an organization’s process efficiency, decision-making, and ability to remain compliant. It then establishes the key concepts you need to understand to be successful with data quality and the end-to-end process I have used to transform data throughout my career.

The book goes on to explain each step of the data quality journey, starting with creating a business case and managing the hectic period at the start of an initiative. Then the book establishes the typical stakeholders you will need to engage with through the process, how to work with them to identify which data to focus on, and the specific rules that the data should comply with.

Next, it shows how to monitor data against the rules that have been established and how to actually start correcting the data.

To close, the book explains how to embed good data quality practices into the day-to-day activities of your organization and outlines best practices and challenges to be avoided in your work.

By the end of the book, you will have a complete outline of how you can transform data quality in your organization, armed with examples to catch the interest of your stakeholders, and templates to accelerate your work.

Who this book is for

The book is aimed at anyone intending to improve data quality in their organization. The book outlines the basics of data quality for people new to the topic, but provides insights into every step of the data quality life cycle, using real-world examples and templates to accelerate progress. Typical readers are business leaders, such as chief operating officers or chief executive officers, who see data adversely affecting their success and data teams, such as analytics or governance teams who want to optimize their data quality approach.

What this book covers

Chapter 1, The Impact of Data Quality on Organizations, explains the importance of data quality and defines what is meant by bad data.

Chapter 2, The Basics of Data Quality, explains key data quality concepts, including the typical roles involved, the data quality improvement cycle, and the overall fit of a data quality initiative into a wider data management program and organization.

Chapter 3, The Business Case for Data Quality, explains how to calculate the costs and benefits of a data quality initiative, combining these with qualitative matters into a compelling business case for funding.

Chapter 4, Getting Started With a Data Quality Initiative, identifies the activities which are required immediately after a business case approval, such as supplier and tool selection, hiring, early remediation activities and planning. It provides a framework to ensure that all these activities make progress at the required rate early on.

Chapter 5, Data Discovery, explains how to understand business strategy and how it links to data, processes, and analytics. Once this is understood, the chapter explains how to perform a data profile and interpret the results to derive the first data quality rules.

Chapter 6, Data Quality Rules, explains how to derive a full set of business data quality rules, covering all the key elements including defining rule scope, thresholds, dimensions, and weightings. Well developed rules identify the data which does not meet the required standard efficiently and in a repeatable fashion.

Chapter 7, Monitoring Data Against Rules, outlines the various dashboards and reports required to efficiently and effectively monitor data quality against business rules.

Chapter 8, Data Quality Remediation, explains how to use the data quality dashboards and reports to prioritize and then deliver data quality improvement activities.

Chapter 9, Embedding Data Quality into Organizations, describes how to ensure that data quality improvement does not finish when the active initiative ends, , by ensuring it becomes part of day-to-day business practices.

Chapter 10, Best Practices and Common Mistakes, outlines the key best practices for a successful data quality initiative and the common mistakes that reduce the effectiveness of the work. The book ends with an analysis of how new technology such as generative AI will impact work in this field.

To get the most out of this book

You should have a basic understanding of how businesses operate, including the following:

  • Awareness of how organizations are structured, including different departments and organizational practices
  • Awareness of key processes in organizations such as procure to pay or order to cash
  • Awareness of key systems in organizations such as ERP systems and CRM systems
  • Awareness of data management concepts such as master data management, data ownership, and stewardship

Templates and diagrams

We have made some templates and diagrams available in the book’s GitHub repository here: https://github.com/PacktPublishing/Data-Quality-in-Practice.

The content included is as follows:

File name

Description

Chapter 1 – Data Governance versus Process Speed Diagram (Figure 1.3

A diagram used in the book that people may wish to tailor to their own presentations.

Chapter 2 – Business Case Template (Figure 2.3)

A template created for the book to show how you can provide quantitative calculations for your data quality initiative.

Chapter 2 – Typical One-Page Plan (Figure 2.1)

A one-page plan template that could be used as a starting point.

Chapter 6 – Report Hierarchy Diagram (Figure 6.2)

A diagram used to show how the various data quality dashboards relate to one another. This could be used in a presentation to generate ideas and feedback.

Data Quality Dashboards v2

Power BI reports developed to support the book in the monitoring chapter.

To open this file, you will need to download Power BI Desktop from Microsoft (for free).

Please note, in Power Query, the path to the source data file was removed for security reasons. Please do not apply changes (that is, use Apply Later when you open the report). If you use Apply Now, the data will disappear from the report and it will no longer be possible to explore it.

Data Quality Remediation Prioritization v

Another Power BI report – this time showing the prioritization work for the remediation chapter.

The same notes apply as for Data Quality Dashboards v2 – the source file link was removed, so changes should not be applied.

Tips or important notes

Appear like this.

Get in touch

Feedback from our readers is always welcome.

General feedback: If you have questions about any aspect of this book, email us at customercare@packtpub.com and mention the book title in the subject of your message.

Errata: Although we have taken every care to ensure the accuracy of our content, mistakes do happen. If you have found a mistake in this book, we would be grateful if you would report this to us. Please visit www.packtpub.com/support/errata and fill in the form.

Piracy: If you come across any illegal copies of our works in any form on the internet, we would be grateful if you would provide us with the location address or website name. Please contact us at copyright@packt.com with a link to the material.

If you are interested in becoming an author: If there is a topic that you have expertise in and you are interested in either writing or contributing to a book, please visit authors.packtpub.com.

Share Your Thoughts

Once you’ve read Data Quality in Practice, we’d love to hear your thoughts! Please click here to go straight to the Amazon review page for this book and share your feedback.

Your review is important to us and the tech community and will help us make sure we’re delivering excellent quality content.

Download a free PDF copy of this book

Thanks for purchasing this book!

Do you like to read on the go but are unable to carry your print books everywhere?

Is your eBook purchase not compatible with the device of your choice?

Don’t worry, now with every Packt book you get a DRM-free PDF version of that book at no cost.

Read anywhere, any place, on any device. Search, copy, and paste code from your favorite technical books directly into your application.

The perks don’t stop there, you can get exclusive access to discounts, newsletters, and great free content in your inbox daily

Follow these simple steps to get the benefits:

  1. Scan the QR code or visit the link below

https://packt.link/free-ebook/9781804610787

  1. Submit your proof of purchase
  2. That’s it! We’ll send your free PDF and other benefits to your email directly
lock icon The rest of the chapter is locked
Next Chapter arrow right
You have been reading a chapter from
Practical Data Quality
Published in: Sep 2023 Publisher: Packt ISBN-13: 9781804610787
Register for a free Packt account to unlock a world of extra content!
A free Packt account unlocks extra newsletters, articles, discounted offers, and much more. Start advancing your knowledge today.
Unlock this book and the full library FREE for 7 days
Get unlimited access to 7000+ expert-authored eBooks and videos courses covering every tech area you can think of
Renews at $15.99/month. Cancel anytime}