Reader small image

You're reading from  Fundamentals of Analytics Engineering

Product typeBook
Published inMar 2024
PublisherPackt
ISBN-139781837636457
Edition1st Edition
Right arrow
Authors (7):
Dumky De Wilde
Dumky De Wilde
author image
Dumky De Wilde

Dumky is an award-winning analytics engineer with close to 10 years of experience in setting up data pipelines, data models and cloud infrastructure. Dumky has worked with a multitude of clients from government to fintech and retail. His background is in marketing analytics and web tracking implementations, but he has since branched out to include other areas and deliver value from data and analytics across the entire organization.
Read more about Dumky De Wilde

Fanny Kassapian
Fanny Kassapian
author image
Fanny Kassapian

Fanny has a multidisciplinary background across various industries, giving her a unique perspective on analytics workflows, from engineering pipelines to driving value for the business. As a consultant, Fanny helps companies translate opportunities and business needs into technical solutions, implement analytics engineering best practices to streamline their pipelines, and treat data as a product. She is an avid promoter of data democratization, through technology and literacy
Read more about Fanny Kassapian

Jovan Gligorevic
Jovan Gligorevic
author image
Jovan Gligorevic

Jovan, an Analytics Engineer, specializes in data modeling and building analytical dashboards. Passionate about delivering end-to-end analytics solutions and enabling self-service analytics, he has a background in business and data science. With skills ranging from machine learning to dashboarding, Jovan has democratized data across diverse industries. Proficient in various tools and programming languages, he has extensive experience with the modern data stack. Jovan enjoys providing trainings in dbt and Power BI, sharing his knowledge generously
Read more about Jovan Gligorevic

Juan Manuel Perafan
Juan Manuel Perafan
author image
Juan Manuel Perafan

Juan Manuel Perafan 8 years of experience in the realm of analytics (5 years as a consultant). Juan was the first analytics engineer hired by Xebia back in 2020. Making him one of the earliest adopters of this way of working. Besides helping his clients realizing the value of their data, Juan is also very active in the data community. He has spoken at dozens of conferences and meetups around the world (including Coalesce 2023). Additionally, he is the founder of the Analytics Engineering meetup in the Netherlands as well as the Dutch dbt meetup
Read more about Juan Manuel Perafan

Lasse Benninga
Lasse Benninga
author image
Lasse Benninga

Lasse has been working in the dataspace since 2018, starting out as a Data Engineer at a large airline, then switching towards Cloud Engineering for a consultancy and working for different clients in the retailing and healthcare space. Since 2021, he's an Analytics Engineer at Xebia Data, merging software/platform engineering with analytics passion. As a consultant Lasse has seen many different clients, ranging from retail, healthcare, ridesharing industry, and trading companies. He has implemented multiple data platforms and worked in all three major clouds, leveraging his knowledge of data and analytics to provide value
Read more about Lasse Benninga

Ricardo Angel Granados Lopez
Ricardo Angel Granados Lopez
author image
Ricardo Angel Granados Lopez

Ricardo, an Analytics Engineer with a strong background in data engineering and analysis, is a quick learner and tech enthusiast. With a Master's in IT Management specializing in Data Science, he excels in using various programming languages and tools to deliver valuable insights. Ricardo, experienced in diverse industries like energy, transport, and fintech, is adept at finding alternative solutions for optimal results. As an Analytics Engineer, he focuses on driving value from data through efficient data modeling, using best practices, automating tasks and improving data quality
Read more about Ricardo Angel Granados Lopez

Taís Laurindo Pereira
Taís Laurindo Pereira
author image
Taís Laurindo Pereira

Taís is a versatile data professional with experience in a diverse range of organizations - from big corporations to scale-ups. Before her move to Xebia, she had the chance to develop distinct data products, such as dashboards and machine learning implementations. Currently, she has been focusing on end-to-end analytics as an Analytics Engineer. With a mixed background in engineering and business, her mission is to contribute to data democratization in organizations, by helping them to overcome challenges when working with data at scale
Read more about Taís Laurindo Pereira

View More author details
Right arrow

Data Governance

Data governance can be irresponsibly defined as, everything you need to do to ensure data is compliant, secure, accurate, available, and valuable.

Most data professionals and technologists are rebels. Instinctively, we worry more about making something work rather than reflecting on the role of our solution in the extensive landscape of our organizations. We often enjoy novelty and working with new technology but not the administration and processes to support that work. In fact, we secretly believe most people reading this book will skip this chapter after reading the title. However, we urge you to stay a bit longer. Data governance is not the sexiest topic, but it is what separates mature data teams from the rest.

In short, doing data governance well can be challenging. It requires a lot of discipline, empathy, detail-oriented thinking, and (most often lacking) dedication. But it is probably one of the most essential activities to ensure that your work benefits...

Understanding data governance

Data governance is a comprehensive and disciplined approach to managing and protecting an organization’s data assets. It involves establishing a framework encompassing people, processes, rules, policies, tools, and guidelines to ensure data quality, security, and compliance.

Any data governance framework should specify the interactions between its three pillars: people, process, and technology.

The people pillar involves establishing clear roles and responsibilities for individuals within the organization. This includes people who create guidelines, oversee compliance, and implement these policies in the tools we use (the last is probably you). It is crucial to have a team that understands the importance of data governance and can effectively implement and enforce the necessary policies and procedures.

The process pillar involves developing data lifecycle management processes that cover data creation, storage, usage, sharing, and disposal...

Applying data governance in analytics engineering

So far, the concept of data governance might feel overwhelming or a bit fuzzy. If so, there is no need to worry. Nobody is going to ask you for a textbook definition. In fact, nobody else in your team may have ever heard of governance.

Nevertheless, even if they do not call by its name, your colleagues will often discuss the issues caused by poor governance. They will talk about how much time they waste cleaning data or their frustration when nobody fixes data quality at the source. For this reason, from now on, we will talk about governance in the way that most people understand: highlighting its main topics and demonstrating how they are relevant to your work.

Governance is interwoven into your role as an analytics engineer, especially if enabling data analysts and other data consumers is among your responsibilities. In this section, we will provide a comprehensive overview of the essential elements that data teams need to consider...

Addressing critical areas for seamless data governance

Enabling data governance at your organization is not going to be seamless. There will always be some challenges. From a lack of awareness and understanding to resistance to change and adoption, we will delve into these challenges and provide insights on overcoming them.

Resistance to change and adoption

People’s resistance to change in the field of IT is extremely common, especially if your organization is not primarily or entirely online. Technology moves fast, and people must constantly adapt to new systems, processes, and tools. However, many tend to resist change due to various reasons. Some fear the unknown, some do not have enough broadband to relearn how to do their job, and some just want to maintain the status quo. Regardless of the reason, here are some strategies to address this resistance:

  • Effective communication: It is important to communicate the reasons behind the change, its benefits, and how...

Summary

Data governance refers to any task you must do to make your data compliant, secure, accurate, available, and useful. Even though organizations often ignore it, it sets mature data teams apart. It enables you to work towards your strategic goals and reduce the hours wasted maintaining and fixing existing data assets.

In this chapter, we discuss some key topics in governance, such as ownership, data quality, managing data assets, training, and data modeling. A recurrent theme is that building governance roadmaps from scratch is generally not your responsibility. However, analytics engineers are in a privileged position to understand issues with the data and have enough technical knowledge to correct them at the source.

Working on data governance is never going to be easy. You will face resistance to change and need to get buy-in from your stakeholders to ensure the success of your initiatives. However, any goal you achieve will translate into a much better experience for...

lock icon
The rest of the chapter is locked
You have been reading a chapter from
Fundamentals of Analytics Engineering
Published in: Mar 2024Publisher: PacktISBN-13: 9781837636457
Register for a free Packt account to unlock a world of extra content!
A free Packt account unlocks extra newsletters, articles, discounted offers, and much more. Start advancing your knowledge today.
undefined
Unlock this book and the full library FREE for 7 days
Get unlimited access to 7000+ expert-authored eBooks and videos courses covering every tech area you can think of
Renews at €14.99/month. Cancel anytime

Authors (7)

author image
Dumky De Wilde

Dumky is an award-winning analytics engineer with close to 10 years of experience in setting up data pipelines, data models and cloud infrastructure. Dumky has worked with a multitude of clients from government to fintech and retail. His background is in marketing analytics and web tracking implementations, but he has since branched out to include other areas and deliver value from data and analytics across the entire organization.
Read more about Dumky De Wilde

author image
Fanny Kassapian

Fanny has a multidisciplinary background across various industries, giving her a unique perspective on analytics workflows, from engineering pipelines to driving value for the business. As a consultant, Fanny helps companies translate opportunities and business needs into technical solutions, implement analytics engineering best practices to streamline their pipelines, and treat data as a product. She is an avid promoter of data democratization, through technology and literacy
Read more about Fanny Kassapian

author image
Jovan Gligorevic

Jovan, an Analytics Engineer, specializes in data modeling and building analytical dashboards. Passionate about delivering end-to-end analytics solutions and enabling self-service analytics, he has a background in business and data science. With skills ranging from machine learning to dashboarding, Jovan has democratized data across diverse industries. Proficient in various tools and programming languages, he has extensive experience with the modern data stack. Jovan enjoys providing trainings in dbt and Power BI, sharing his knowledge generously
Read more about Jovan Gligorevic

author image
Juan Manuel Perafan

Juan Manuel Perafan 8 years of experience in the realm of analytics (5 years as a consultant). Juan was the first analytics engineer hired by Xebia back in 2020. Making him one of the earliest adopters of this way of working. Besides helping his clients realizing the value of their data, Juan is also very active in the data community. He has spoken at dozens of conferences and meetups around the world (including Coalesce 2023). Additionally, he is the founder of the Analytics Engineering meetup in the Netherlands as well as the Dutch dbt meetup
Read more about Juan Manuel Perafan

author image
Lasse Benninga

Lasse has been working in the dataspace since 2018, starting out as a Data Engineer at a large airline, then switching towards Cloud Engineering for a consultancy and working for different clients in the retailing and healthcare space. Since 2021, he's an Analytics Engineer at Xebia Data, merging software/platform engineering with analytics passion. As a consultant Lasse has seen many different clients, ranging from retail, healthcare, ridesharing industry, and trading companies. He has implemented multiple data platforms and worked in all three major clouds, leveraging his knowledge of data and analytics to provide value
Read more about Lasse Benninga

author image
Ricardo Angel Granados Lopez

Ricardo, an Analytics Engineer with a strong background in data engineering and analysis, is a quick learner and tech enthusiast. With a Master's in IT Management specializing in Data Science, he excels in using various programming languages and tools to deliver valuable insights. Ricardo, experienced in diverse industries like energy, transport, and fintech, is adept at finding alternative solutions for optimal results. As an Analytics Engineer, he focuses on driving value from data through efficient data modeling, using best practices, automating tasks and improving data quality
Read more about Ricardo Angel Granados Lopez

author image
Taís Laurindo Pereira

Taís is a versatile data professional with experience in a diverse range of organizations - from big corporations to scale-ups. Before her move to Xebia, she had the chance to develop distinct data products, such as dashboards and machine learning implementations. Currently, she has been focusing on end-to-end analytics as an Analytics Engineer. With a mixed background in engineering and business, her mission is to contribute to data democratization in organizations, by helping them to overcome challenges when working with data at scale
Read more about Taís Laurindo Pereira