Reader small image

You're reading from  Principles of Data Fabric

Product typeBook
Published inApr 2023
PublisherPackt
ISBN-139781804615225
Edition1st Edition
Right arrow
Author (1)
Sonia Mezzetta
Sonia Mezzetta
author image
Sonia Mezzetta

Sonia Mezzetta is a senior certified IBM architect working as a Data Fabric Program Director. She has an eye for detail and enjoys problem solving data pain points. She started her data management career in IBM as a data architect specializing in enterprise architectures. She is an expert in Data Fabric, DataOps, Data Governance, and Data Analytics. With over 20 years of experience, she has designed and architected several Enterprise data solutions. She has authored numerous data management white papers and has a master's and bachelor's degree in Computer Science. Sonia is originally from New York City, and currently resides in the area of Westchester County, New York.
Read more about Sonia Mezzetta

Right arrow

Preface

Data constitutes facts, statistics, and information based on real-world entities and events. The word fabric represents a body of material with texture and structure, such as silk cloth. These two keywords, Data Fabric, create a representation of disparate data that has been connected by a data architecture driven by governance, active metadata, automated data integration, and self-service. In today’s big data era, there are many complexities faced by enterprises looking to become data driven. Many of these issues, such as data silos, agility, lack of collaboration between business and IT, high maintenance costs, data breach, and data integrity, revolve around the large volume and velocity of proliferated data. Data Fabric is a mature, composable data architecture that faces these complexities head-on to enable the management of data at a high scale with established business value.

I wrote this book to introduce a slightly different perspective on the definition of Data Fabric architecture. The view I offer is flexible and use case agnostic and supports diverse data management styles, operational models, and technologies. I describe Data Fabric architecture as taking a people, process, and technology approach that can be applied in a complementary manner with other trending data management frameworks, such as Data Mesh and DataOps. The main theme of this book is to provide a guide to the design of Data Fabric architecture, explain the foundational role of Data Governance, and provide an understanding of how Data Fabric architecture achieves automated Data Integration and Self-Service. The technique I use is by describing “a day in the life of data” as it steps through the phases of its life cycle: create, ingest, integrate, consume, archive, and destroy. I talk about how each layer in Data Fabric architecture executes in a high-performing and thorough manner to address today’s big data complexities. I provide a set of guidelines, architecture principles, best practices, and key concepts to enable the design and implementation of a successful Data Fabric architecture.

The perspective I offer is based on decades of experience in the areas of Enterprise Architecture, Data Architecture, Data Governance, and Product Management. I remember when I started my career in Data Governance, I faced many challenges convincing others of the business value that successful data management with Data Governance achieves. I saw what many others failed to see at that time, and that was when I knew data was my passion! Since then, I’ve broadened and increased my knowledge and experience. I have learned from brilliant thought leaders at IBM and a diverse set of clients. All these experiences have shaped the frame of reference in this book.

As technologists, we are very passionate about our points of view, ideas, and perspectives. This is my point of view on what a Data Fabric architecture design represents, which aims to achieve significant business value while addressing the complexities enterprises face today.

Note

The views expressed in the book belong to the author and do not necessarily represent the opinions or views of their employer, IBM.

Who this book is for

This book is for an organization looking to venture on a digital transformation journey, or an existing data-driven organization looking to mature further in their data journey. It is intended for a diverse set of roles, both business and technical, with a vested interest in strategic, automated, and modern data management, including the following:

  • Executive leaders such as chief data officers, chief technology officers, chief information officers, and data leaders prioritizing strategic investments to execute an enterprise data strategy
  • Enterprise architects, data architects, Data Governance roles such as data security, data privacy roles, and technical leaders tasked with designing and implementing a mature and governed Self-Service data platform
  • Business analysts and data scientists looking to understand their role as data producers or data consumers in a Self-Service ecosystem leveraging Data Fabric architecture
  • Developers such as data engineers, software engineers, and business intelligence developers looking to comprehend Data Fabric architecture to learn how it achieves the rapid development of governed, trusted data

What this book covers

Chapter 1, Introducing Data Fabric, presents an introduction to the definition of Data Fabric architecture. It offers a position on what Data Fabric is and what it isn’t. Key characteristics and architectural principles are explained. Essential concepts and terminology are defined. The business value statement of Data Fabric architecture is discussed and the core building blocks that make up its design are established.

Chapter 2, Show Me the Business Value, is a chapter focused on providing a business point of view on the benefits of Data Fabric architecture. It establishes the business value the architecture offers by explaining how the building blocks that make up a Data Fabric design address pain points faced by enterprises today. Data Fabric architecture takes a strategic and multi-faceted approach to achieve data monetization. Real-life examples have been positioned on the impact of not having the right level of focus provided by each of Data Fabric’s building blocks. Finally, a perspective is offered on how Data Fabric architecture can be leveraged by large, medium-sized, and small organizations.

Chapter 3, Choosing between Data Fabric and Data Mesh, provides an overview of the key principles of Data Mesh architecture. Both Data Fabric and Data Mesh are discussed, including where they share similar objectives and where they take different but complementary approaches. Both architectures represent sophisticated designs focused on data trust and enable the high-scale sharing of quality data. This chapter closes with a view on how Data Fabric and Data Mesh can be used together to achieve rapid data access, high-quality data, and automated Data Governance.

Chapter 4, Introducing DataOps, introduces the DataOps framework. It discusses the business value it provides and describes the 18 driving principles that make up DataOps. The role of data observability and its relationship to the Data Quality and Data Governance pillar is explained. This chapter concludes by explaining how to apply DataOps as an operational model for Data Fabric architecture.

Chapter 5, Building a Data Strategy, kicks off the creation and implementation of a data strategy document. It describes a data strategy document as a visionary statement and a plan for profitable revenue and cost savings. You will familiarize yourself with the different sections that should be defined in a data strategy document, and have a reference of three data maturity frameworks to use as input in a data strategy. The chapter ends with tips on how Data Fabric architecture can be positioned as part of a data strategy document.

Chapter 6, Designing a Data Fabric Architecture, sets the foundation for the design of a Data Fabric architecture. It introduces key architecture concepts and architecture principles that compose the logical data architecture of a Data Fabric. The three architecture layers, Data Governance, Data Integration, and Self-Service, in a Data Fabric architecture are introduced. The objectives of each layer are highlighted, with a discussion on the necessary capabilities represented as components.

Chapter 7, Designing Data Governance, dives into the design of the Data Governance layer of a Data Fabric architecture. Key architecture patterns, such as metadata-driven and event-driven architectures, are discussed. The architecture components, such as active metadata, metadata knowledge graphs, and life cycle governance, are explained. The chapter ends with an explanation of how the Data Governance layer executes and governs data at each phase in its life cycle.

Chapter 8, Designing Data Integration and Self-Service, drills into the design of the two remaining architecture layers in a Data Fabric, Data Integration and Self-Service. The Data Integration layer is reviewed, which focuses on the development of data with a DataOps lens. The Self-Service layer is also discussed, including how it aims to democratize data. An understanding is provided of how both architecture layers work with each other, and how they rely on the Data Governance layer. At the end of the chapter, a Data Fabric reference architecture is presented.

Chapter 9, Realizing a Data Fabric Technical Architecture, positions a technical Data Fabric architecture as modular and composable, consisting of several tools and technologies. The required capabilities and the kinds of tools to implement each of the three layers in a Data Fabric architecture are discussed. Two use cases are reviewed – distributed data management via Data Mesh and regulatory compliance – as examples of how to apply a Data Fabric architecture. The chapter ends by presenting a Data Fabric with Data Mesh technical reference architecture.

Chapter 10, Industry Best Practices, presents 16 best practices in data management. Best practices are grouped into four categories: Data Strategy, Data Architecture, Data Integration and Self-Service, and Data Governance. Each best practice is described and has a why should you care statement.

To get the most out of this book

To understand the key concepts and themes in this book, you should have a general understanding of the IT industry, enterprise architectures, data management, and Data Governance.

Conventions used

There are a number of text conventions used throughout this book.

Bold: Indicates a new term, an important word, or words that you see onscreen. For instance, words in menus or dialog boxes appear in bold. Here is an example: “Select System info from the Administration panel.”

Tips or important notes

Appear like this.

Get in touch

Feedback from our readers is always welcome.

General feedback: If you have questions about any aspect of this book, email us at customercare@packtpub.com and mention the book title in the subject of your message.

Errata: Although we have taken every care to ensure the accuracy of our content, mistakes do happen. If you have found a mistake in this book, we would be grateful if you would report this to us. Please visit www.packtpub.com/support/errata and fill in the form.

Piracy: If you come across any illegal copies of our works in any form on the internet, we would be grateful if you would provide us with the location address or website name. Please contact us at copyright@packt.com with a link to the material.

If you are interested in becoming an author: If there is a topic that you have expertise in and you are interested in either writing or contributing to a book, please visit authors.packtpub.com.

Share Your Thoughts

Once you’ve read Principles of Data Fabric, we’d love to hear your thoughts! Please click here to go straight to the Amazon review page for this book and share your feedback.

Your review is important to us and the tech community and will help us make sure we’re delivering excellent quality content.

Download a free PDF copy of this book

Thanks for purchasing this book!

Do you like to read on the go but are unable to carry your print books everywhere?

Is your eBook purchase not compatible with the device of your choice?

Don’t worry, now with every Packt book you get a DRM-free PDF version of that book at no cost.

Read anywhere, any place, on any device. Search, copy, and paste code from your favorite technical books directly into your application.

The perks don’t stop there, you can get exclusive access to discounts, newsletters, and great free content in your inbox daily

Follow these simple steps to get the benefits:

  1. Scan the QR code or visit the link below

https://packt.link/free-ebook/9781804615225

  1. Submit your proof of purchase
  2. That’s it! We’ll send your free PDF and other benefits to your email directly
lock icon
The rest of the chapter is locked
You have been reading a chapter from
Principles of Data Fabric
Published in: Apr 2023Publisher: PacktISBN-13: 9781804615225
Register for a free Packt account to unlock a world of extra content!
A free Packt account unlocks extra newsletters, articles, discounted offers, and much more. Start advancing your knowledge today.
undefined
Unlock this book and the full library FREE for 7 days
Get unlimited access to 7000+ expert-authored eBooks and videos courses covering every tech area you can think of
Renews at $15.99/month. Cancel anytime

Author (1)

author image
Sonia Mezzetta

Sonia Mezzetta is a senior certified IBM architect working as a Data Fabric Program Director. She has an eye for detail and enjoys problem solving data pain points. She started her data management career in IBM as a data architect specializing in enterprise architectures. She is an expert in Data Fabric, DataOps, Data Governance, and Data Analytics. With over 20 years of experience, she has designed and architected several Enterprise data solutions. She has authored numerous data management white papers and has a master's and bachelor's degree in Computer Science. Sonia is originally from New York City, and currently resides in the area of Westchester County, New York.
Read more about Sonia Mezzetta