Search icon
Arrow left icon
All Products
Best Sellers
New Releases
Books
Videos
Audiobooks
Learning Hub
Newsletters
Free Learning
Arrow right icon
The Definitive Guide to Data Integration

You're reading from  The Definitive Guide to Data Integration

Product type Book
Published in Mar 2024
Publisher Packt
ISBN-13 9781837631919
Pages 490 pages
Edition 1st Edition
Languages
Authors (4):
Pierre-Yves BONNEFOY Pierre-Yves BONNEFOY
Profile icon Pierre-Yves BONNEFOY
Emeric CHAIZE Emeric CHAIZE
Profile icon Emeric CHAIZE
Raphaël MANSUY Raphaël MANSUY
Profile icon Raphaël MANSUY
Mehdi TAZI Mehdi TAZI
Profile icon Mehdi TAZI
View More author details

Table of Contents (19) Chapters

Preface Chapter 1: Introduction to Our Data Integration Journey Chapter 2: Introducing Data Integration Chapter 3: Architecture and History of Data Integration Chapter 4: Data Sources and Types Chapter 5: Columnar Data Formats and Comparisons Chapter 6: Data Storage Technologies and Architectures Chapter 7: Data Ingestion and Storage Strategies Chapter 8: Data Integration Techniques Chapter 9: Data Transformation and Processing Chapter 10: Transformation Patterns, Cleansing, and Normalization Chapter 11: Data Exposition and APIs Chapter 12: Data Preparation and Analysis Chapter 13: Workflow Management, Monitoring, and Data Quality Chapter 14: Lineage, Governance, and Compliance Chapter 15: Various Architecture Use Cases Chapter 16: Prospects and Challenges Index Other Books You May Enjoy

Summary

This chapter examined various data transformation methodologies, tools, and use cases, including filters, aggregations, and join. Each operation’s utility and function were stated, which enabled us to cover practical applications.

Next, we explored data transformation use cases in sales analysis, social media analysis, customer segmentation, and website analytics. These case studies demonstrate the concepts’ efficacy.

SQL and Spark, two key data transformation tools, dominated this chapter. SQL, a popular query language, is used to change data, whereas Spark is a powerful data processing engine. We compared SQL and Spark’s DataFrame API to show these tools’ adaptability.

Finally, we discussed the main data transformation techniques, which include event, batch, and stream processing. We emphasized their unique features and usefulness before covering windowing. After, you learned about data transformations through practical examples and were...

lock icon The rest of the chapter is locked
Register for a free Packt account to unlock a world of extra content!
A free Packt account unlocks extra newsletters, articles, discounted offers, and much more. Start advancing your knowledge today.
Unlock this book and the full library FREE for 7 days
Get unlimited access to 7000+ expert-authored eBooks and videos courses covering every tech area you can think of
Renews at $15.99/month. Cancel anytime}