Reader small image

You're reading from  Azure Data Engineering Cookbook

Product typeBook
Published inApr 2021
Reading LevelBeginner
PublisherPackt
ISBN-139781800206557
Edition1st Edition
Languages
Right arrow
Author (1)
Ahmad Osama
Ahmad Osama
author image
Ahmad Osama

Ahmad Osama works for Pitney Bowes Pvt. Ltd. as a technical architect and is a former Microsoft Data Platform MVP. In his day job, he works on developing and maintaining high performant, on-premises and cloud SQL Server OLTP environments as well as deployment and automating tasks using PowerShell. When not working, Ahmad blogs at DataPlatformLabs and can be found glued to his Xbox.
Read more about Ahmad Osama

Right arrow

Implementing a wrangling data flow

A wrangling data flow performs code-free data preparation at scale by integrating Power Query to prepare/transform data. The Power Query code is converted to Spark and gets executed on a Spark cluster.

In this recipe, we'll implement a wrangling data flow to read the orders.txt file, clean the data, calculate the total sales by country and customer name, and insert the data into an Azure SQL Database table.

Getting ready

To get started, do the following:

  1. Log in to https://portal.azure.com using your Azure credentials.
  2. Open a new PowerShell prompt. Execute the following command to log in to your Azure account from PowerShell:
    Connect-AzAccount
  3. You will need an existing Data Factory account. If you don't have one, create one by executing the ~/azure-data-engineering-cookbook\Chapter04\3_CreatingAzureDataFactory.ps1 PowerShell script.
  4. Create an Azure storage account and upload the files to the ~/Chapter06/Data folder...
lock icon
The rest of the page is locked
Previous PageNext Chapter
You have been reading a chapter from
Azure Data Engineering Cookbook
Published in: Apr 2021Publisher: PacktISBN-13: 9781800206557

Author (1)

author image
Ahmad Osama

Ahmad Osama works for Pitney Bowes Pvt. Ltd. as a technical architect and is a former Microsoft Data Platform MVP. In his day job, he works on developing and maintaining high performant, on-premises and cloud SQL Server OLTP environments as well as deployment and automating tasks using PowerShell. When not working, Ahmad blogs at DataPlatformLabs and can be found glued to his Xbox.
Read more about Ahmad Osama