You're reading from Microsoft Power BI Cookbook. - Second Edition

Product typeBook

Published inSep 2021

PublisherPackt

ISBN-139781801813044

Edition2nd Edition

Tools

Power BI

Concepts

Business Intelligence

Authors (2):

Gregory Deckler

Brett Powell

View More author details

Importing Data

Import is the default data connectivity mode for Power BI Desktop. Import models created in Power BI Desktop use the same in-memory, columnar compressed storage engine (VertiPaq) featured in Analysis Services Tabular 2016+ import models. Import mode models support the integration of disparate data sources (for example, SQL Server and DB2) and allow more flexibility in developing metrics and row-level security roles via full support for all DAX functions.

There are some limits for Import mode datasets, however. For example, Power BI Pro license users cannot publish Power BI Desktop files to shared capacity in the Power BI service that are larger than 1GB. Power BI Premium (dedicated, isolated hardware) supports datasets of 10GB in size and larger (with large datasets enabled, dataset size is limited by the Premium capacity size or the maximum size set by the administrator). With such large datasets, it is important to consider employing incremental refresh where only new and changed data is refreshed and imported, instead of the entire dataset being refreshed.

This recipe describes a process of using M and the Query Editor to develop the Import mode queries for a standard star-schema analytical model. A staging query approach is introduced as a means of efficiently enhancing the dimensions of a model. In addition, tips are included for using fewer resources during the refresh and avoiding refresh failures from revised source data. More details of these methods are included in other recipes in this chapter.

Getting ready

In this example, the DimProduct, DimProductSubcategory, and DimProductCategory tables from the AdventureWorksDW2019 database are integrated into a single import query. This query includes all product rows, only the English language columns, and user-friendly names. Many-to-one relationships have been defined in the source database.

To prepare for this recipe, do the following:

Open Power BI Desktop.
Create an Import mode data source query called AdWorksDW. This query should be similar to the following:
```
let
    Source = Sql.Database("localhost\MSSQLSERVERDEV", "AdventureWorksDW2019")
in
    Source
```
Isolate this query in a query group called Data Sources.
Disable loading of this query.

For additional details on performing these steps, see the Managing Queries and Data Sources recipe in this chapter.

How to import data

To implement this recipe, perform the following steps:

Right-click AdWorksDW and choose Reference. This creates a new query that references the AdWorksDW query as its source.
Select this new query and, in the preview data, find the DimProduct table in the Name column. Click on the Table link in the Data column for this row.
Rename this query DimProduct.
Repeat steps 1 – 3 for the DimProductCategory and DimProductSubcategory tables.
Create a new query group called Staging Queries.
Move the DimProduct, DimProductCategory, and DimProductSubcategory queries to the Staging Queries group.
Disable loading for all queries in the Staging Queries group. Your finished set of queries should look similar to Figure 2.20.

Figure 2.20: Staging Queries

The italics indicate that the queries will not be loaded into the model.
Create a new Blank Query and name this query Products.
Open the Advanced Editor for the Products query.
In the Products query, use the Table.NestedJoin function to join the DimProduct and DimProductSubcategory queries. This is the same function that is used if you were to select the Merge Queries option in the ribbon of the Home tab. A left outer join is required to preserve all DimProduct rows, since the foreign key column to DimProductCategory allows null values.
Add a Table.ExpandColumns expression to retrieve the necessary columns from the DimProductSubcategory table. The Products query should now have the following code:
```
let
    ProductSubCatJoin = 
        Table.NestedJoin(
            DimProduct,"ProductSubcategoryKey",
            DimProductSubcategory,"ProductSubcategoryKey",
            "SubCatColumn",JoinKind.LeftOuter
        ),
    ProductSubCatColumns =
        Table.ExpandTableColumn(
            ProductSubCatJoin,"SubCatColumn",
            {"EnglishProductSubcategoryName","ProductCategoryKey"},
            {"Product Subcategory", "ProductCategoryKey"}
        )
in
    ProductSubCatColumns
```
The NestedJoin function inserts the results of the join into a column (SubCatColumn) as table values. The second expression converts these table values into the necessary columns from the DimProductSubcategory query and provides the simple Product Subcategory column name, as shown in Figure 2.21.

Figure 2.21: Product Subcategory Columns Added

The query preview in the Power Query Editor will expose the new columns at the far right of the preview data.
Add another expression beneath the ProductSubCatColumns expression with a Table.NestedJoin function that joins the previous expression (the Product to Subcategory join) with the DimProductCategory query.
Just like step 8, use a Table.ExpandTableColumn function in a new expression to expose the required Product Category columns.
```
        ),
    
    ProductCatJoin = 
        Table.NestedJoin(
            ProductSubCatColumns,"ProductCategoryKey",
            DimProductCategory,"ProductCategoryKey",
            "ProdCatColumn",JoinKind.LeftOuter
        ),
    ProductCatColumns = 
        Table.ExpandTableColumn(
            ProductCatJoin,"ProdCatColumn",
            {"EnglishProductCategoryName"}, {"Product Category"}
        )
in
    ProductCatColumns
```
Be certain to add a comma after the ProductSubCatColumns expression. In addition, be sure to change the line beneath the in keyword to ProductCatColumns.

The expression ProductCatJoin adds the results of the join to DimProductCategory (the right table) to the new column (ProdCatColumn). The next expression, ProductCatColumns adds the required Product Category columns and revises the EnglishProductCategoryName column to Product Category. A left outer join was necessary with this join operation as well since the product category foreign key column on DimProductSubcategory allows null values.
Add an expression after the ProductCatColumns expression that selects the columns needed for the load to the data model with a Table.SelectColumns function.

In addition, add a final expression to rename these columns via Table.RenameColumns to eliminate references to the English language and provide spaces between words.

        ),
    SelectProductColumns = 
        Table.SelectColumns(ProductCatColumns,
            {
                "ProductKey", "EnglishDescription",
                "EnglishProductName", "Product Subcategory", "Product 
Category"
            }
        ),
    RenameProductColumns = 
        Table.RenameColumns(SelectProductColumns,
            {
                {"EnglishDescription", "Product Description"}, 
                {"EnglishProductName", "Product Name"}
            }
        )
in
    RenameProductColumns

Be certain to add a comma after the ProductCatColumns expression. In addition, change the line beneath the in keyword to RenameProductColumns.

The preview in the Power Query Editor for the Products query should now be similar to that shown in Figure 2.22.

Figure 2.22: Product Query Results

It is not necessary to rename the ProductKey column since this column will be hidden from the reporting layer. In practice, the product dimension would include many more columns. Closing and applying the changes results in only the Products table being loaded into the model.

The denormalized Products table now supports a three-level hierarchy in the Power BI Desktop model to significantly benefit reporting and analysis.

Figure 2.23: Product Hierarchy

How it works

The default join kind for Table.NestedJoin is a left outer join. However, as other join kinds are supported (for example, inner, anti, and full outer), explicitly specifying this parameter in expressions is recommended. Left outer joins are required in the Products table example, as the foreign key columns on DimProduct and DimProductSubcategory both allow null values. Inner joins implemented either via Table.NestedJoin or Table.Join functions are recommended for performance purposes otherwise. Additional details on the joining functions as well as tips on designing inline queries as an alternative to staging queries are covered in the Combining and Merging Queries recipe in this chapter.

When a query joins two tables via a Table.NestedJoin or Table.Join function, a column is added to the first table containing a Table object that contains the joined rows from the second table. This column must be expanded using a Table.ExpandTableColumn function, which generates additional rows as specified by the join operation.

Once all rows are generated by the join and column expansion operations, the specific columns desired in the end result can be specified by the Table.SelectColumns operation; these columns can then be renamed as desired using the Table.RenameColumns function.

There's more...

Using Import mode, we can do many things to enhance our queries to aid in report development and display. One such example is that we can add additional columns to provide automatic sorting of an attribute in report visuals. Specifically, suppose that we wish for the United States regional organizations to appear next to one another by default in visualizations. By default, since the Organization column in the DimOrganization table in AdventureWorksDW2019 is a text column, the Central Division (a part of the USA), appears between Canada and France based upon the default alphabetical sorting of text columns. We can modify a simple query that pulls the DimOrganization table to add a numeric sorting column. To see how this works, follow these steps:

Using the same Power BI file used for this recipe, open the Power Query Editor, right-click the AdWorksDW query, and select Reference.
Choose the DimOrganization table and rename the query to DimOrganization.
Open the Advanced Editor window for the DimOrganization query.
Add a Table.Sort expression to the import query for the DimOrganization dimension. The columns for the sort should be at the parent or higher level of the hierarchy.

Add an expression with the Table.AddIndexColumn function that will add a sequential integer based on the table sort applied in the previous step. The completed query should look something like the following:

let
    Source = AdWorksDW,
    dbo_DimOrganization = 
        Source{[Schema="dbo",Item="DimOrganization"]}[Data],
    OrgSorted = 
        Table.Sort(
            dbo_DimOrganization,
            {
                {"ParentOrganizationKey", Order.Ascending},
                {"CurrencyKey", Order.Ascending}
            }
        ),
    OrgSortIndex = Table.AddIndexColumn(OrgSorted,"OrgSortIndex",1,1) 
in
    OrgSortIndex

Finally, with the Ctrl key pressed, select the OrganizationKey, OrganizationName, and OrgSortIndex columns by clicking their column headers. Right-click on the OrgSortIndex column and choose to Remove Other Columns. The preview data should now show as presented in Figure 2.24.

Figure 2.24: Modified Organization Dimension Query

With this expression, the table is first sorted by the ParentOrganizationKey column and then by the CurrencyKey column. The new index column starts at the first row of this sorted table with an incremental growth of one per row. The net effect is that all of the US divisions are grouped together at the end of the table.

We can now use this new index column to adjust the default alphanumeric sorting behavior of the OrganizationName column. To see how this works, perform the following steps:

Choose Close & Apply to exit Power Query Editor to load the DimOrganization table.
In the Data View, select the OrganizationName column.
From the Column tools tab, set the Sort by column drop-down to the OrgSortIndex column.

Figure 2.25: Sort By in Data View
Finally, right-click on the OrgSortIndex column and select Hide in report view.

Visuals using the OrganizationName column will now sort the values by their parent organization such that the USA organizations appear together (but not alphabetically).

Figure 2.26: Organization automatically sorted

Greg Deckler is a 7-time Microsoft MVP for Data Platform and an active blogger and Power BI community member, having written over 6,000 solutions to community questions. Greg has authored many books on Power BI, including Learn Power BI 1st and 2nd Editions, DAX Cookbook, Power BI Cookbook 2nd Edition and Mastering Power BI 2nd Edition. Greg has also created several external tools for Power BI and regularly posts video content to his YouTube channels, Microsoft Hates Greg and DAX For Humans.
Read more about Gregory Deckler

Brett Powell

Brett Powell is the owner of and business intelligence consultant at Frontline Analytics LLC, a data and analytics research and consulting firm and Microsoft Power BI partner. He has worked with Power BI technologies since they were first introduced as the PowerPivot add-in for Excel 2010 and has been a Power BI architect and lead BI consultant for organizations across the retail, manufacturing, and financial services industries. Additionally, Brett has led Boston's Power BI User Group, delivered presentations at technology events such as Power BI World Tour, and maintains the popular Insight Quest Microsoft BI blog.
Read more about Brett Powell

Other recommended products

Related to this chapter

Mastering Microsoft Power BI

This book will show you how to use Power BI effectively to create a variety of visualizations and BI dashboards. Right from gathering data through various data sources, you will learn to perform effective visual analytics. By the end of this book, you will be able to gain unique, hidden insights into your data using Microsoft Power BI.

BookMar 2018638 pages5

Expert Data Modeling with Power BI

This book shows you how to effectively use Power BI, covering everything from Fact tables and Dimension tables to different data modeling techniques. With the help of real-world scenarios, you'll be able to identify when, why, and how to prepare data to support an efficient Star Schema to satisfy business requirements.

BookJun 2021612 pages

Learn Power BI

This book will help you bring business intelligence capabilities using Power BI in order to make smarter decisions. You will learn data modeling, visualizations, and analytical capabilities from scratch using hands-on examples. By the end of this book, you will learn the extensive features of Power BI and how to use them to unlock business insights.

BookSep 2019362 pages

Learn Power Query

This book will effectively guide you through Power Query, starting with the shortcomings of other tools with regard to data analysis and management. You’ll then delve into the Power Query interface, understand how to connect, combine, and refine data with query tools, and finally create dashboards and multi-dimensional reports in Power Query.

BookJul 2020428 pages

Microsoft Power BI Quick Start Guide

Microsoft Power BI Quick Start Guide, Second Edition gets you up to speed with Power BI quickly, enabling you to derive actionable insights from your data using the data visualization capabilities of Microsoft Power BI within a short span of time.

BookOct 2020296 pages

Microsoft Power BI Quick Start Guide

Microsoft Power BI is a cloud-based service that helps you easily visualize and share insights from your organization's data. This book will get you started with Business Intelligence using the Power BI tool, covering essential concepts like installation, building basic dashboards and visualizations to make your data come to life.

BookJul 2018202 pages

Hands-On Business Intelligence with DAX

This book follows a step-by-step explanation of essential concepts, practical examples, and self-assessment questions. You will begin by learning the basics of DAX, along with important concepts such as evaluation contexts and data modeling, before moving on to more advanced topics such as query optimization.

BookJan 2020402 pages

DAX Cookbook

DAX is a library of functions and operators that can be combined to build formulas and expressions in Power BI Desktop, Azure Analysis Services, SQL Server Analysis Services, and Power Pivot in Excel. This book is a desk reference for people who want to leverage DAX's functionality and flexibility in BI and data analytics domains.

BookMar 2020552 pages

Hands-On SQL Server 2019 Analysis Services

This book will expand your ability to deliver meaningful, performant solutions to your organization. You’ll learn how to use an analytical engine for decision making and business analytics. With the help of this practical guide, you’ll also be able to work confidently with data and analytics.

BookOct 2020474 pages

Tabular Modeling with SQL Server 2016 Analysis Services Cookbook

BookJan 2017372 pages

Building Dashboards with Microsoft Dynamics GP 2016

The book shows you in detail how to build great-looking dashboards with Microsoft Dynamics GP. It will take you from creating amazing dashboards using various tools such as Excel 2016, Power BI to Jet Express. We will also cover core topics such as Business Analyzer, Microsoft SQL Reporting services reports, BI360, and more.

BookMar 2017354 pages

Introducing Microsoft SQL Server 2019

Introducing Microsoft SQL Server 2019 takes you through what’s new in SQL Server 2019 and why it matters. After reading this book, you’ll be well placed to explore exactly how you can make MIcrosoft SQL Server 2019 work best for you.

BookApr 2020488 pages

Personalised recommendations for you

Based on your interests and search pattern

Et al.

Ever wonder why speech recognition systems don't understand the Scottish accent, or what would happen if an astronaut only ate mac 'n' cheese, or other spurious reflections you'd have at a bar? We did, then collated those deliberations into absurd research articles with fake figures and methodologies inspired by even more fictionally absurd studies.

BookAug 2023230 pages5

Generative AI with LangChain

This book is a comprehensive introduction to LLMs and LangChain, demystifying the basic mechanics of LangChain, its functionalities, and the myriad of applications it can be integrated into.

BookDec 2023360 pages4

Generative AI with LangChain

This book is a comprehensive introduction to LLMs and LangChain, demystifying the basic mechanics of LangChain, its functionalities, and the myriad of applications it can be integrated into.

BookDec 2023360 pages5

Generative AI with LangChain

This book is a comprehensive introduction to LLMs and LangChain, demystifying the basic mechanics of LangChain, its functionalities, and the myriad of applications it can be integrated into.

BookDec 2023360 pages1

Generative AI with LangChain

This book is a comprehensive introduction to LLMs and LangChain, demystifying the basic mechanics of LangChain, its functionalities, and the myriad of applications it can be integrated into.

BookDec 2023360 pages5

Mastering Tableau 2023

This book is a comprehensive resource to mastering your Tableau skills and becoming a BI expert. As you progress, you will learn how to build advanced dashboards and improve your storytelling to derive key business insight, as well as make you well-versed with advanced functionalities of Tableau in the business intelligence domain.

BookAug 2023684 pages

Building AI Applications with ChatGPT APIs

This guide covers all ChatGPT API features for effortless creation of robust AI powered apps. With its help, you’ll be able to leverage ChatGPT’s cutting-edge NLP models to take your app development skills to the next level. You’ll also work on ten exciting projects that will give you the practical know-how that you can apply to your existing applications.

BookSep 2023258 pages5

Building AI Applications with ChatGPT APIs

This guide covers all ChatGPT API features for effortless creation of robust AI powered apps. With its help, you’ll be able to leverage ChatGPT’s cutting-edge NLP models to take your app development skills to the next level. You’ll also work on ten exciting projects that will give you the practical know-how that you can apply to your existing applications.

BookSep 2023258 pages2

Data Engineering with AWS

Embark on a journey to master data engineering pipelines on AWS! Our book offers a hands-on experience of AWS services for ingesting, transforming, and consuming data. Whether you're an absolute beginner or someone with basic data engineering experience, this guide is an indispensable resource.

BookOct 2023636 pages5

Modern Data Architecture on AWS

Every organization wants an agile, performant, and cost-effective data platform that meets all their current and future business needs. Purpose-built AWS analytics services and their features play a big part in building such a modern data platform. This book brings to you all the design and architectural patterns that’ll help you achieve this goal.

BookAug 2023420 pages5

Practical Guide to Applied Conformal Prediction in Python

Discover the power of Conformal Prediction with the "Practical Guide to Applied Conformal Prediction in Python." Master the latest techniques to quantify uncertainty in machine learning and computer vision models, and seamlessly apply them to your industry applications.

BookDec 2023240 pages

TinyML Cookbook

With over 70 project-based recipes, the TinyML Cookbook is a practical guide that will help you to get the most out of your microcontrollers. It provides a comprehensive understanding of the theoretical foundations while giving you hands-on experience training ML models for deployment on Arduino Nano 33 BLE Sense, Raspberry Pi Pico, and SparkFun RedBoard Artemis Nano microcontrollers.

BookNov 2023664 pages

You're reading from Microsoft Power BI Cookbook. - Second Edition

Importing Data

Getting ready

How to import data

How it works

There's more...

See also

Authors (2)

Mastering Microsoft Power BI

Expert Data Modeling with Power BI

Learn Power BI

Learn Power Query

Microsoft Power BI Quick Start Guide

Microsoft Power BI Quick Start Guide, Second Edition gets you up to speed with Power BI quickly, enabling you to derive actionable insights from your data using the data visualization capabilities of Microsoft Power BI within a short span of time.

Microsoft Power BI Quick Start Guide

Hands-On Business Intelligence with DAX

DAX Cookbook

Hands-On SQL Server 2019 Analysis Services

This book will expand your ability to deliver meaningful, performant solutions to your organization. You’ll learn how to use an analytical engine for decision making and business analytics. With the help of this practical guide, you’ll also be able to work confidently with data and analytics.

Tabular Modeling with SQL Server 2016 Analysis Services Cookbook

Building Dashboards with Microsoft Dynamics GP 2016

Introducing Microsoft SQL Server 2019

Introducing Microsoft SQL Server 2019 takes you through what’s new in SQL Server 2019 and why it matters. After reading this book, you’ll be well placed to explore exactly how you can make MIcrosoft SQL Server 2019 work best for you.

Et al.

Generative AI with LangChain

This book is a comprehensive introduction to LLMs and LangChain, demystifying the basic mechanics of LangChain, its functionalities, and the myriad of applications it can be integrated into.

Generative AI with LangChain

This book is a comprehensive introduction to LLMs and LangChain, demystifying the basic mechanics of LangChain, its functionalities, and the myriad of applications it can be integrated into.

Generative AI with LangChain

This book is a comprehensive introduction to LLMs and LangChain, demystifying the basic mechanics of LangChain, its functionalities, and the myriad of applications it can be integrated into.

Generative AI with LangChain

This book is a comprehensive introduction to LLMs and LangChain, demystifying the basic mechanics of LangChain, its functionalities, and the myriad of applications it can be integrated into.

Mastering Tableau 2023

Building AI Applications with ChatGPT APIs

Building AI Applications with ChatGPT APIs

Data Engineering with AWS

Embark on a journey to master data engineering pipelines on AWS! Our book offers a hands-on experience of AWS services for ingesting, transforming, and consuming data. Whether you're an absolute beginner or someone with basic data engineering experience, this guide is an indispensable resource.

Modern Data Architecture on AWS

Practical Guide to Applied Conformal Prediction in Python

Discover the power of Conformal Prediction with the "Practical Guide to Applied Conformal Prediction in Python." Master the latest techniques to quantify uncertainty in machine learning and computer vision models, and seamlessly apply them to your industry applications.

TinyML Cookbook