You're reading from Power BI Machine Learning and OpenAI

Product typeBook

Published inMay 2023

Reading LevelIntermediate

PublisherPackt

ISBN-139781837636150

Edition1st Edition

Languages

Python

Tools

Power BI

Concepts

GPT/LLMs

Author (1)

Greg Beaumont

Use Cases for OpenAI

In the previous chapter, you scored fresh data via the Power BI ML models and assessed the output in comparison to the automated testing performed by Power BI during the training phase. The FAA Wildlife Strike database provided fresh data that was generated in the real world beyond the scope of the training and testing datasets. This data could potentially serve as a framework for scheduling the scoring of new data utilizing a Power BI ML model in collaboration with dataflows. The recently evaluated data produced outcomes that were relatively consistent with the expected results derived from the testing data.

In this chapter, you are tasked by your stakeholders to incorporate OpenAI functionalities into the solution. OpenAI is garnering a lot of attention in the IT sector, and this project is being implemented during this trend. Although this entails a change in scope, the project’s beneficiaries are fully supportive of and optimistic about this initiative...

Technical requirements

The requirements are slightly different for this chapter:

An account with the original open source OpenAI: https://openai.com/.
Optional – Azure OpenAI in your Azure subscription: https://azure.microsoft.com/en-us/products/cognitive-services/openai-service. The book is written so that this is optional since it is not available to everyone at the time of publication.
FAA Wildlife Strike data files from either the FAA website or the Packt GitHub site.
A Power BI Pro license.
One of the following Power BI licensing options for access to Power BI dataflows:
- Power BI Premium
- Power BI Premium Per User
One of the following options for getting data into the Power BI cloud service:
- Microsoft OneDrive (with connectivity to the Power BI cloud service)
- Microsoft Access + Power BI Gateway
- Azure Data Lake (with connectivity to the Power BI cloud service)

Brief overview and reference links for OpenAI and Azure OpenAI

In the latter part of 2022, the global media and information technology enthusiasts were captivated by the potential of ChatGPT. ChatGPT is a vast large language model (LLM) chatbot that facilitates natural language communication, code generation, and other functionalities, and was developed by OpenAI.

OpenAI

OpenAI is an AI research organization, and interested readers may find more information about it at this link: https://openai.com/about.

The renowned ChatGPT is constructed utilizing generative pre-training (GPT) models, and OpenAI also produces other types of AI models such as DALL-E for image generation. This book abstains from delving into OpenAI’s intricate details, as there is already a plethora of information available on the internet.

The OpenAI platform is constructed upon Microsoft’s Azure cloud infrastructure, which provides a powerful and reliable foundation for the platform’...

Generating descriptions with OpenAI

Our first step will be to identify a suitable use case for leveraging the power of GPT models to generate descriptions of elements of FAA Wildlife Strike data. Our objective is to unlock the potential of external data by creating prompts for GPT models that can provide detailed information and insights about the data we are working with. Through this use case, we will explore the value that GPT models can bring to the table when it comes to data analysis and interpretation.

For example, a description of the FAA Wildlife Strike database by ChatGPT might look like this:

Figure 12.2 – OpenAI ChatGPT description of FAA Wildlife Strike database

Within your solution using the FAA Wildlife Strike database, you have data that could be tied to external data using the GPT models. A few examples include additional information about the following:

Airports
FAA regions
Flight operators
Aircraft
Aircraft...

Summarizing data with OpenAI

You can also use OpenAI GPT models to summarize data. Numerous databases feature free text fields that comprise entries from a diverse array of sources, including survey results, physician notes, feedback forms, and comments regarding incident reports for the FAA Wildlife Strike database that we have used in this book. These text entry fields represent a wide range of content, from structured data to unstructured data, making it challenging to extract meaning from them without the assistance of sophisticated natural language processing tools.

The Remarks field of the FAA Wildlife Strike database contains text that was presumably entered by people involved in filling out incident forms about aircraft striking wildlife. A few examples of the remarks for recent entries are shown in Power BI in the following screenshot:

Figure 12.6 – Examples of remarks from the FAA Wildlife Strike database

You will notice that the remarks...

Choosing GPT models for your use cases

OpenAI and Azure OpenAI offer several different GPT models that can be called iteratively using an API. At the time of writing this book, there is limited availability of the new GPT-4 models, which are the latest and greatest releases. The GPT-3.5 models are available in both OpenAI and Azure OpenAI, with a few different options. The following information was referenced on March 26, 2023, from the OpenAI website at this link: https://platform.openai.com/docs/models/gpt-4.

...

Summary

In this chapter, you have delved into the fundamental concepts associated with OpenAI and Microsoft Azure OpenAI, and how these platforms can be employed to generate and summarize text. Moreover, you have explored several options for integrating GPT models from both OpenAI and Azure OpenAI into your Power BI solution using FAA Wildlife Strike data. Following a careful evaluation process, it has been determined that the text-davinci-003 GPT model will be utilized for the summarization of remarks present in FAA Wildlife Strike data reports, and for generating novel descriptive information about airplanes within the reports.

Chapter 13 will be dedicated to the implementation of functions within Power BI dataflows, enabling the seamless calling of OpenAI and Azure OpenAI REST APIs for data. These APIs will facilitate the successful implementation of your summarization and descriptive generation use cases, thereby providing new capabilities for your solution to address the challenges...

The rest of the chapter is locked

You have been reading a chapter from

Power BI Machine Learning and OpenAI

Published in: May 2023Publisher: PacktISBN-13: 9781837636150

A free Packt account unlocks extra newsletters, articles, discounted offers, and much more. Start advancing your knowledge today.

undefined

Unlock this book and the full library FREE for 7 days

Get unlimited access to 7000+ expert-authored eBooks and videos courses covering every tech area you can think of

Start free trial

Renews at $15.99/month. Cancel anytime

Author (1)

Greg Beaumont

Greg Beaumont is a data architect at Microsoft, where he enjoys identifying and solving complex problems backed by his experience in data architecture and a passion for innovation. Focusing on the healthcare industry, Greg works closely with customers to plan enterprise analytics strategies, evaluate new tools and products, conduct training sessions and hackathons, and architect solutions that improve the quality of care and reduce costs. He strives to be a trusted advisor to his customers and is always seeking new ways to drive progress and help organizations thrive. He is a veteran of the Microsoft data speaker network and has worked with hundreds of customers on their data management and analytics strategies.
Read more about Greg Beaumont

Personalised recommendations for you

Based on your interests and search pattern

Et al.

Ever wonder why speech recognition systems don't understand the Scottish accent, or what would happen if an astronaut only ate mac 'n' cheese, or other spurious reflections you'd have at a bar? We did, then collated those deliberations into absurd research articles with fake figures and methodologies inspired by even more fictionally absurd studies.

BookAug 2023230 pages5

Generative AI with LangChain

This book is a comprehensive introduction to LLMs and LangChain, demystifying the basic mechanics of LangChain, its functionalities, and the myriad of applications it can be integrated into.

BookDec 2023360 pages4

Generative AI with LangChain

This book is a comprehensive introduction to LLMs and LangChain, demystifying the basic mechanics of LangChain, its functionalities, and the myriad of applications it can be integrated into.

BookDec 2023360 pages5

Generative AI with LangChain

This book is a comprehensive introduction to LLMs and LangChain, demystifying the basic mechanics of LangChain, its functionalities, and the myriad of applications it can be integrated into.

BookDec 2023360 pages1

Generative AI with LangChain

This book is a comprehensive introduction to LLMs and LangChain, demystifying the basic mechanics of LangChain, its functionalities, and the myriad of applications it can be integrated into.

BookDec 2023360 pages5

Mastering Tableau 2023

This book is a comprehensive resource to mastering your Tableau skills and becoming a BI expert. As you progress, you will learn how to build advanced dashboards and improve your storytelling to derive key business insight, as well as make you well-versed with advanced functionalities of Tableau in the business intelligence domain.

BookAug 2023684 pages

Building AI Applications with ChatGPT APIs

This guide covers all ChatGPT API features for effortless creation of robust AI powered apps. With its help, you’ll be able to leverage ChatGPT’s cutting-edge NLP models to take your app development skills to the next level. You’ll also work on ten exciting projects that will give you the practical know-how that you can apply to your existing applications.

BookSep 2023258 pages5

Building AI Applications with ChatGPT APIs

This guide covers all ChatGPT API features for effortless creation of robust AI powered apps. With its help, you’ll be able to leverage ChatGPT’s cutting-edge NLP models to take your app development skills to the next level. You’ll also work on ten exciting projects that will give you the practical know-how that you can apply to your existing applications.

BookSep 2023258 pages2

Data Engineering with AWS

Embark on a journey to master data engineering pipelines on AWS! Our book offers a hands-on experience of AWS services for ingesting, transforming, and consuming data. Whether you're an absolute beginner or someone with basic data engineering experience, this guide is an indispensable resource.

BookOct 2023636 pages5

Modern Data Architecture on AWS

Every organization wants an agile, performant, and cost-effective data platform that meets all their current and future business needs. Purpose-built AWS analytics services and their features play a big part in building such a modern data platform. This book brings to you all the design and architectural patterns that’ll help you achieve this goal.

BookAug 2023420 pages5

Practical Guide to Applied Conformal Prediction in Python

Discover the power of Conformal Prediction with the "Practical Guide to Applied Conformal Prediction in Python." Master the latest techniques to quantify uncertainty in machine learning and computer vision models, and seamlessly apply them to your industry applications.

BookDec 2023240 pages

TinyML Cookbook

With over 70 project-based recipes, the TinyML Cookbook is a practical guide that will help you to get the most out of your microcontrollers. It provides a comprehensive understanding of the theoretical foundations while giving you hands-on experience training ML models for deployment on Arduino Nano 33 BLE Sense, Raspberry Pi Pico, and SparkFun RedBoard Artemis Nano microcontrollers.

BookNov 2023664 pages

Latest model	Description	Max tokens	Training data
gpt-3.5-turbo	Most capable GPT-3.5 model and optimized for chat at one-tenth the cost of text-davinci-003. Will be updated with our latest model iteration.	4,096 tokens	Up to September 2021