You're reading from Hands-On Machine Learning with Microsoft Excel 2019

Product typeBook

Published inApr 2019

PublisherPackt

ISBN-139781789345377

Edition1st Edition

Tools

Excel

Concepts

Machine Learning

Author (1)

Julio Cesar Rodriguez Martino

Assessment

Chapter 1, Implementing Machine Learning Algorithms

In classical programming, the code developed and run in the computer is a step-by-step set of instructions telling the computer what to do and how to handle different options. Machine learning is about showing the computer examples of data to either teach it what to do by example, or to let it learn information that is hidden in the data.
The machine learning models can be either regression (if the target variable is numerical and continuous) or classification (if the target variable is categorical or discrete).
Models that learn by example, training on labeled data, are called supervised machine learning models. In comparison, those that find information in the unlabeled data are called unsupervised machine learning models.
The following are the main steps that are needed when creating and using a machine learning model:
1. Obtaining...

Chapter 2, Hands-On Examples of Machine Learning Models

Encoding prepares categorical features in order to feed them into a machine learning model and does not assume any prior correlation between the encoded values.
By setting a limit to the length of the tree or by defining a minimum entropy value.
Temperature_hot is equally split; two values end in Train_outside = yes, and two values end in Train_outside = no. This represents the maximum entropy value, where there is no clear information about what to do if the temperature is hot.

The following IF statements would be considered when deciding whether or not to train outside:
- If outlook is Sunny and it's not windy, then train outside.
- If outlook is Sunny and it's windy, then don't train outside.
- If outlook is Overcast, then Train outside.
- If outlook is Rainy and Humidity is high, then don't train outside...

Chapter 3, Importing Data into Excel from Different Data Sources

Any character that is not confused with the file contents.
The outcome of a machine learning model will be affected by missing or incorrect data entries, and the correct format should also be used.
Importing an Excel file will open the Power Query interface in order to preprocess the data.
Data that is in a tabular form.
An exhaustive list can be found at https://gist.github.com/gelisam/13d04ac5a54b577b2492785c1084281f.
An example can be found at https://stackoverflow.com/questions/38120895/database-vs-file-system-storage.

Chapter 4, Data Cleansing and Preliminary Data Analysis

Instead of building the decision tree manually, it would be interesting to study in-depth the example built-in Azure Machine Learning Studio, which was shown in Chapter 10, Azure and Excel - Machine Learning in the Cloud.
cabin and fare, pclass and fare, home.dest and fare are some examples.
Missing values could be replaced by the mean value of the variable.
Any unbalance in the dataset is referred to as bias. This will affect the results of any machine learning model, since the model will find more examples of a given class or some tendency to a particular target value.
You can, for example, try to see some correlations between variables using scatter plots.

Chapter 5, Correlations and the Importance of Variables

You can, for example, build a diagram with the categorical values on the x axis and the numerical values on the y axis; any correlation would be clear from this diagram.
It should be easy for the reader to build diagrams and understand the relationship between variables.
No. It means that when a variable increases, the other variable decreases.
This formatting was used in Chapter 6, Data Mining Models in Excel Hands-On Examples.
We calculated the Squared Error (SSE) as ([@mpg]-[@prediction])^2. The other sum we need is SST = ([@mpg]-average([@prediction]))^2. Then, we calculate R² = 1-SSE/SST.
You can try using an exponential function (EXP()) or another function with a similar shape. The R² value will probably still be far from 1, since the dispersion in the data is very high.

...

Chapter 6, Data Mining Models in Excel Hands-On Examples

Use the previous knowledge of the business to discard these associations.
Not necessarily. These types of analysis are usually dependent on the business domain and even on the particular place where we perform them. This means that some results can be generalized, but, often, not all of them.
It means that there is no customer that started buying products by the time indicated in the column and that kept buying after the period of time shown in the row.
There are no customers that old (in terms of time spent as customers).
For example, focusing on those that stop buying and aiming ad campaigns at them.

Chapter 7, Implementing Time Series

By setting increasing(TravelDate) to the moving average values in the calculation and following the same steps.
If the seasonality is too different from the real value in the data, then the prediction will have less accuracy. If we increase the confidence interval, then the error will also increase.
Using the COVARIANCE.P function in Excel.

The time series diagram, after applying the logarithm, will look like the following screenshot:

The trend is still ascending, but the standard deviation looks flat and is not dependent on the time.

Chapter 8, Visualizing Data in Diagrams, Histograms, and Maps

It is very difficult to distinguish the different pie slices.
Multiple line charts.
You can get data from https://openaddresses.io/ and follow the instructions in this article: https://www.roguegeographer.com/create-your-own-maps-using-excel-3d-maps/.
It is possible to do it and get a result, but the accuracy will be bad. The result of an election depends mostly on external factors that are not taken into account by the data, and not so much on the historical results of past elections.

Chapter 9, Artificial Neural Networks

The result will depend on the artificial neural network training. You can follow the step-by-step instructions in the Evaluating models subsection in Chapter 1, Implementing Machine Learning Algorithms.
The dataset is unbalanced and that will affect the results.

Chapter 10, Azure and Excel - Machine Learning in the Cloud

Cost, speed, global scale, productivity, performance, and security.
Cloud computing is useful for many different applications and, in fact, can replace everything that was built on-premise, from databases to visualizations.
Web services are applications hosted on the internet, which can communicate with other applications through predefined protocols and data formats. The advantage of using web services is that they are easy to share and are independent from the operating system and programming language used.
Azure Machine Learning Studio needs the input data format, and this is taken from the input data module.
The training flow is used to train the model and then save it. The same model is then used in a separate flow for prediction, without the need to retrain the model every time it is used.

...

Chapter 11, The Future of Machine Learning

The model training and testing is replaced by data mining, which works by trying to get useful information from the data.
New data is included continuously into the data flow, and the full cycle must be fulfilled before feeding it into a machine learning model.
A hyperparameter value is set before starting the learning process and defines some characteristics of the model (for example, the number of cycles in an artificial neural network training model).
The following steps can be performed automatically by AutoML:
- Data preprocessing
- Feature engineering
- Model selection
- Optimization of the model hyperparameters
- Analysis of the model results

The rest of the chapter is locked

You have been reading a chapter from

Hands-On Machine Learning with Microsoft Excel 2019

Published in: Apr 2019Publisher: PacktISBN-13: 9781789345377

A free Packt account unlocks extra newsletters, articles, discounted offers, and much more. Start advancing your knowledge today.

undefined

Unlock this book and the full library FREE for 7 days

Get unlimited access to 7000+ expert-authored eBooks and videos courses covering every tech area you can think of

Start free trial

Renews at $15.99/month. Cancel anytime

Author (1)

Julio Cesar Rodriguez Martino

Julio Cesar Rodriguez Martino is a machine learning (ML) and artificial intelligence (AI) platform architect, focusing on applying the latest techniques and models in these fields to optimize, automate, and improve the work of tax and accounting consultants. The main tool used in this practice is the MS Office platform, which Azure services complement perfectly by adding intelligence to the different tasks. Julio's background is in experimental physics, where he learned and applied advanced statistical and data analysis methods. He also teaches university courses and provides in-company training on machine learning and analytics, and has a lot of experience leading data science teams.
Read more about Julio Cesar Rodriguez Martino

Other recommended products

Related to this chapter

Hands-On Financial Modeling with Microsoft Excel 2019

This book aims to provide a gateway to financial modeling through easy to follow examples in Excel 2019. It will explore the process of modeling starting from a thorough understanding of the project to gathering historical information and arriving at assumptions. Good modeling practices are emphasized and demonstrated throughout the book.

BookJul 2019292 pages

Learn Power Query

This book will effectively guide you through Power Query, starting with the shortcomings of other tools with regard to data analysis and management. You’ll then delve into the Power Query interface, understand how to connect, combine, and refine data with query tools, and finally create dashboards and multi-dimensional reports in Power Query.

BookJul 2020428 pages

Automated Machine Learning

This guide will help you to explore automated machine learning (AutoML), a rapidly growing subfield of machine learning. You’ll learn how you can use AutoML to fully automate the machine learning process even if you’re not an expert, and in turn increase your productivity drastically.

BookFeb 2021312 pages

Hands-On Neural Networks

This book will be a journey for beginners who want to step into the world of deep learning and artificial intelligence. It will thoughtfully take you through the training and implementation of various neural network architectures using the Python ecosystem. You will master each neural network architecture while understanding its working mechanism.

BookMay 2019280 pages

Data Science for Marketing Analytics

Data Science for Marketing Analytics opens doors to looking at data with a different approach and new tools. Drawing on machine learning and data science concepts, this book broadens the range of tools that you can use to transform the market analysis process.

BookMar 2019420 pages

Machine Learning for Mobile

This book will help you build intelligent mobile applications for Android and iOS using machine learning. In the process, you will use popular machine learning toolkits such as TensorFlow Lite, Core ML, ML Kit and Fritz to build and deploy state-of-the-art machine learning models for mobile devices.

BookDec 2018274 pages

Hands-On Machine Learning with Azure

This book will teach you how advanced machine learning can be performed in the cloud in a very cheap way. You will learn more about Azure ML processes as an enterprise-ready methodology. By the end of this book, you will implement machine learning and artificial intelligence concepts in your model to solve real-world problems.

BookOct 2018340 pages

Hands-On Exploratory Data Analysis with Python

This book provides practical knowledge about the main pillars of EDA including data cleaning, data preparation, data exploration, and data visualization. You can leverage the power of Python to understand, summarize and investigate your data in the best way possible. The book presents a unique approach to exploring hidden features in your data.

BookMar 2020352 pages

Machine Learning Fundamentals

As machine learning algorithms become popular, new tools that optimize these algorithms are also developed. Machine Learning Fundamentals explains the scikit-learn API, which is a package created to facilitate the process of building machine learning applications. By explaining the differences between supervised and unsupervised models and by applying some popular algorithms to real-life datasets, this course gives you the skills and confidence to start programming machine learning algorithms.

BookNov 2018240 pages

Applied Supervised Learning with Python

Applied Supervised Learning with Python provides you a rich understanding of machine learning, one of the most pursued topics in information science, and Python, one of the most popular scripting languages. Through this book, you'll learn Jupyter Notebooks, the technology used in academic and commercial circles with in-line code running support.

BookApr 2019404 pages

The Machine Learning Workshop

With expert guidance and real-world examples, The Machine Learning Workshop gets you up and running with programming machine learning algorithms. By showing you how to leverage scikit-learn's flexibility, it teaches you all the skills you need to use machine learning to solve real-world problems.

BookJul 2020286 pages

The Supervised Learning Workshop

Taking an engaging and practical approach, The Supervised Learning Workshop teaches you how to predict the output of new data, based on the relationship and behavior of?existing datasets. You’ll learn at your own pace and use Python libraries and Jupyter to build intelligent predictive models.?

BookFeb 2020532 pages

Personalised recommendations for you

Based on your interests and search pattern

Et al.

Ever wonder why speech recognition systems don't understand the Scottish accent, or what would happen if an astronaut only ate mac 'n' cheese, or other spurious reflections you'd have at a bar? We did, then collated those deliberations into absurd research articles with fake figures and methodologies inspired by even more fictionally absurd studies.

BookAug 2023230 pages5

Generative AI with LangChain

This book is a comprehensive introduction to LLMs and LangChain, demystifying the basic mechanics of LangChain, its functionalities, and the myriad of applications it can be integrated into.

BookDec 2023360 pages4

Generative AI with LangChain

This book is a comprehensive introduction to LLMs and LangChain, demystifying the basic mechanics of LangChain, its functionalities, and the myriad of applications it can be integrated into.

BookDec 2023360 pages5

Generative AI with LangChain

This book is a comprehensive introduction to LLMs and LangChain, demystifying the basic mechanics of LangChain, its functionalities, and the myriad of applications it can be integrated into.

BookDec 2023360 pages1

Generative AI with LangChain

This book is a comprehensive introduction to LLMs and LangChain, demystifying the basic mechanics of LangChain, its functionalities, and the myriad of applications it can be integrated into.

BookDec 2023360 pages5

Mastering Tableau 2023

This book is a comprehensive resource to mastering your Tableau skills and becoming a BI expert. As you progress, you will learn how to build advanced dashboards and improve your storytelling to derive key business insight, as well as make you well-versed with advanced functionalities of Tableau in the business intelligence domain.

BookAug 2023684 pages

Building AI Applications with ChatGPT APIs

This guide covers all ChatGPT API features for effortless creation of robust AI powered apps. With its help, you’ll be able to leverage ChatGPT’s cutting-edge NLP models to take your app development skills to the next level. You’ll also work on ten exciting projects that will give you the practical know-how that you can apply to your existing applications.

BookSep 2023258 pages5

Building AI Applications with ChatGPT APIs

This guide covers all ChatGPT API features for effortless creation of robust AI powered apps. With its help, you’ll be able to leverage ChatGPT’s cutting-edge NLP models to take your app development skills to the next level. You’ll also work on ten exciting projects that will give you the practical know-how that you can apply to your existing applications.

BookSep 2023258 pages2

Data Engineering with AWS

Embark on a journey to master data engineering pipelines on AWS! Our book offers a hands-on experience of AWS services for ingesting, transforming, and consuming data. Whether you're an absolute beginner or someone with basic data engineering experience, this guide is an indispensable resource.

BookOct 2023636 pages5

Modern Data Architecture on AWS

Every organization wants an agile, performant, and cost-effective data platform that meets all their current and future business needs. Purpose-built AWS analytics services and their features play a big part in building such a modern data platform. This book brings to you all the design and architectural patterns that’ll help you achieve this goal.

BookAug 2023420 pages5

Practical Guide to Applied Conformal Prediction in Python

Discover the power of Conformal Prediction with the "Practical Guide to Applied Conformal Prediction in Python." Master the latest techniques to quantify uncertainty in machine learning and computer vision models, and seamlessly apply them to your industry applications.

BookDec 2023240 pages

TinyML Cookbook

With over 70 project-based recipes, the TinyML Cookbook is a practical guide that will help you to get the most out of your microcontrollers. It provides a comprehensive understanding of the theoretical foundations while giving you hands-on experience training ML models for deployment on Arduino Nano 33 BLE Sense, Raspberry Pi Pico, and SparkFun RedBoard Artemis Nano microcontrollers.

BookNov 2023664 pages