
The preparations

Loading the libraries

To run this example, you need to install the following libraries:

  • mldatasets to load the dataset
  • pandas, numpy, and nltk to manipulate it
  • sklearn (scikit-learn) and lightgbm to split the data and fit the models
  • matplotlib, seaborn, shap, and lime to visualize the interpretations
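
If any of these are not already installed, you can typically install them from a terminal with pip. The package names below mirror the list above, but exact names and versions may vary with your environment:

pip install mldatasets pandas numpy nltk scikit-learn lightgbm matplotlib seaborn shap lime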

You should load all of them first, as follows:

import math
import mldatasets
import pandas as pd
import numpy as np
import re
import nltk
from nltk.probability import FreqDist
from nltk.tokenize import word_tokenize
from sklearn.model_selection import train_test_split
from sklearn.pipeline import make_pipeline
from sklearn import metrics, svm
from sklearn.feature_extraction.text import TfidfVectorizer
import lightgbm as lgb
import matplotlib.pyplot as plt
import seaborn as sns
import shap
import lime
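
Note that word_tokenize relies on NLTK’s punkt tokenizer models. If they are not already present on your system, they can be downloaded with a standard one-time NLTK call (this is a general NLTK requirement, not a step specific to this chapter’s code):

nltk.download('punkt')   # downloads the tokenizer models used by word_tokenize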

Leveraging SHAP’s KernelExplainer for local interpretations with SHAP values

For this section, and for subsequent use, we will train a Support Vector Classifier (SVC) model first.

Training a C-SVC model

SVM is a family of model classes that operate in high-dimensional space to find an optimal hyperplane that separates the classes with the maximum margin between them. Support vectors are the points closest to the decision boundary (the dividing hyperplane) that would change it if they were removed. To find the best hyperplane, SVMs minimize a cost function called hinge loss and use a computationally cheap method for operating in high-dimensional space called the kernel trick. Even though a hyperplane suggests linear separability, SVMs are not limited to a linear kernel.

The scikit-learn implementation we will use is called C-SVC. SVC uses an L2 regularization parameter called C and, by default, uses a kernel called the Radial Basis Function (RBF),...
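
As a rough sketch of how the training and the subsequent KernelExplainer interpretation could look, assuming the tabular features and labels are already split into X_train, X_test, y_train, and y_test (these names, the sample sizes, and the force-plot indexing are illustrative assumptions, not the book’s exact code):

svm_mdl = svm.SVC(kernel='rbf', C=1.0, probability=True, random_state=42)
svm_mdl.fit(X_train, y_train)
print(metrics.accuracy_score(y_test, svm_mdl.predict(X_test)))

# KernelExplainer only needs a prediction function and a background sample;
# a small background set keeps the (slow) permutation sampling tractable
svm_explainer = shap.KernelExplainer(svm_mdl.predict_proba, shap.sample(X_train, 50))
svm_shap_values = svm_explainer.shap_values(X_test.iloc[:10], nsamples=200)

# Depending on the shap version, shap_values may be a list with one array per
# class (assumed here) or a single 3-D array; index 1 is the positive class
shap.force_plot(
    svm_explainer.expected_value[1],
    svm_shap_values[1][0],
    X_test.iloc[0],
    matplotlib=True
)

Setting probability=True is what makes predict_proba available: it calibrates the SVC’s decision scores into probabilities using Platt scaling (see the Platt reference under Further reading).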

Employing LIME

Until now, the model-agnostic interpretation methods we’ve covered attempt to reconcile the totality of a model’s outputs with its inputs. For these methods to get a good idea of how and why X becomes y_pred, we need some data first. Then, we perform simulations with this data, pushing variations of it into the model and evaluating what comes out. Sometimes, they even leverage a global surrogate to connect the dots. From what we learn in this process, we derive feature importance values that quantify a feature’s impact, interactions, or decisions on a global level. For many methods, such as SHAP, these can be observed locally too. However, even when they can be observed locally, what was quantified globally may not apply locally. For this reason, there should be another approach that quantifies the local effects of features solely for local interpretation, one such as LIME!

What is LIME?

LIME trains local surrogates to explain...
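
To make the idea concrete, here is a hedged sketch of requesting a local surrogate explanation for a single tabular instance with the lime library; the variable names (X_train, X_test, fitted_model) and class labels are placeholders, not the book’s exact code:

from lime.lime_tabular import LimeTabularExplainer

lime_explainer = LimeTabularExplainer(
    X_train.values,                          # background data used to sample perturbations
    feature_names=list(X_train.columns),
    class_names=['Not Highly Recomm.', 'Highly Recomm.'],   # assumed labels
    discretize_continuous=True
)
lime_exp = lime_explainer.explain_instance(
    X_test.values[0],                        # the single observation to explain
    fitted_model.predict_proba,              # any classifier's probability function
    num_features=8
)
lime_exp.show_in_notebook()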

Using LIME for NLP

At the beginning of the chapter, we set aside training and test datasets with the cleaned-up contents of all the “tastes” columns for NLP. We can take a peek at the test dataset for NLP, as follows:

print(X_test_nlp)

This outputs the following:

1194                 roasty nutty rich
77      roasty oddly sweet marshmallow
121              balanced cherry choco
411                sweet floral yogurt
1259           creamy burnt nuts woody
                     ...              
327          sweet mild molasses bland
1832          intense fruity mild sour
464              roasty sour milk note
2013           nutty fruit sour floral
1190           rich roasty nutty smoke
Length: 734, dtype: object

No machine learning model can ingest the data as text, so we need to turn it into a numerical format—in other words, vectorize it. There are many techniques we can use to do this. In our case, we are not interested in the position of words...
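
As a rough sketch of how that could look (the pipeline, classifier choice, and variable names such as X_train_nlp and y_train are assumptions, not the book’s exact code), the taste text can be vectorized with TF-IDF inside a pipeline so that LIME’s text explainer can later perturb raw strings directly:

from lime.lime_text import LimeTextExplainer

# The vectorizer and classifier are wrapped in one pipeline so that
# predict_proba accepts raw strings
vectorizer = TfidfVectorizer(lowercase=False)
nlp_pipeline = make_pipeline(vectorizer, lgb.LGBMClassifier(random_state=42))
nlp_pipeline.fit(X_train_nlp, y_train)

text_explainer = LimeTextExplainer(class_names=['Not Highly Recomm.', 'Highly Recomm.'])
text_exp = text_explainer.explain_instance(
    X_test_nlp.iloc[0],                      # a single bar's taste words
    nlp_pipeline.predict_proba,              # the pipeline handles vectorization internally
    num_features=6
)
text_exp.show_in_notebook()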

Trying SHAP for NLP

Most of SHAP’s explainers will work with tabular data. DeepExplainer can handle text but is restricted to deep learning models, and, as we will cover in Chapter 7, Visualizing Convolutional Neural Networks, three of them handle images, including KernelExplainer. In fact, SHAP’s KernelExplainer was designed to be a general-purpose, truly model-agnostic method, but it’s not promoted as an option for NLP. It is easy to understand why: it’s slow, and NLP models tend to be very complex, with hundreds, if not thousands, of features to boot. In cases such as this one, where word order is not a factor, there are only a few hundred features, and the top 100 of them are present in most of your observations, KernelExplainer could work.

In addition to overcoming the high computation cost, there are a couple of technical hurdles you would need to overcome. One of them is that KernelExplainer is compatible with a pipeline, but it expects a single...
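
One way around these hurdles, sketched below under the same assumptions as the earlier NLP pipeline (the vectorizer, pipeline, and variable names are hypothetical), is to hand KernelExplainer the already-vectorized numeric features and only the final classifier’s probability function, rather than the raw-text pipeline:

# Vectorize once so KernelExplainer receives a plain numeric matrix
X_train_vec = vectorizer.transform(X_train_nlp).toarray()
X_test_vec = vectorizer.transform(X_test_nlp).toarray()

# A small background sample and a capped nsamples keep the run time manageable
background = shap.sample(X_train_vec, 50)
nlp_explainer = shap.KernelExplainer(
    nlp_pipeline.steps[-1][1].predict_proba,  # the classifier at the end of the pipeline
    background
)
nlp_shap_values = nlp_explainer.shap_values(X_test_vec[:20], nsamples=200)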

Comparing SHAP with LIME

As you will have noticed by now, both SHAP and LIME have limitations, but they also have strengths. SHAP is grounded in game theory and approximated Shapley values, so its SHAP values are supported by theory. These values have great properties, such as additivity, efficiency, and substitutability, that make them consistent, although they violate the dummy property. The values always add up, and no parameter tuning is needed to accomplish this. However, SHAP is more suited to global interpretations, and one of its most model-agnostic explainers, KernelExplainer, is painfully slow. KernelExplainer also deals with missing values by using random ones, which can put too much weight on unlikely observations.

LIME is speedy, truly model-agnostic, and adaptable to all kinds of data. However, it’s not grounded in strict and consistent principles; rather, it relies on the intuition that neighbors are alike. Because of this, it can require tricky parameter tuning to define the neighborhood...

Mission accomplished

The mission was to understand why one of your client’s bars is Outstanding while another one is Disappointing. Your approach employed the interpretation of machine learning models to arrive at the following conclusions:

  • According to SHAP on the tabular model, the Outstanding bar owes that rating to its berry taste and its cocoa percentage of 70%. On the other hand, the unfavorable rating for the Disappointing bar is due mostly to its earthy flavor and bean country of origin (Other). Review date plays a smaller role, but it seems that chocolate bars reviewed in that period (2013–15) were at an advantage.
  • LIME confirms that cocoa_percent<=70 is a desirable property, and that, in addition to berry, creamy, cocoa, and rich are favorable tastes, while sweet, sour, and molasses are unfavorable.
  • The commonality between both methods using the tabular model is that despite the many non-taste-related attributes, taste features are among the most salient. Therefore...

Summary

In this chapter, we learned how to use SHAP’s KernelExplainer, as well as its decision and force plots, to conduct local interpretations. We carried out a similar analysis using LIME’s instance explainer for both tabular and text data. Lastly, we looked at the strengths and weaknesses of SHAP’s KernelExplainer and LIME. In the next chapter, we will learn how to create even more human-interpretable explanations of a model’s decisions, such as “if X conditions are met, then Y is the outcome”.

Dataset sources

Further reading

  • Platt, J. C. (1999). Probabilistic Outputs for Support Vector Machines and Comparisons to Regularized Likelihood Methods. Advances in Large Margin Classifiers, MIT Press. https://www.cs.colorado.edu/~mozer/Teaching/syllabi/6622/papers/Platt1999.pdf
  • Lundberg, S. & Lee, S. (2017). A Unified Approach to Interpreting Model Predictions. Advances in Neural Information Processing Systems, 30. https://arxiv.org/abs/1705.07874 (documentation for SHAP: https://github.com/slundberg/shap)
  • Ribeiro, M. T., Singh, S. & Guestrin, C. (2016). "Why Should I Trust You?": Explaining the Predictions of Any Classifier. Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. http://arxiv.org/abs/1602.04938
  • Ke, G., Meng, Q., Finley, T., Wang, T., Chen, W., Ma, W., Ye, Q. & Liu, T. (2017). LightGBM: A Highly Efficient Gradient Boosting Decision Tree. Advances in Neural Information Processing Systems, 30, pp. 3149-3157. https...
