You're reading from Automated Machine Learning with AutoKeras

Product type: Book

Published in: May 2021

Reading level: Beginner

Publisher: Packt

ISBN-13: 9781800567641

Edition: 1st Edition

Languages: Python

Tools: Keras

Concepts: Deep Learning

Author: Luis Sobrecueva

Chapter 7: Sentiment Analysis Using AutoKeras

Let's start by defining the term in the title. Sentiment analysis is a text classification task that uses natural language processing (NLP) together with machine learning (ML) to interpret and classify the emotions expressed in text.

To get an idea of this, let's imagine the task of determining whether a review for a film is positive or negative. You could do this yourself just by reading it, right? However, if our boss sends us a list of 1,000 movie reviews for tomorrow, things become complicated. That's where sentiment analysis becomes an interesting option.

In this chapter, we will use a text classifier to extract sentiments from text data. Most of the concepts of text classification were already explained in Chapter 4, Image Classification and Regression Using AutoKeras, so in this chapter, we will apply them in a practical way by implementing a sentiment predictor...

Technical requirements

All the code examples in this book are available as Jupyter notebooks that can be downloaded from https://github.com/PacktPublishing/Automated-Machine-Learning-with-AutoKeras.

Because code cells are executable, each notebook can install its own requirements; you just need to add a code cell that lists the packages you need. For this reason, at the beginning of each notebook, there is a code cell for environment setup that installs AutoKeras and its dependencies.

So, to run the code examples for this chapter, you only need a computer with Ubuntu Linux as your OS, on which you can install Jupyter Notebook with the following command:

$ apt-get install python3-pip jupyter-notebook

Alternatively, you can also run these notebooks using Google Colaboratory, in which case you will only need a web browser. See the AutoKeras with Google Colaboratory section of Chapter 2, Getting Started with AutoKeras, for more details. Furthermore, in the Installing AutoKeras section of that chapter...

Creating a sentiment analyzer

The model we are going to create will be a binary sentiment classifier (1=Positive/0=Negative) trained on the IMDb sentiments dataset. This dataset contains 25,000 sentiment-labeled movie reviews for training and 25,000 for testing:

Figure 7.1 – Example of sentiment analysis being used on two samples

Similar to the Reuters example from Chapter 4, Image Classification and Regression Using AutoKeras, each review is encoded as a list of word indexes (integers). For convenience, words are indexed by their overall frequency in the dataset. So, for instance, the integer 3 encodes the third most frequent word in the data.
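The frequency-based encoding described above can be sketched in a few lines of plain Python. This is only an illustration on a hypothetical three-review corpus, not the code that builds the real dataset; the names `reviews`, `word_to_id`, `encode`, and `decode` are made up for the example, and details such as reserved indices for padding or unknown words are ignored.

```python
from collections import Counter

# Hypothetical toy corpus; the real dataset is far larger.
reviews = [
    "the film was great",
    "the film was bad",
    "great acting and a great story",
]

# Count every word across all reviews.
counts = Counter(word for review in reviews for word in review.split())

# Rank words from most to least frequent (ties broken alphabetically).
ranked = sorted(counts, key=lambda w: (-counts[w], w))

# Index 1 is the most frequent word, index 2 the second most frequent, etc.
word_to_id = {word: rank for rank, word in enumerate(ranked, start=1)}
id_to_word = {rank: word for word, rank in word_to_id.items()}


def encode(review):
    """Turn a review into a list of word indexes."""
    return [word_to_id[word] for word in review.split()]


def decode(ids):
    """Turn a list of word indexes back into text."""
    return " ".join(id_to_word[i] for i in ids)


encoded = encode("the film was great")
print(encoded)          # [3, 2, 4, 1]
print(decode(encoded))  # the film was great
```

Here "great" appears three times, so it receives index 1, while the rarer words receive larger indexes.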

The notebook that contains the complete source code can be found at https://github.com/PacktPublishing/Automated-Machine-Learning-with-AutoKeras/blob/main/Chapter07/Chapter7_IMDB_sentiment_analysis.ipynb.

Now, let's have a look at the relevant...

Creating the sentiment predictor

Now, we will use the AutoKeras TextClassifier to find the best classification model. Just for this example, we will set max_trials (the maximum number of different Keras models to try) to 2. We do not need to set the epochs parameter; instead, we define an EarlyStopping callback with a patience of 2 epochs so that the training process stops if the validation loss does not improve for two consecutive epochs:

import autokeras as ak
import tensorflow as tf
clf = ak.TextClassifier(max_trials=2)
cbs = [tf.keras.callbacks.EarlyStopping(patience=2)]
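To make the effect of the patience setting concrete, here is a simplified sketch of the stopping rule in plain Python. It is not the Keras implementation (it ignores options such as a minimum improvement threshold), the function name `stopping_epoch` is hypothetical, and the validation losses are invented for illustration.

```python
def stopping_epoch(val_losses, patience=2):
    """Return the 1-based epoch at which training would stop, or None."""
    best = float("inf")
    wait = 0  # number of epochs since the best validation loss so far
    for epoch, loss in enumerate(val_losses, start=1):
        if loss < best:
            # Improvement: remember the new best loss and reset the counter.
            best = loss
            wait = 0
        else:
            wait += 1
            if wait >= patience:
                return epoch
    return None


# Hypothetical validation losses, one per epoch.
losses = [0.50, 0.40, 0.41, 0.42, 0.30]
print(stopping_epoch(losses, patience=2))  # 4
```

With these numbers, training stops after epoch 4, so the better loss that would have appeared at epoch 5 is never reached. This is the trade-off a small patience value makes in exchange for shorter training.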

Let's run the training process and search for the optimal classifier for the training dataset:

clf.fit(x_train, y_train, callbacks=cbs)

Here is the output:

Figure 7.3 – Notebook output of text classifier training

The previous output shows that the accuracy on the training dataset is increasing.

As we can see, we are getting a loss of 0.28 in the validation set. This isn't bad just for a few minutes of training...

Evaluating the model

Now, it's time to evaluate the best model with the testing dataset:

clf.evaluate(x_test, y_test)

Here is the output:

782/782 [==============================] - 41s 52ms/step - loss: 0.3118 - accuracy: 0.8724
 
[0.31183066964149475, 0.8723599910736084]

As we can see, the model reaches an accuracy of 0.8724 on the test set, which is a good result for the little training time we've invested.
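The numbers in the evaluation output can be related back to the dataset with some quick arithmetic. The sketch below assumes a batch size of 32, which is the Keras default and is not stated in the output itself; the other values are taken from the text above.

```python
import math

test_samples = 25_000          # size of the IMDb test split
batch_size = 32                # assumption: default batch size
accuracy = 0.8723599910736084  # second value returned by clf.evaluate

# Number of batches needed to cover the whole test set.
steps = math.ceil(test_samples / batch_size)

# Approximate number of correctly and incorrectly classified reviews.
correct = round(accuracy * test_samples)
wrong = test_samples - correct

print(steps)    # 782
print(correct)  # 21809
print(wrong)    # 3191
```

Under that assumption, the 782 steps in the progress bar correspond to 782 batches, and the reported accuracy corresponds to roughly 21,809 of the 25,000 test reviews being classified correctly.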

Visualizing the model

Now, we can view a short summary of the architecture of the best generated model:

model = clf.export_model()
model.summary()

Here is the output:

Figure 7.4 – Best model architecture summary

As we can see, AutoKeras has chosen a convolutional model (Conv1D) for this task, as we saw in the classification example in Chapter 4, Image Classification and Regression Using AutoKeras. As we explained at the beginning of that chapter, this kind of architecture works really well when the order of the input sentences is not important for the prediction; here, there are no correlations between the different movie reviews.

Here is a visual representation of the architecture:
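One way to build intuition for why a convolutional model can ignore where a pattern occurs is a toy example: a 1D convolution followed by global max pooling gives the same response whether the pattern sits at the start or at the end of a sequence. The sketch below uses scalar inputs and a hand-picked kernel, both hypothetical, and is not the architecture that AutoKeras produced.

```python
def conv1d(sequence, kernel):
    """Slide the kernel over the sequence (no padding, stride 1)."""
    width = len(kernel)
    return [
        sum(k * x for k, x in zip(kernel, sequence[i:i + width]))
        for i in range(len(sequence) - width + 1)
    ]


def global_max_pool(features):
    """Keep only the strongest response, discarding its position."""
    return max(features)


# Hand-picked kernel that responds most strongly to the pattern 1, 0, 1.
kernel = [1.0, -1.0, 1.0]

# The same pattern placed at the start and at the end of a sequence.
early = [1.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0]
late = [0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 1.0]

print(conv1d(early, kernel))                   # [2.0, -1.0, 1.0, 0.0, 0.0]
print(global_max_pool(conv1d(early, kernel)))  # 2.0
print(global_max_pool(conv1d(late, kernel)))   # 2.0
```

In a real text model, the inputs would be word embeddings and the kernels would be learned, but the principle is the same: each filter acts as a detector for a short local pattern.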

Figure 7.5 – Best model architecture visualization graph

As you already know, generating the models and choosing the best one is done by AutoKeras automatically, but let's explain these blocks in more detail.

Each block represents...

Analyzing the sentiment in specific sentences

Now, let's take a look at some predicted samples from the test set:

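import tensorflow as tf

# Only show TensorFlow errors so that the predictions are easy to read
tf.get_logger().setLevel('ERROR')

def get_sentiment(val):
    # Map the numeric class to a readable label
    return "Positive" if val == 1 else "Negative"

for i in range(10):
    print(x_test[i])
    label = get_sentiment(y_test[i][0])
    prediction = get_sentiment(clf.predict(x_test[i:i+1])[0][0])
    print("label: %s, prediction: %s" % (label, prediction))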

Here is the output of the preceding code:

Figure 7.6 – Some predictions based on the first 10 sentences of the test dataset

As you can see, the model predictions match every label for the first 10 samples in the test dataset.
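Rather than comparing labels and predictions by eye, the agreement can also be tallied programmatically. The following sketch uses hypothetical label and prediction lists that deliberately contain two mismatches (unlike the output above, where all ten samples match) so that the reporting is visible.

```python
def get_sentiment(val):
    """Map the numeric class to a readable label."""
    return "Positive" if val == 1 else "Negative"


# Hypothetical values, invented for illustration only.
labels = [1, 0, 1, 1, 0, 0, 1, 0, 1, 0]
predictions = [1, 0, 1, 0, 0, 0, 1, 1, 1, 0]

# Count how many predictions agree with their label.
matches = sum(label == pred for label, pred in zip(labels, predictions))

# Collect the index, expected label, and predicted label of each mismatch.
mismatches = [
    (i, get_sentiment(label), get_sentiment(pred))
    for i, (label, pred) in enumerate(zip(labels, predictions))
    if label != pred
]

print("%d of %d predictions match" % (matches, len(labels)))
for index, expected, predicted in mismatches:
    print("sample %d: label %s, prediction %s" % (index, expected, predicted))
```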

Summary

In this chapter, we learned about the importance of sentiment analysis in the real world, as well as how to extract sentiments from text data and how to implement a sentiment predictor in just a few lines of code.

In the next chapter, we will cover a very interesting topic: we will use AutoKeras to classify news topics based on their content by using a text classifier.

Luis Sobrecueva is a senior software engineer and ML/DL practitioner currently working at Cabify. He has been a contributor to the OpenAI project as well as one of the contributors to the AutoKeras project.