Packt+ | Advance your knowledge in tech

You're reading from Machine Learning for Mobile

Product typeBook

Published inDec 2018

PublisherPackt

ISBN-139781788629355

Edition1st Edition

Tools

Android TensorFlow

Concepts

Machine Learning

Authors (2):

Revathi Gopalakrishnan

Avinash Venkateswarlu

View More author details

Chapter 7. Spam Message Detection

This chapter will provide you with an overview of natural language processing (NLP) and discuss how NLP can be combined with machine learning to provide solutions to problems. Then, the chapter will take a real-world use case of doingspam message detection by utilizing NLP, combined with the linear SVM classification model. The program will be implemented as a mobile application using Core ML for iOS.

To handle text in machine learning algorithms, we will go through the various NLP techniques that will be used on the text data to make it ready for learning algorithms. Once the text is prepared, we will see how we can classify it using the linear SVM model.

Problem definition: The bulk SMS message data is provided, and these messages need to be classified as spam or non-spam messages.

We will be covering the following topics in this chapter:

Understanding NLP
Understanding the linear SVM algorithm
Solving the problem using linear SVM in Core ML:
- Technical requirements...

Understanding NLP

NLP is a huge topic, and it is beyond the scope of this book to go into detail on the subject. However, in this section, we will go through the high-level details of NLP and try to understand the key concepts required to prepare and process the textual data using NLP, in order to make it ready for consumption by machine learning algorithms for prediction.

Introducing NLP

Huge, unstructured textual data is getting generated on a daily basis. Social media, websites such as Twitter and Facebook, and communication apps, such as WhatsApp, generate an enormous volume of this unstructured data daily—not to mention the volume created by blogs, news articles, product reviews, service reviews, advertisements, emails, and SMS. So, to summarize, there is huge data (in TBS).

However, it is not possible for a computer to get any insight from this data and to carry out specific actions based on the insights, directly from this huge data, because of the following reasons:

The data is unstructured...

Understanding linear SVM algorithm

InChapter 2, Supervised and Unsupervised Learning Algorithms, we covered the SVM algorithm and now have an idea of how the SVM model works. A linear support vector machine or linear SVM is a linear classifier that tries to find a hyperplane with the largest margin that splits the input space into two regions.

Note

A hyperplane is a generalization of a plane. In one dimension, a hyperplane is called a point. In two dimensions, it is a line. In three dimensions, it is a plane. In more dimensions, you can call it a hyperplane.

As we saw, the goal of SVM is to identify the hyperplane that tries to find the largest margin that splits the input space into two regions. If the input space is linearly separable, it is easy to separate them. However, in real life, we find that the input space is very non-linear:

In the preceding scenario, the SVM can help us separate the red and blue balls by using what is called a Kernel Trick, which is the method of using a linear...

Solving the problem using linear SVM in Core ML

In this section, we are going to look at how we can solve the spam message detection problem using all the concepts we have gone through in this chapter.

We are going to take a bunch of SMS messages and attempt to classify them as spam or non-spam. This is a classification problem and we will use the linear SVM algorithm to perform this, considering the advantages of using this algorithm for text classification.

We are going to use NLP techniques to convert the data-SMS messages into a feature vector to feed into the linear SVM algorithm. We are going to use the scikit-learn vectorizer methods to transform the SMS messages into the TF-IDF vector, which could be fed into the linear SVM model to perform SMS spam detection (classification into spam and non-spam).

About the data

The data that we are using to create the model that detects the spam messages is taken from http://www.dt.fee.unicamp.br/~tiago/smsspamcollection/, which contains 747 spam...

Summary

In this chapter, we went through many things, such as, understanding NLP at a high level. There are various steps involved in NLP, such as text preprocessing, as well as techniques to carry this out, such as feature engineering and methods to perform feature engineering and classification or clustering of the feature vectors. We also looked into the linear SVM algorithm in which we went through the details of the SVM algorithm, the kernel function, and how it is more applicable to text classification.

We solved our problem using linear SVM in Core ML and we also saw a practical example of performing spam message detection using the linear SVM algorithm model that we developed in scikit learn and converted into a Core ML model. We wrote an iOS application using the converted Core ML model.

In the next chapter, we will be introduced to another ML framework, Fritz, which tries to solve the common problems that we see in model deployment and upgrades, and the unification of handling ML...

The rest of the chapter is locked

You have been reading a chapter from

Machine Learning for Mobile

Published in: Dec 2018Publisher: PacktISBN-13: 9781788629355

A free Packt account unlocks extra newsletters, articles, discounted offers, and much more. Start advancing your knowledge today.

undefined

Unlock this book and the full library FREE for 7 days

Get unlimited access to 7000+ expert-authored eBooks and videos courses covering every tech area you can think of

Start free trial

Renews at $15.99/month. Cancel anytime

Authors (2)

Revathi Gopalakrishnan

Revathi Gopalakrishnan is a software professional with more than 17 years of experience in the IT industry. She has worked extensively in mobile application development and has played various roles, including developer and architect, and has led various enterprise mobile enablement initiatives for large organizations. She has also worked on a host of consumer applications for various customers around the globe. She has an interest in emerging areas, and machine learning is one of them. Through this book, she has tried to bring out how machine learning can make mobile application development more interesting and super cool. Revathi resides in Chennai and enjoys her weekends with her husband and her two lovely daughters.
Read more about Revathi Gopalakrishnan

Avinash Venkateswarlu

Avinash Venkateswarlu has more than 3 years' experience in IT and is currently exploring mobile machine learning. He has worked in enterprise mobile enablement projects and is interested in emerging technologies such as mobile machine learning and cryptocurrency. Venkateswarlu works in Chennai, but enjoys spending his weekends in his home town, Nellore. He likes to do farming or yoga when he is not in front of his laptop exploring emerging technologies.
Read more about Avinash Venkateswarlu

Other recommended products

Related to this chapter

Python Machine Learning Workbook for Beginners

Through a series of machine learning and data science projects, this book represents a beginner-friendly crash course to Python’s practical application in businesses and your own career.

BookMar 2021279 pages

Machine Learning Projects for Mobile Applications

Machine learning on mobile devices is the next big thing. This book presents the implementation of 7 practical, real-world projects that will teach you how to leverage TensorFlow Lite and Core ML to perform efficient machine learning on a cross-platform mobile OS. You will get to work on image, text, and video datasets through these projects.

BookOct 2018246 pages

Intelligent Mobile Projects with TensorFlow

Google TensorFlow is used to train all the models deployed and running on mobile devices. This book covers 10 projects on the implementation of all major AI areas of iOS, Android, and Raspberry Pi: computer vision, speech and language processing, and machine learning.

BookMay 2018404 pages

Machine Learning with Core ML

Discover the world of ML through the lens and application of Core ML. We will take you through examples; each example provides a new use case uncovering how ML can be applied specifically to computer vision tasks. By the end of the book, you will have the intuition and skills required to boost your iOS applications with the help of machine learning.

BookJun 2018378 pages

Mobile Artificial Intelligence Projects

Artificial intelligence (AI) is rapidly becoming the most popular topic in business and science. This book introduces AI concepts and their use cases with a hands-on and application-focused approach. We will cover a range of projects covering tasks such as automated reasoning, facial recognition, digital assistants, auto text generation, and more.

BookMar 2019312 pages

Mastering Firebase for Android Development

Firebase is a completely scalable, real-time backend service and provides all the tools necessary to develop rich, collaborative applications using client side code. This books will take deep dive into the features of Firebase by exploring its complete toolchain.

BookJun 2018394 pages

Mobile Deep Learning with TensorFlow Lite, ML Kit and Flutter

Deep learning is rapidly becoming the most popular topic in the industry. This book introduces trending deep learning concepts and their use cases with an industrial and application-focused approach. You will cover a range of projects covering tasks such as mobile vision, facial recognition, smart AI assistant, augmented reality, and more.

BookApr 2020380 pages

Hands-On Machine Learning on Google Cloud Platform

In this book, you will learn how to create powerful machine learning based applications for a wide variety of problems leveraging different data services from the Google Cloud Platform. Finally, you will know the main difficulties that you may encounter and get appropriate strategies to overcome these difficulties and build efficient systems.

BookApr 2018500 pages

Hands-On Machine Learning with Microsoft Excel 2019

Machine learning has become a core necessity for every business and organization. With this book, you will learn to analyze your Excel data to search for patterns and return a series of interesting facts or trends about the data. You will learn to perform machine learning tasks using Excel plugins and APIs without much code required.

BookApr 2019254 pages

Machine Learning with Swift

Machine learning has become a hot topic for developers who want to impart intelligent functionality to their applications. In this book, we'll show you how to incorporate various machine learning libraries available for iOS developers. You’ll quickly get acquainted with the machine learning fundamentals and implement various algorithms with Swift.

BookFeb 2018378 pages

Hands-On Machine Learning with IBM Watson

A practical guide on Machine learning with IBM cloud to act as a solid yet concise reference for the readers. You will learn about the role of data representation and feature extraction in machine learning. This book will help you learn how to use the IBM Cloud and Watson Machine learning service to develop real-world machine learning solutions.

BookMar 2019288 pages

Hands-On Artificial Intelligence on Google Cloud Platform

This book focuses on the use of powerful AI tools offered by Google Cloud Platform to develop and design intelligent applications on the cloud. You will start with topics that set the foundation for using GCP with various powerful libraries, and then move on to building end to end AI applications using them.

BookMar 2020350 pages

Personalised recommendations for you

Based on your interests and search pattern

Et al.

Ever wonder why speech recognition systems don't understand the Scottish accent, or what would happen if an astronaut only ate mac 'n' cheese, or other spurious reflections you'd have at a bar? We did, then collated those deliberations into absurd research articles with fake figures and methodologies inspired by even more fictionally absurd studies.

BookAug 2023230 pages5

Generative AI with LangChain

This book is a comprehensive introduction to LLMs and LangChain, demystifying the basic mechanics of LangChain, its functionalities, and the myriad of applications it can be integrated into.

BookDec 2023360 pages4

Generative AI with LangChain

This book is a comprehensive introduction to LLMs and LangChain, demystifying the basic mechanics of LangChain, its functionalities, and the myriad of applications it can be integrated into.

BookDec 2023360 pages5

Generative AI with LangChain

This book is a comprehensive introduction to LLMs and LangChain, demystifying the basic mechanics of LangChain, its functionalities, and the myriad of applications it can be integrated into.

BookDec 2023360 pages1

Generative AI with LangChain

This book is a comprehensive introduction to LLMs and LangChain, demystifying the basic mechanics of LangChain, its functionalities, and the myriad of applications it can be integrated into.

BookDec 2023360 pages5

Mastering Tableau 2023

This book is a comprehensive resource to mastering your Tableau skills and becoming a BI expert. As you progress, you will learn how to build advanced dashboards and improve your storytelling to derive key business insight, as well as make you well-versed with advanced functionalities of Tableau in the business intelligence domain.

BookAug 2023684 pages

Building AI Applications with ChatGPT APIs

This guide covers all ChatGPT API features for effortless creation of robust AI powered apps. With its help, you’ll be able to leverage ChatGPT’s cutting-edge NLP models to take your app development skills to the next level. You’ll also work on ten exciting projects that will give you the practical know-how that you can apply to your existing applications.

BookSep 2023258 pages5

Building AI Applications with ChatGPT APIs

This guide covers all ChatGPT API features for effortless creation of robust AI powered apps. With its help, you’ll be able to leverage ChatGPT’s cutting-edge NLP models to take your app development skills to the next level. You’ll also work on ten exciting projects that will give you the practical know-how that you can apply to your existing applications.

BookSep 2023258 pages2

Data Engineering with AWS

Embark on a journey to master data engineering pipelines on AWS! Our book offers a hands-on experience of AWS services for ingesting, transforming, and consuming data. Whether you're an absolute beginner or someone with basic data engineering experience, this guide is an indispensable resource.

BookOct 2023636 pages5

Modern Data Architecture on AWS

Every organization wants an agile, performant, and cost-effective data platform that meets all their current and future business needs. Purpose-built AWS analytics services and their features play a big part in building such a modern data platform. This book brings to you all the design and architectural patterns that’ll help you achieve this goal.

BookAug 2023420 pages5

Practical Guide to Applied Conformal Prediction in Python

Discover the power of Conformal Prediction with the "Practical Guide to Applied Conformal Prediction in Python." Master the latest techniques to quantify uncertainty in machine learning and computer vision models, and seamlessly apply them to your industry applications.

BookDec 2023240 pages

TinyML Cookbook

With over 70 project-based recipes, the TinyML Cookbook is a practical guide that will help you to get the most out of your microcontrollers. It provides a comprehensive understanding of the theoretical foundations while giving you hands-on experience training ML models for deployment on Arduino Nano 33 BLE Sense, Raspberry Pi Pico, and SparkFun RedBoard Artemis Nano microcontrollers.

BookNov 2023664 pages