10 Machine Learning Blueprints You Should Know for Cybersecurity


Product type: Book
Published: May 2023
Publisher: Packt
ISBN-13: 9781804619476
Pages: 330
Edition: 1st
Author: Rajvardhan Oak

Table of Contents (15 chapters)

Preface
Chapter 1: On Cybersecurity and Machine Learning
Chapter 2: Detecting Suspicious Activity
Chapter 3: Malware Detection Using Transformers and BERT
Chapter 4: Detecting Fake Reviews
Chapter 5: Detecting Deepfakes
Chapter 6: Detecting Machine-Generated Text
Chapter 7: Attributing Authorship and How to Evade It
Chapter 8: Detecting Fake News with Graph Neural Networks
Chapter 9: Attacking Models with Adversarial Machine Learning
Chapter 10: Protecting User Privacy with Differential Privacy
Chapter 11: Protecting User Privacy with Federated Machine Learning
Chapter 12: Breaking into the Sec-ML Industry
Index
Other Books You May Enjoy

Naïve detection

In this section, we will focus on naïve methods for detecting bot-generated text. We will first create our own dataset, then extract features from it, and finally apply machine learning models to determine whether a given text is machine-generated.
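To make the "extract features" step concrete, here is a minimal sketch of what simple, hand-crafted text features might look like. The specific features (word count, average word length, average sentence length, punctuation ratio, and type-token ratio) and the function name `extract_features` are illustrative assumptions, not necessarily the features used later in the chapter.

```python
import re
import string


def extract_features(text: str) -> dict:
    """Compute a few simple surface statistics for one document.

    The feature set here is a hypothetical example chosen for illustration.
    """
    # Tokenize on whitespace, then strip surrounding punctuation and lowercase.
    tokens = [w.strip(string.punctuation).lower() for w in text.split()]
    tokens = [t for t in tokens if t]

    # Split into rough sentences on terminal punctuation.
    sentences = [s for s in re.split(r"[.!?]+", text) if s.strip()]

    # Count ASCII punctuation characters in the raw text.
    n_punct = sum(ch in string.punctuation for ch in text)

    return {
        "num_words": len(tokens),
        "avg_word_length": sum(map(len, tokens)) / len(tokens) if tokens else 0.0,
        "avg_sentence_length": len(tokens) / len(sentences) if sentences else 0.0,
        "punctuation_ratio": n_punct / len(text) if text else 0.0,
        "type_token_ratio": len(set(tokens)) / len(tokens) if tokens else 0.0,
    }
```

Each document then becomes one row of numeric values, which is the kind of input a standard classifier expects.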

Creating the dataset

The task we will focus on is detecting bot-generated fake news. However, the concepts and techniques we will learn are fairly generic and can be applied to parallel tasks such as detecting bot-generated tweets, reviews, and posts. Because no such dataset is readily available to the public, we will create our own.

How do we create our dataset? We will use the News Aggregator dataset (https://archive.ics.uci.edu/ml/datasets/News+Aggregator) from the UCI Machine Learning Repository. The dataset contains a set of news articles in the form of links to the articles on the web. We will scrape these articles and treat them as our human-generated examples. Then, we will use the article title as a prompt...
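The reading and scraping steps for the human-generated articles could be sketched as follows, using only the Python standard library. This is a rough sketch under stated assumptions: the column layout in `COLUMNS` (a tab-separated file with no header row) is an assumption about the corpus file, and the helper names `read_articles`, `html_to_text`, and `fetch_article` are hypothetical. Collecting the text of `<p>` tags is a crude heuristic; real news pages often need site-specific handling.

```python
import csv
import io
import urllib.request
from html.parser import HTMLParser

# Assumed column layout of the tab-separated corpus file (no header row).
COLUMNS = ["id", "title", "url", "publisher", "category",
           "story", "hostname", "timestamp"]


def read_articles(tsv_text: str) -> list[dict]:
    """Parse the corpus text into one dict per well-formed row."""
    reader = csv.reader(io.StringIO(tsv_text), delimiter="\t",
                        quoting=csv.QUOTE_NONE)
    # Skip rows that do not have the expected number of columns.
    return [dict(zip(COLUMNS, row)) for row in reader
            if len(row) == len(COLUMNS)]


class ParagraphExtractor(HTMLParser):
    """Collect the text found inside <p> elements."""

    def __init__(self):
        super().__init__()
        self._in_p = False
        self.paragraphs = []

    def handle_starttag(self, tag, attrs):
        if tag == "p":
            self._in_p = True
            self.paragraphs.append("")

    def handle_endtag(self, tag):
        if tag == "p":
            self._in_p = False

    def handle_data(self, data):
        if self._in_p:
            self.paragraphs[-1] += data


def html_to_text(html: str) -> str:
    """Return the non-empty paragraph texts of a page, one per line."""
    parser = ParagraphExtractor()
    parser.feed(html)
    parser.close()
    return "\n".join(p.strip() for p in parser.paragraphs if p.strip())


def fetch_article(url: str, timeout: float = 10.0) -> str:
    """Download one page and return its paragraph text (needs network access)."""
    request = urllib.request.Request(url, headers={"User-Agent": "Mozilla/5.0"})
    with urllib.request.urlopen(request, timeout=timeout) as response:
        charset = response.headers.get_content_charset() or "utf-8"
        return html_to_text(response.read().decode(charset, errors="replace"))
```

In practice, many of the links will be dead or blocked, so calls to `fetch_article` should be wrapped in error handling and failed rows simply skipped.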
