Search icon
Arrow left icon
All Products
Best Sellers
New Releases
Books
Videos
Audiobooks
Learning Hub
Newsletters
Free Learning
Arrow right icon
Data Wrangling with R

You're reading from  Data Wrangling with R

Product type Book
Published in Feb 2023
Publisher Packt
ISBN-13 9781803235400
Pages 384 pages
Edition 1st Edition
Languages
Concepts
Author (1):
Gustavo R Santos Gustavo R Santos
Profile icon Gustavo R Santos

Table of Contents (21) Chapters

Preface Part 1: Load and Explore Data
Chapter 1: Fundamentals of Data Wrangling Chapter 2: Loading and Exploring Datasets Chapter 3: Basic Data Visualization Part 2: Data Wrangling
Chapter 4: Working with Strings Chapter 5: Working with Numbers Chapter 6: Working with Date and Time Objects Chapter 7: Transformations with Base R Chapter 8: Transformations with Tidyverse Libraries Chapter 9: Exploratory Data Analysis Part 3: Data Visualization
Chapter 10: Introduction to ggplot2 Chapter 11: Enhanced Visualizations with ggplot2 Chapter 12: Other Data Visualization Options Part 4: Modeling
Chapter 13: Building a Model with R Chapter 14: Build an Application with Shiny in R Conclusion Other Books You May Enjoy

Modeling

Training

Now that the new dataset has been created, the next step is to replace 1 with is_spam and 0 with not_spam so that the random forest algorithm can understand that the target variable is not numeric and that it is a classification model. We can do this by using the recode() function within a mutate function:

# Replace the binary 1(spam) and 0(not_spam)
spam_for_model <- spam_for_model %>% 
  mutate( spam= recode(spam, '1'='is_spam','0'='not_spam')    )

Now, it is time to separate the data into train and test subsets. The train subset is used to present the model with the patterns and the labels associated with it so that it can study how to classify each observation according to the patterns that occur. The test set is like a school test, where new data is presented to the trained model so that we can measure how accurate it is or how much it has learned.

As we learned during the...

lock icon The rest of the chapter is locked
Register for a free Packt account to unlock a world of extra content!
A free Packt account unlocks extra newsletters, articles, discounted offers, and much more. Start advancing your knowledge today.
Unlock this book and the full library FREE for 7 days
Get unlimited access to 7000+ expert-authored eBooks and videos courses covering every tech area you can think of
Renews at $15.99/month. Cancel anytime}