Data Wrangling with R


Product type Book
Published in Feb 2023
Publisher Packt
ISBN-13 9781803235400
Pages 384 pages
Edition 1st Edition
Author: Gustavo R Santos

Table of Contents (21 chapters)

Preface
Part 1: Load and Explore Data
Chapter 1: Fundamentals of Data Wrangling
Chapter 2: Loading and Exploring Datasets
Chapter 3: Basic Data Visualization
Part 2: Data Wrangling
Chapter 4: Working with Strings
Chapter 5: Working with Numbers
Chapter 6: Working with Date and Time Objects
Chapter 7: Transformations with Base R
Chapter 8: Transformations with Tidyverse Libraries
Chapter 9: Exploratory Data Analysis
Part 3: Data Visualization
Chapter 10: Introduction to ggplot2
Chapter 11: Enhanced Visualizations with ggplot2
Chapter 12: Other Data Visualization Options
Part 4: Modeling
Chapter 13: Building a Model with R
Chapter 14: Build an Application with Shiny in R
Conclusion
Other Books You May Enjoy

Understanding the project

When starting a project, we need a purpose – that is, a goal we want to reach at the end. After all, knowing the problem is part of the solution. In Lewis Carroll's Alice’s Adventures in Wonderland, the Cheshire Cat tells Alice that if she does not care where she wants to go, then it does not matter which path she takes.

So, let’s begin by understanding the project, or where we want to go.

The dataset

The input data for this project is the Spambase Data Set (https://tinyurl.com/23xwdcah), which can be found in the UCI Machine Learning Repository. See the citation information in the Further reading section at the end of this chapter for more.

It contains 4,601 observations and 57 explanatory variables. Of those, 48 are floating-point numbers, each giving the percentage (from 0 to 100) of words in the message that match a specific word associated with spam. There are six other variables with special characters such...
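
To make the word-percentage features more concrete, here is a minimal sketch in base R of how such a value could be computed for a single message. The function name word_freq_pct, the sample message, and the tokenization rule (a word is any run of letters or digits) are assumptions made for illustration; they are not taken from the book or from the dataset's own preprocessing.

```r
# Hypothetical helper (not from the book): percentage, on a 0-100 scale,
# of the words in a message that match one specific word.
word_freq_pct <- function(message, word) {
  # Assumption: a "word" is any run of letters or digits, case-insensitive
  tokens <- unlist(strsplit(tolower(message), "[^a-z0-9]+"))
  tokens <- tokens[nzchar(tokens)]          # drop empty tokens
  if (length(tokens) == 0) return(0)        # avoid dividing by zero
  100 * sum(tokens == tolower(word)) / length(tokens)
}

# Made-up message: "free" appears 3 times among 10 words
msg <- "Free money now! Claim your free prize, free of charge."
word_freq_pct(msg, "free")   # 30
```

A value of 30 here means that 30% of the words in the message are "free". Under this reading, each of the 48 word features would hold one such percentage for one specific word.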
