Search icon
Arrow left icon
All Products
Best Sellers
New Releases
Books
Videos
Audiobooks
Learning Hub
Newsletters
Free Learning
Arrow right icon
Data Wrangling with R

You're reading from  Data Wrangling with R

Product type Book
Published in Feb 2023
Publisher Packt
ISBN-13 9781803235400
Pages 384 pages
Edition 1st Edition
Languages
Concepts
Author (1):
Gustavo R Santos Gustavo R Santos
Profile icon Gustavo R Santos

Table of Contents (21) Chapters

Preface Part 1: Load and Explore Data
Chapter 1: Fundamentals of Data Wrangling Chapter 2: Loading and Exploring Datasets Chapter 3: Basic Data Visualization Part 2: Data Wrangling
Chapter 4: Working with Strings Chapter 5: Working with Numbers Chapter 6: Working with Date and Time Objects Chapter 7: Transformations with Base R Chapter 8: Transformations with Tidyverse Libraries Chapter 9: Exploratory Data Analysis Part 3: Data Visualization
Chapter 10: Introduction to ggplot2 Chapter 11: Enhanced Visualizations with ggplot2 Chapter 12: Other Data Visualization Options Part 4: Modeling
Chapter 13: Building a Model with R Chapter 14: Build an Application with Shiny in R Conclusion Other Books You May Enjoy

Creating new variables

Creating new variables can be useful for data scientists when they need to analyze something that is not present in the data as it was acquired. Common tasks to create new data are splitting a column, creating a calculation, encoding text, and applying a custom function over a variable.

We went over some good examples of column splitting in this book, such as a datetime split. Now, to illustrate the separate() function from tidyr, the example to be used is based on the Census Income dataset. Look at the target column: it has values such as <=50k and > 50k. Let’s say we wanted to separate only the > or <= signs and put them in a separate column; here is how to do that:

# Split variable target into sign and amount
df_no_na %>% separate(target, into=c("sign", "amt"), sep="\\b")

We took the dataset clean of NAs and separated the target column into two new variables: sign and amt. To accomplish that, the...

lock icon The rest of the chapter is locked
Register for a free Packt account to unlock a world of extra content!
A free Packt account unlocks extra newsletters, articles, discounted offers, and much more. Start advancing your knowledge today.
Unlock this book and the full library FREE for 7 days
Get unlimited access to 7000+ expert-authored eBooks and videos courses covering every tech area you can think of
Renews at $15.99/month. Cancel anytime}