Reader small image

You're reading from  Data Wrangling with R

Product typeBook
Published inFeb 2023
PublisherPackt
ISBN-139781803235400
Edition1st Edition
Concepts
Right arrow
Author (1)
Gustavo R Santos
Gustavo R Santos
author image
Gustavo R Santos

Gustavo R Santos has worked in the Technology Industry for 13 years, improving processes, and analyzing datasets and creating dashboards. Since 2020, he has been working as a Data Scientist in the retail industry, wrangling, analyzing, visualizing and modeling data with the most modern tools like R, Python and Databricks. Gustavo also gives lectures from time to time at an online school about Data Science concepts. He has a background in Marketing, is certified as Data Scientist by the Data Science Academy Brazil and pursues his specialist MBA in Data Science at the University of São Paulo
Read more about Gustavo R Santos

Right arrow

Introduction to stringr

There is a lot to learn about strings. Even though R is a language that was created with statistics in mind, it has developed a lot over the years and many libraries have emerged. As already mentioned, working with strings is a good skill to have given that you will often need to deal with these objects in your daily work as a data scientist. Sentiment analysis of clients or social media, comments analysis in feedback forms, the analysis of textual information scraped from the internet, or simply parsing a city name out of an address are some of the many tasks that can be part of a data wrangling request.

To code along with this chapter, make sure that you have installed and loaded the following libraries. Of them, you may be missing the Gutenberg package. Therefore, I suggest that you use install.package("gutenbergr") before trying to load it. We will use it for an exercise at the end of this chapter:

# Use install.packages("library_name...
lock icon
The rest of the page is locked
Previous PageNext Page
You have been reading a chapter from
Data Wrangling with R
Published in: Feb 2023Publisher: PacktISBN-13: 9781803235400

Author (1)

author image
Gustavo R Santos

Gustavo R Santos has worked in the Technology Industry for 13 years, improving processes, and analyzing datasets and creating dashboards. Since 2020, he has been working as a Data Scientist in the retail industry, wrangling, analyzing, visualizing and modeling data with the most modern tools like R, Python and Databricks. Gustavo also gives lectures from time to time at an online school about Data Science concepts. He has a background in Marketing, is certified as Data Scientist by the Data Science Academy Brazil and pursues his specialist MBA in Data Science at the University of São Paulo
Read more about Gustavo R Santos