You're reading from  Data Wrangling with R

Product type: Book
Published in: Feb 2023
Publisher: Packt
ISBN-13: 9781803235400
Edition: 1st Edition
Author: Gustavo R Santos

Gustavo R Santos has worked in the technology industry for 13 years, improving processes, analyzing datasets, and creating dashboards. Since 2020, he has been working as a Data Scientist in the retail industry, wrangling, analyzing, visualizing, and modeling data with modern tools such as R, Python, and Databricks. Gustavo also lectures from time to time at an online school on Data Science concepts. He has a background in Marketing, is certified as a Data Scientist by the Data Science Academy Brazil, and is pursuing a specialist MBA in Data Science at the University of São Paulo.

Working with multiple variables

A graphic can show more than the two variables plotted on the x and y axes. Colors, marker shapes, and sizes can each encode an additional variable, differentiating the data points and creating a richer visual. Look at these basic examples.
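As a minimal sketch of encoding extra variables with color and shape, the snippet below assumes the ggplot2 package and R's built-in mtcars dataset. Both are assumptions made for illustration, as is the choice of the transmission column (am) for the marker shape; the names `cars` and `p_multi` are hypothetical.

```r
library(ggplot2)

# Assumption: the built-in mtcars dataset stands in for the car data.
cars <- mtcars

# Treat cylinders and transmission as categories, not numbers,
# so ggplot2 assigns discrete colors and shapes.
cars$cyl <- factor(cars$cyl)
cars$am <- factor(cars$am, labels = c("automatic", "manual"))

# x and y carry two variables; color and shape carry two more.
p_multi <- ggplot(cars, aes(x = hp, y = mpg, color = cyl, shape = am)) +
  geom_point(size = 3) +
  labs(
    x = "Horsepower (hp)",
    y = "Miles per gallon (mpg)",
    color = "Cylinders",
    shape = "Transmission"
  )

print(p_multi)
```

Converting the columns to factors first is a design choice: left numeric, cyl would get a continuous color gradient, and ggplot2 does not accept a continuous variable for shape.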

Scatterplots are well suited to multivariate plots, because the points can take different shapes, sizes, or colors to produce a very rich visual. The number of cylinders (cyl) and the horsepower (hp) directly affect the fuel efficiency of a car (mpg), so a good starting point for exploration is to visualize how fuel efficiency responds as cylinders and horsepower increase. To do that, we draw a scatterplot showing the relationship between the engine's horsepower and the car's miles per gallon. Then, we add the cylinder count as a third variable that controls the size of the bubbles, making them larger or smaller and bringing more information to the graphic:

# Scatterplot 3 variables...
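A minimal sketch of the bubble plot described above might look as follows. It assumes the ggplot2 package and R's built-in mtcars dataset, whose hp, mpg, and cyl columns match the variables named in the text. The object name `bubble`, the color, the transparency, and the size range are illustrative choices, not taken from the book.

```r
library(ggplot2)

# Assumption: mtcars (built into R) supplies hp, mpg, and cyl.
# hp goes on the x axis, mpg on the y axis, and cyl sets the bubble size.
bubble <- ggplot(mtcars, aes(x = hp, y = mpg, size = cyl)) +
  # Semi-transparent points keep overlapping bubbles readable.
  geom_point(alpha = 0.5, color = "royalblue") +
  # Show only the cylinder counts present in the data in the legend.
  scale_size_continuous(breaks = c(4, 6, 8), range = c(2, 8)) +
  labs(
    title = "Fuel efficiency by horsepower and cylinders",
    x = "Horsepower (hp)",
    y = "Miles per gallon (mpg)",
    size = "Cylinders"
  )

print(bubble)
```

Because cyl stays numeric here, ggplot2 scales the point size continuously; the explicit `breaks` only control which values appear in the legend.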