Reader small image

You're reading from  Jupyter Cookbook

Product typeBook
Published inApr 2018
Reading LevelIntermediate
PublisherPackt
ISBN-139781788839440
Edition1st Edition
Languages
Tools
Right arrow
Author (1)
Dan Toomey
Dan Toomey
author image
Dan Toomey

Dan Toomey has been developing application software for over 20 years. He has worked in a variety of industries and companies, in roles from sole contributor to VP/CTO-level. For the last few years, he has been contracting for companies in the eastern Massachusetts area. Dan has been contracting under Dan Toomey Software Corp. Dan has also written R for Data Science, Jupyter for Data Sciences, and the Jupyter Cookbook, all with Packt.
Read more about Dan Toomey

Right arrow

Producing a Scatter plot matrix using R


A Scatter plot matrix is a useful device to display a miniature Scatter plot of every variable in your dataset against every other variable. The resulting display gives you a quick scan to determine variables that may be related.

How to do it...

Use this script:

# load the iris dataset
data <- read.csv("http://archive.ics.uci.edu/ml/machine-learning-databases/iris/iris.data")

#Let us also clean up the data so as to be more readable
colnames(data) <- c("sepal_length", "sepal_width", "petal_length", "petal_width", "species")

pairs(data)

This produces this graphic:

The pairs graphic shows petal width and petal length as related (fairly good straight lines of the plot points), and little relationship between sepal length and sepal width.

How it works...

The pairs function draws upon the underlying plot to walk through all pairs of data points in the dataset and produce a Scatter plot. I have used this many times to get a quick handle on which variables...

lock icon
The rest of the page is locked
Previous PageNext Page
You have been reading a chapter from
Jupyter Cookbook
Published in: Apr 2018Publisher: PacktISBN-13: 9781788839440

Author (1)

author image
Dan Toomey

Dan Toomey has been developing application software for over 20 years. He has worked in a variety of industries and companies, in roles from sole contributor to VP/CTO-level. For the last few years, he has been contracting for companies in the eastern Massachusetts area. Dan has been contracting under Dan Toomey Software Corp. Dan has also written R for Data Science, Jupyter for Data Sciences, and the Jupyter Cookbook, all with Packt.
Read more about Dan Toomey