Reader small image

You're reading from  Hands-On Exploratory Data Analysis with R

Product typeBook
Published inMay 2019
Reading LevelIntermediate
PublisherPackt
ISBN-139781789804379
Edition1st Edition
Languages
Tools
Right arrow
Authors (2):
Radhika Datar
Radhika Datar
author image
Radhika Datar

Radhika Datar has more than 5 years' experience in software development and content writing. She is well versed in frameworks such as Python, PHP, and Java, and regularly provides training on them. She has been working with Educba and Eduonix as a training consultant since June 2016, while also working as a freelance academic writer in data science and data analytics. She obtained her master's degree from the Symbiosis Institute of Computer Studies and Research and her bachelor's degree from K. J. Somaiya College of Science and Commerce.
Read more about Radhika Datar

Harish Garg
Harish Garg
author image
Harish Garg

Harish Garg is a Principal Software Developer, author, and co-founder of a software development and training company, Bignumworks. Harish has more than 19 years of experience in a wide variety of technologies, including blockchain, data science and enterprise software. During this time, he has worked for companies such as McAfee, Intel, etc.
Read more about Harish Garg

View More author details
Right arrow

Cleaning the dataset

Data cleaning, or rather, tidying up the data is the process of transforming raw data into specific consistent data, which includes analysis in a simpler manner. The R programming language includes a set of comprehensive tools that are specifically designed to clean the data in an effective manner. We will focus on cleaning the dataset over here in a specific way.

The following steps are carried out to perform cleaning attributes of datasets or data frames:

  1. Include the libraries that are required to clean and tidy up the dataset as follows:
> library(dplyr)
> library(tidyr)
  1. Analyze the summary of our dataset as shown here, which will help us to focus on which attributes we need to work on:
> summary(GlassDataset)
Id RI Na Mg Al Si

Min. : 1.00 Min. :1.511 Min. :10.73 Min. :0.000 Min. :0.290 Min. :69.81

1st Qu.: 54.25 1st Qu.:1.517 1st Qu.:12.91 1st...
lock icon
The rest of the page is locked
Previous PageNext Page
You have been reading a chapter from
Hands-On Exploratory Data Analysis with R
Published in: May 2019Publisher: PacktISBN-13: 9781789804379

Authors (2)

author image
Radhika Datar

Radhika Datar has more than 5 years' experience in software development and content writing. She is well versed in frameworks such as Python, PHP, and Java, and regularly provides training on them. She has been working with Educba and Eduonix as a training consultant since June 2016, while also working as a freelance academic writer in data science and data analytics. She obtained her master's degree from the Symbiosis Institute of Computer Studies and Research and her bachelor's degree from K. J. Somaiya College of Science and Commerce.
Read more about Radhika Datar

author image
Harish Garg

Harish Garg is a Principal Software Developer, author, and co-founder of a software development and training company, Bignumworks. Harish has more than 19 years of experience in a wide variety of technologies, including blockchain, data science and enterprise software. During this time, he has worked for companies such as McAfee, Intel, etc.
Read more about Harish Garg