Reader small image

You're reading from  Hands-On Exploratory Data Analysis with R

Product typeBook
Published inMay 2019
Reading LevelIntermediate
PublisherPackt
ISBN-139781789804379
Edition1st Edition
Languages
Tools
Right arrow
Authors (2):
Radhika Datar
Radhika Datar
author image
Radhika Datar

Radhika Datar has more than 5 years' experience in software development and content writing. She is well versed in frameworks such as Python, PHP, and Java, and regularly provides training on them. She has been working with Educba and Eduonix as a training consultant since June 2016, while also working as a freelance academic writer in data science and data analytics. She obtained her master's degree from the Symbiosis Institute of Computer Studies and Research and her bachelor's degree from K. J. Somaiya College of Science and Commerce.
Read more about Radhika Datar

Harish Garg
Harish Garg
author image
Harish Garg

Harish Garg is a Principal Software Developer, author, and co-founder of a software development and training company, Bignumworks. Harish has more than 19 years of experience in a wide variety of technologies, including blockchain, data science and enterprise software. During this time, he has worked for companies such as McAfee, Intel, etc.
Read more about Harish Garg

View More author details
Right arrow

Preface

As the name suggests, Hands-On Exploratory Data Analysis with R practically demonstrates the complete process of exploratory data analysis. In this book you will learn about the complete process of exploratory data analysis using R and some of its most popular and powerful packages. You will understand the concepts of data analysis right from data ingestion, data cleaning, and data manipulation, to applying statistical techniques and visualizing hidden patterns. By the end of this book, you will be able to expand your real-world R knowledge by means of practical real-world data analysis projects.

Who this book is for

This book is for you if you are looking to build a strong foundation in data analysis. Whether you are a data analyst, data engineer, software engineer, product manager, or anybody who is involved with data in any way, this book will sharpen your skills in the overarching workflow of exploratory data analysis.

What this book covers

Chapter 1, Setting Up Our Data Analysis Environment, introduces the overall goal of this book. This chapter stipulates how exploratory data analysis benefits business and has a significant impact across almost all verticals.

Chapter 2, Importing Diverse Datasets, demonstrates practical, hands-on code examples on reading in all kinds of data into R for exploratory data analysis. This chapter also covers how to use advanced options while importing datasets such as delimited data, Excel data, JSON data, and data from web APIs.

Chapter 3, Examining, Cleaning, and Filtering, introduces how to identify and clean missing and erroneous data formats. This chapter also covers concepts such as data manipulation, wrangling, and reshaping.

Chapter 4, Visualizing Data Graphically with ggplot2, demonstrates how to draw different kinds of plots and charts, including scatter plots, histograms, probability plots, residual plots, boxplots, and block plots.

Chapter 5, Creating Aesthetically Pleasing Reports with knitr and R Markdown, explains how to use RStudio to wrap your code, graphics, plots, and findings in a complete and informative data analysis report. The chapter will also look at how to publish these in different formats for different audiences using R Markdown and packages such as knitr.

Chapter 6, Univariate and Control Datasets, takes a real-world univariate and control dataset and runs an entire exploratory data analysis workflow on it using the R packages and techniques.

Chapter 7, Time Series Datasets, introduces a time series dataset and describes how to use exploratory data analysis techniques to analyze this data.

Chapter 8, Multivariate Datasets, introduces a dataset from the multivariate problem category. This chapter explains how to use exploratory data analysis techniques to analyze this data, as well as how to use the exploratory data analysis techniques of the star plot, the scatter plot matrix, the conditioning plot, and their principal components.

Chapter 9, Multi-Factor Datasets, introduces a multi-factor dataset and explains how to use exploratory data analysis techniques to analyze this data.

Chapter 10, Handling Optimization and Regression Data Problems, introduces a dataset from the regression problem category and describes how to use exploratory data analysis techniques to analyze this data. It also shows how to learn and apply these exploratory data analysis techniques.

Chapter 11, Next Steps, covers how to build a roadmap for yourself to consolidate the skills you have learned in this book and gain further expertise in the field of data science with R.

To get the most out of this book

In order to get the most out of this book, you should already be familiar with the basics of the R programming language and you should possess at least a rudimentary knowledge of data analysis, irrespective of the tool or programming language. If you would like to learn the basics of R, we recommend one of our excellent Packt titles.

Download the example code files

You can download the example code files for this book from your account at www.packt.com. If you purchased this book elsewhere, you can visit www.packt.com/support and register to have the files emailed directly to you.

You can download the code files by following these steps:

  1. Log in or register at www.packt.com.
  2. Select the SUPPORT tab.
  3. Click on Code Downloads & Errata.
  4. Enter the name of the book in the Search box and follow the onscreen instructions.

Once the file is downloaded, please make sure that you unzip or extract the folder using the latest version of:

  • WinRAR/7-Zip for Windows
  • Zipeg/iZip/UnRarX for Mac
  • 7-Zip/PeaZip for Linux

The code bundle for the book is also hosted on GitHub at https://github.com/PacktPublishing/Hands-On-Exploratory-Data-Analysis-with-R. In case there's an update to the code, it will be updated on the existing GitHub repository.

We also have other code bundles from our rich catalog of books and videos available at https://github.com/PacktPublishing/. Check them out!

Download the color images

Code in Action

Conventions used

There are a number of text conventions used throughout this book.

CodeInText: Indicates code words in text, database table names, folder names, filenames, file extensions, pathnames, dummy URLs, user input, and Twitter handles. Here is an example: "Mount the downloaded WebStorm-10*.dmg disk image file as another disk in your system."

A block of code is set as follows:

html, body, #map {
height: 100%;
margin: 0;
padding: 0
}

When we wish to draw your attention to a particular part of a code block, the relevant lines or items are set in bold:

[default]
exten => s,1,Dial(Zap/1|30)
exten => s,2,Voicemail(u100)
exten => s,102,Voicemail(b100)
exten => i,1,Voicemail(s0)

Any command-line input or output is written as follows:

$ mkdir css
$ cd css

Bold: Indicates a new term, an important word, or words that you see on screen. For example, words in menus or dialog boxes appear in the text like this. Here is an example: "Select System info from the Administration panel."

Warnings or important notes appear like this.
Tips and tricks appear like this.

Get in touch

Feedback from our readers is always welcome.

General feedback: If you have questions about any aspect of this book, mention the book title in the subject of your message and email us at customercare@packtpub.com.

Errata: Although we have taken every care to ensure the accuracy of our content, mistakes do happen. If you have found a mistake in this book, we would be grateful if you would report this to us. Please visit www.packt.com/submit-errata, selecting your book, clicking on the Errata Submission Form link, and entering the details.

Piracy: If you come across any illegal copies of our works in any form on the internet, we would be grateful if you would provide us with the location address or website name. Please contact us at copyright@packt.com with a link to the material.

If you are interested in becoming an author: If there is a topic that you have expertise in, and you are interested in either writing or contributing to a book, please visit authors.packtpub.com.

Reviews

Please leave a review. Once you have read and used this book, why not leave a review on the site that you purchased it from? Potential readers can then see and use your unbiased opinion to make purchase decisions, we at Packt can understand what you think about our products, and our authors can see your feedback on their book. Thank you!

For more information about Packt, please visit packt.com.

lock icon
The rest of the chapter is locked
You have been reading a chapter from
Hands-On Exploratory Data Analysis with R
Published in: May 2019Publisher: PacktISBN-13: 9781789804379
Register for a free Packt account to unlock a world of extra content!
A free Packt account unlocks extra newsletters, articles, discounted offers, and much more. Start advancing your knowledge today.
undefined
Unlock this book and the full library FREE for 7 days
Get unlimited access to 7000+ expert-authored eBooks and videos courses covering every tech area you can think of
Renews at $15.99/month. Cancel anytime

Authors (2)

author image
Radhika Datar

Radhika Datar has more than 5 years' experience in software development and content writing. She is well versed in frameworks such as Python, PHP, and Java, and regularly provides training on them. She has been working with Educba and Eduonix as a training consultant since June 2016, while also working as a freelance academic writer in data science and data analytics. She obtained her master's degree from the Symbiosis Institute of Computer Studies and Research and her bachelor's degree from K. J. Somaiya College of Science and Commerce.
Read more about Radhika Datar

author image
Harish Garg

Harish Garg is a Principal Software Developer, author, and co-founder of a software development and training company, Bignumworks. Harish has more than 19 years of experience in a wide variety of technologies, including blockchain, data science and enterprise software. During this time, he has worked for companies such as McAfee, Intel, etc.
Read more about Harish Garg