You're reading from  Machine Learning with Python

Product type: Book
Published in: Mar 2024
Publisher: Packt
ISBN-13: 9781835461969
Edition: 1st Edition
Author (1)
Oliver Theobald

Oliver Theobald, a technical writer and best-selling author, excels in AI, fintech, and cloud computing. With global experience, he now resides between China and Japan, deepening his expertise in technology. As an instructor, Oliver emphasizes clarity and engagement, stripping away jargon to make complex topics accessible. His courses aim to empower both beginners and professionals with practical skills for success in the tech industry, making learning both effective and enjoyable.

EXPLORATORY DATA ANALYSIS

 

In this chapter, we introduce how to manage data as a Pandas dataframe, along with common exploratory data analysis (EDA) techniques.

As a key part of data inspection, EDA involves summarizing the salient characteristics of your dataset before you move on to further processing and analysis. This includes understanding the shape and distribution of the data, scanning for missing values, learning which features are most relevant based on correlation, and familiarizing yourself with the overall contents of the dataset. Gathering this information helps to inform algorithm selection and highlights the parts of the data that require cleaning.
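As a rough illustration of these checks, here is a minimal sketch using Pandas. The dataframe and its column names (`bedrooms`, `distance_km`, `price`) are invented for this example and are not taken from the chapter's dataset; the same calls should apply to any numeric dataframe.

```python
import numpy as np
import pandas as pd

# Hypothetical toy dataset, invented purely for illustration
df = pd.DataFrame({
    "bedrooms": [2, 3, 3, 4, 5],
    "distance_km": [12.0, 8.5, np.nan, 5.0, 2.5],
    "price": [300, 420, 450, 610, 800],
})

# Shape of the data: (number of rows, number of columns)
shape = df.shape

# Overall contents: preview the first rows
preview = df.head()

# Distribution: count, mean, standard deviation, min, quartiles, and max per column
summary = df.describe()

# Missing values: number of nulls in each column
missing = df.isnull().sum()

# Relevance: pairwise Pearson correlation, then each feature's correlation with price
corr = df.corr()
price_corr = corr["price"].drop("price").sort_values(ascending=False)

print(shape)
print(preview)
print(summary)
print(missing)
print(price_corr)
```

Here, a column with a non-zero entry in `missing` is a candidate for cleaning, while features whose correlation with the target is close to zero may be less useful for prediction.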

Using Pandas, there’s a range of simple techniques we can use to summarize data and additional options to visualize the data using Seaborn and Matplotlib.

Let’s begin by importing Pandas, Seaborn, and Matplotlib inline using the following code in Jupyter Notebook.

 

import...
