Mastering Python for Data Science: Explore the world of data science through Python and learn how to make sense of data

Chapter 1. Getting Started with Raw Data

In the world of data science, raw data comes in many forms and sizes, and a lot of information can be extracted from it. To give an example, Amazon collects clickstream data that records every click the user makes on the website. This data can be utilized to understand whether a user is price sensitive or prefers more highly rated products. The recommended products you see on Amazon are derived from such data.

The first step towards such an analysis would be to parse raw data. The parsing of the data involves the following steps:

  • Extracting data from the source: Data can come in many forms, such as Excel, CSV, JSON, databases, and so on. Python makes it very easy to read data from these sources with the help of some useful packages, which will be covered in this chapter.
  • Cleaning the data: Once a sanity check has been done, one needs to clean the data appropriately so that it can be utilized for analysis. You may have a dataset about students of a class and details about their height, weight, and marks. There may also be certain rows with the height or weight missing. Depending on the analysis being performed, these rows with missing values can either be ignored or replaced with the average height or weight.

In this chapter we will cover the following topics:

  • Exploring arrays with NumPy
  • Handling data with pandas
  • Reading and writing data from various formats
  • Handling missing data
  • Manipulating data

The world of arrays with NumPy

Python comes with a built-in list data structure that can be used for array-like operations, but a Python list on its own is not suitable for heavy mathematical operations, as it is not optimized for them.

NumPy is a wonderful Python package created by Travis Oliphant fundamentally for scientific computing. It provides large multidimensional arrays and matrices, along with a large library of high-level mathematical functions that operate on these arrays.

A NumPy array requires much less memory to store the same amount of data than a Python list, which also makes reading from and writing to the array faster.
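
As a rough, illustrative check of this claim (exact numbers vary by platform and Python version; lst and arr are throwaway names), you can compare the array's raw buffer size with the size of the equivalent list object:

>>> import sys
>>> import numpy as np
>>> lst = list(range(1000))
>>> arr = np.arange(1000)
>>> arr.nbytes   # raw buffer: 1,000 elements x 8 bytes each for int64
8000
>>> sys.getsizeof(lst)   # the list object alone; each element is also a separate Python int object on top of this
8056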

Creating an array

A list of numbers can be passed to the following array function to create a NumPy array object:

>>> import numpy as np

>>> n_array = np.array([[0, 1, 2, 3],
                 [4, 5, 6, 7],
                 [8, 9, 10, 11]])

A NumPy array object has a number of attributes, which help in giving information about the array. Here are its important attributes:

  • ndim: This gives the number of dimensions of the array. The following shows that the array that we defined has two dimensions:
    >>> n_array.ndim
    2
    

    n_array has a rank of 2, that is, it is a 2D array.

  • shape: This gives the size of each dimension of the array:
    >>> n_array.shape
    (3, 4)
    

    The first dimension of n_array has a size of 3 and the second dimension has a size of 4. This can be also visualized as three rows and four columns.

  • size: This gives the number of elements:
    >>> n_array.size
    12
    

    The total number of elements in n_array is 12.

  • dtype: This gives the datatype of the elements in the array:
    >>> n_array.dtype.name
    int64
    

    The numbers are stored as int64 in n_array on most 64-bit platforms; the default integer type can differ (for example, int32 on Windows builds of older NumPy). If you need a specific type, it can be set explicitly, as shown after this list.
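
The dtype can be chosen explicitly when the array is created, for instance to force floating-point storage; f_array here is just an illustrative name:

>>> f_array = np.array([0, 1, 2, 3], dtype=np.float64)
>>> f_array.dtype.name
float64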

Mathematical operations

When you have an array of data, you would like to perform certain mathematical operations on it. We will now discuss a few of the important ones in the following sections.

Array subtraction

The following commands subtract the b array from the a array to get the resultant c array. The subtraction happens element by element:

>>> a = np.array( [11, 12, 13, 14])
>>> b = np.array( [ 1, 2, 3, 4])
>>> c = a - b
>>> c
array([10, 10, 10, 10])

Do note that when you subtract two arrays, they should have the same shape, unless NumPy can broadcast one of them to match the other.
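
As a quick illustration with made-up values, mismatched shapes raise an error, while a single scalar is broadcast across every element:

>>> a - np.array([1, 2, 3])
ValueError: operands could not be broadcast together with shapes (4,) (3,)
>>> a - 10
[1 2 3 4]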

Squaring an array

The following command raises each element to the power of 2 to obtain this result:

>>> b**2
[1  4  9 16]

A trigonometric function performed on the array

The following command applies cosine to each of the values in the b array to obtain the following result:

>>> np.cos(b)
[ 0.54030231 -0.41614684 -0.9899925  -0.65364362]

Conditional operations

The following command will apply a conditional operation to each of the elements of the b array, in order to generate the respective Boolean values:

>>> b<2
[ True False False False]
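
Such a Boolean array can, in turn, be used as a mask to keep only the elements that satisfy the condition:

>>> b[b < 2]
[1]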

Matrix multiplication

Two matrices can be multiplied either element by element or as a dot product. The following commands will perform the element-by-element multiplication:

>>> A1 = np.array([[1, 1],
            [0, 1]])

>>> A2 = np.array([[2, 0],
            [3, 4]])

>>> A1 * A2
[[2 0]
[0 4]]

The dot product can be performed with the following command:

>>> np.dot(A1, A2)
[[5 4]
[3 4]]
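
On Python 3.5 and later, the same dot product can also be written with the @ matrix multiplication operator, which NumPy arrays support:

>>> A1 @ A2
[[5 4]
[3 4]]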

Indexing and slicing

If you want to select a particular element of an array, it can be achieved using indexes:

>>> n_array[0,1]
1

The preceding command selects the first row and then picks the second value within it. It can also be seen as the intersection of the first row and the second column of the matrix.

If a range of values has to be selected on a row, then we can use the following command:

>>> n_array[ 0 , 0:3 ]
[0 1 2]

The 0:3 value selects the first three values of the first row.

The whole row of values can be selected with the following command:

>>> n_array[ 0 , : ]
[0 1 2 3]

An entire column of values can be selected using the following command:

>>> n_array[ : , 1 ]
[1 5 9]

Shape manipulation

Once the array has been created, we can change the shape of it too. The following command flattens the array:

>>> n_array.ravel()
[ 0  1  2  3  4  5  6  7  8  9 10 11]

The following command reshapes the array into a format of six rows and two columns. Also, note that when reshaping, the new shape should have the same number of elements as the previous one:

>>> n_array.shape = (6,2)
>>> n_array
[[ 0  1]
[ 2  3]
[ 4  5]
[ 6  7]
[ 8  9]
[10 11]]

The array can be transposed too:

>>> n_array.transpose()
[[ 0  2  4  6  8 10]
[ 1  3  5  7  9 11]]
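
One dimension can also be left for NumPy to infer by passing -1 to reshape; here, n_array still holds the six-row, two-column data from the earlier reshape:

>>> n_array.reshape(2, -1)   # -1 tells NumPy to work out the second dimension (here, 6)
[[ 0  1  2  3  4  5]
[ 6  7  8  9 10 11]]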

Empowering data analysis with pandas

The pandas library was developed by Wes McKinney when he was working at AQR Capital Management. He wanted a tool that was flexible enough to perform quantitative analysis on financial data. Later, Chang She joined him and helped develop the package further.

The pandas library is an open source Python library, specially designed for data analysis. It has been built on NumPy and makes it easy to handle data. NumPy is a fairly low-level tool that handles matrices really well.

The pandas library brings the richness of R to the world of Python for handling data. It has efficient data structures to process data, perform fast joins, and read data from various sources, to name a few capabilities.

The data structure of pandas

The pandas library essentially has three data structures:

  1. Series
  2. DataFrame
  3. Panel

Series

Series is a one-dimensional array, which can hold any type of data, such as integers, floats, strings, and Python objects too. A series can be created by calling the following:

>>> import pandas as pd
>>> pd.Series(np.random.randn(5))

0    0.733810
1   -1.274658
2   -1.602298
3    0.460944
4   -0.632756
dtype: float64

The random.randn function is part of the NumPy package and generates random numbers. The Series function creates a pandas series whose first column is the index and whose second column consists of the random values. At the bottom of the output is the datatype of the series.

The index of the series can be customized by calling the following:

>>> pd.Series(np.random.randn(5), index=['a', 'b', 'c', 'd', 'e'])

a   -0.929494
b   -0.571423
c   -1.197866
d    0.081107
e   -0.035091
dtype: float64

A series can be derived from a Python dict too:

>>> d = {'A': 10, 'B': 20, 'C': 30}
>>> pd.Series(d)

A    10
B    20
C    30
dtype: int64
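
Values can then be looked up by their index label, as a quick illustration with the series built from the preceding dict:

>>> s = pd.Series(d)
>>> s['B']
20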

DataFrame

DataFrame is a 2D data structure with columns that can be of different datatypes. It can be seen as a table. A DataFrame can be formed from the following data structures:

  • A NumPy array
  • Lists
  • Dicts
  • Series
  • A 2D NumPy array

A DataFrame can be created from a dict of series by calling the following commands:

>>> d = {'c1': pd.Series(['A', 'B', 'C']),
        'c2': pd.Series([1, 2., 3., 4.])}
>>> df = pd.DataFrame(d)
>>> df

   c1  c2
0    A   1
1    B   2
2    C   3
3  NaN   4

The DataFrame can be created using a dict of lists too:

>>> d = {'c1': ['A', 'B', 'C', 'D'],
    'c2': [1, 2.0, 3.0, 4.0]}
>>> df = pd.DataFrame(d)
>>> print(df)
 c1  c2
0  A   1
1  B   2
2  C   3
3  D   4
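
Columns of a DataFrame behave like series, so a new column can be derived from an existing one; c3 below is a made-up column name for illustration:

>>> df['c3'] = df['c2'] * 2   # element-wise arithmetic on the c2 column
>>> df['c3']
0    2.0
1    4.0
2    6.0
3    8.0
Name: c3, dtype: float64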

Panel

A Panel is a data structure that handles 3D data. The following command is an example of panel data:

>>> d = {'Item1': pd.DataFrame(np.random.randn(4, 3)),
    'Item2': pd.DataFrame(np.random.randn(4, 2))}
>>> pd.Panel(d)

<class 'pandas.core.panel.Panel'>
Dimensions: 2 (items) x 4 (major_axis) x 3 (minor_axis)
Items axis: Item1 to Item2
Major_axis axis: 0 to 3
Minor_axis axis: 0 to 2

The preceding output shows that there are two DataFrames, represented by two items. There are four rows, represented by the four major-axis labels, and three columns, represented by the three minor-axis labels.
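
Note that the Panel structure was later deprecated and removed in pandas 0.25. The usual replacement is a DataFrame with a hierarchical (multi-level) index, which can be built from the same dict of DataFrames with concat:

>>> pd.concat(d)   # stacks Item1 and Item2 under an extra index level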

Inserting and exporting data

The data is stored in various forms, such as CSV, TSV, databases, and so on. The pandas library makes it convenient to read data from these formats or to export data to them. We'll use a dataset that contains the weight statistics of school students from the U.S.

We'll be using a file with the following structure:

  • LOCATION CODE: Unique location code
  • COUNTY: The county the school belongs to
  • AREA NAME: The district the school belongs to
  • REGION: The region the school belongs to
  • SCHOOL YEARS: The school year the data is addressing
  • NO. OVERWEIGHT: The number of overweight students
  • PCT OVERWEIGHT: The percentage of overweight students
  • NO. OBESE: The number of obese students
  • PCT OBESE: The percentage of obese students
  • NO. OVERWEIGHT OR OBESE: The number of students who are overweight or obese
  • PCT OVERWEIGHT OR OBESE: The percentage of students who are overweight or obese
  • GRADE LEVEL: Whether they belong to elementary or high school
  • AREA TYPE: The type of area
  • STREET ADDRESS: The address of the school
  • CITY: The city the school belongs to
  • STATE: The state the school belongs to
  • ZIP CODE: The zip code of the school
  • Location 1: The address with longitude and latitude

CSV

To read data from a .csv file, the following read_csv function can be used:

>>> d = pd.read_csv('Data/Student_Weight_Status_Category_Reporting_Results__Beginning_2010.csv')
>>> d[0:5]['AREA NAME']

0    RAVENA COEYMANS SELKIRK CENTRAL SCHOOL DISTRICT
1    RAVENA COEYMANS SELKIRK CENTRAL SCHOOL DISTRICT
2    RAVENA COEYMANS SELKIRK CENTRAL SCHOOL DISTRICT
3                        COHOES CITY SCHOOL DISTRICT
4                        COHOES CITY SCHOOL DISTRICT

The read_csv function takes the path of the .csv file as input. The command after this prints the first five rows of the AREA NAME column in the data.

To write data to a .csv file, the following to_csv function can be used:

>>> d = {'c1': pd.Series(['A', 'B', 'C']),
    'c2': pd.Series([1, 2., 3., 4.])}
>>> df = pd.DataFrame(d)
>>> df.to_csv('sample_data.csv')

The DataFrame is written to a .csv file by using the to_csv method. The path and the filename where the file needs to be created should be mentioned.

XLS

In addition to the pandas package, the xlrd package needs to be installed for pandas to read the data from an Excel file:

>>> d=pd.read_excel('Data/Student_Weight_Status_Category_Reporting_Results__Beginning_2010.xls')

The preceding function is similar to the CSV reading command. To write to an Excel file, the xlwt package needs to be installed:

>>> df.to_excel('sample_data.xls')

JSON

To read the data from a JSON file, Python's standard json package can be used. The following commands help in reading the file:

>>> import json
>>> json_data = open('Data/Student_Weight_Status_Category_Reporting_Results__Beginning_2010.json')
>>> data = json.load(json_data)
>>> json_data.close()

In the preceding code, the open() function opens the file, the json.load() function parses its contents into Python objects, and the json_data.close() function closes the file again.

The pandas library also provides a function to read the JSON file, which can be accessed using pd.read_json().
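
As a minimal sketch of the pandas route (assuming the JSON file is laid out in one of the table-like orientations that read_json understands), the same file could be loaded as follows:

>>> data = pd.read_json('Data/Student_Weight_Status_Category_Reporting_Results__Beginning_2010.json')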

Database

To read data from a database, the following function can be used:

>>> pd.read_sql_table(table_name, con)

Given a table name and an SQLAlchemy engine, the preceding command reads the table into a DataFrame. It does not support DBAPI connections. The following are the descriptions of the parameters used:

  • table_name: This refers to the name of the SQL table in a database
  • con: This refers to the SQLAlchemy engine

The following command reads the result of an SQL query into a DataFrame:

>>> pd.read_sql_query(sql, con)

The following are the descriptions of the parameters used:

  • sql: This refers to the SQL query that is to be executed
  • con: This refers to the SQLAlchemy engine
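
For example, with a hypothetical SQLite database file, school.db, containing a students table (both names are made up for illustration), the flow would look like this:

>>> from sqlalchemy import create_engine
>>> engine = create_engine('sqlite:///school.db')   # hypothetical database file
>>> df = pd.read_sql_query('SELECT * FROM students', engine)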

Data cleansing

Data in its raw form generally requires some cleaning before it can be analyzed or used to build a dashboard. There are many reasons data might have issues. For example, the point-of-sale system at a retail shop might have malfunctioned and recorded some data with missing values. We'll learn how to handle such data in the following section.

Checking the missing data

Generally, most data will have some missing values. There could be various reasons for this: the source system which collects the data might not have collected the values or the values may never have existed. Once you have the data loaded, it is essential to check the missing elements in the data. Depending on the requirements, the missing data needs to be handled. It can be handled by removing a row or replacing a missing value with an alternative value.

In the Student Weight data, to check whether the Location 1 column has missing values, the following command can be utilized:

>>> d['Location 1'].isnull()
0       False
1       False
2       False
3       False
4       False
5       False
6       False

The isnull() method outputs TRUE or FALSE for each row. If it's True, then the value in that row is missing. This output can be aggregated to find the number of instances of missing values:

>>> d['Location 1'].isnull().value_counts()
False    3246
True       24
dtype: int64

The preceding command shows that the Location 1 column has 24 instances of missing values. These missing values can be handled by either removing the rows that contain them or replacing them with some value. To remove the rows where Location 1 is missing, execute the following command:

>>> d = d.dropna(subset=['Location 1'])

To remove all the rows that contain any missing value, use the following command:

>>> d = d.dropna(how='any')
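
Two related options are worth knowing: how='all' drops a row only when every column is missing, and thresh keeps rows that have at least a given number of non-null values:

>>> d = d.dropna(how='all')     # drop rows only if every value is missing
>>> d = d.dropna(thresh=5)      # keep rows with at least 5 non-null values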

Filling the missing data

Let's define some DataFrames to work with:

>>> df = pd.DataFrame(np.random.randn(5, 3), index=['a0', 'a10', 'a20', 'a30', 'a40'],
                  columns=['X', 'Y', 'Z'])
>>> df
            X         Y         Z
a0  -0.854269  0.117540  1.515373
a10 -0.483923 -0.379934  0.484155
a20 -0.038317  0.196770 -0.564176
a30  0.752686  1.329661 -0.056649
a40 -1.383379  0.632615  1.274481

We'll now add some extra row indexes, which will create null values in our DataFrame:

>>> df2 = df.reindex(['a0', 'a1', 'a10', 'a11', 'a20', 'a21', 'a30', 'a31', 'a40', 'a41'])
>>> df2

            X         Y         Z
a0  -1.193371  0.912654 -0.780461
a1        NaN       NaN       NaN
a10  1.413044  0.615997  0.947334
a11       NaN       NaN       NaN
a20  1.583516  1.388921  0.458771
a21       NaN       NaN       NaN
a30  0.479579  1.427625  1.407924
a31       NaN       NaN       NaN
a40  0.455510 -0.880937  1.375555
a41       NaN       NaN       NaN

If you want to replace the null values in the df2 DataFrame with zeros, execute the following command:

>>> df2.fillna(0)

            X         Y         Z
a0  -1.193371  0.912654 -0.780461
a1   0.000000  0.000000  0.000000
a10  1.413044  0.615997  0.947334
a11  0.000000  0.000000  0.000000
a20  1.583516  1.388921  0.458771
a21  0.000000  0.000000  0.000000
a30  0.479579  1.427625  1.407924
a31  0.000000  0.000000  0.000000
a40  0.455510 -0.880937  1.375555
a41  0.000000  0.000000  0.000000

If you want to fill values by forward propagation, which means that the value preceding the null value in the column is used to fill it, the following command can be used:

>>> df2.fillna(method='pad') #filling with forward propagation

            X         Y         Z
a0  -1.193371  0.912654 -0.780461
a1  -1.193371  0.912654 -0.780461
a10  1.413044  0.615997  0.947334
a11  1.413044  0.615997  0.947334
a20  1.583516  1.388921  0.458771
a21  1.583516  1.388921  0.458771
a30  0.479579  1.427625  1.407924
a31  0.479579  1.427625  1.407924
a40  0.455510 -0.880937  1.375555
a41  0.455510 -0.880937  1.375555

If you want to fill the null values of the column with the column mean, then the following command can be utilized:

>>> df2.fillna(df2.mean())

            X         Y         Z
a0  -1.193371  0.912654 -0.780461
a1   0.547655  0.692852  0.681825
a10  1.413044  0.615997  0.947334
a11  0.547655  0.692852  0.681825
a20  1.583516  1.388921  0.458771
a21  0.547655  0.692852  0.681825
a30  0.479579  1.427625  1.407924
a31  0.547655  0.692852  0.681825
a40  0.455510 -0.880937  1.375555
a41  0.547655  0.692852  0.681825
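
In newer versions of pandas, fillna(method='pad') is deprecated in favor of the equivalent ffill() method, so the forward-propagation fill shown earlier can also be written as follows:

>>> df2.ffill()   # same result as df2.fillna(method='pad')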

String operations

Sometimes, you would want to modify a string field column in your data. The following techniques demonstrate some of the string operations:

  • Substring: Let's start by choosing the first five rows of the AREA NAME column in the data as our sample data to modify:
    >>> df = pd.read_csv('Data/Student_Weight_Status_Category_Reporting_Results__Beginning_2010.csv')
    >>> df['AREA NAME'][0:5]
    
    0    RAVENA COEYMANS SELKIRK CENTRAL SCHOOL DISTRICT
    1    RAVENA COEYMANS SELKIRK CENTRAL SCHOOL DISTRICT
    2    RAVENA COEYMANS SELKIRK CENTRAL SCHOOL DISTRICT
    3                        COHOES CITY SCHOOL DISTRICT
    4                        COHOES CITY SCHOOL DISTRICT
    Name: AREA NAME, dtype: object
    

    In order to extract the first word from the Area Name column, we'll use the extract function as shown in the following command:

    >>> df['AREA NAME'][0:5].str.extract(r'(\w+)')
    
    0    RAVENA
    1    RAVENA
    2    RAVENA
    3    COHOES
    4    COHOES
    Name: AREA NAME, dtype: object
    

    In the preceding command, the str attribute of the series is utilized. The str class contains an extract method, where a regular expression could be fed to extract data, which is very powerful. It is also possible to extract a second word in AREA NAME as a separate column:

    >>> df['AREA NAME'][0:5].str.extract(r'(\w+)\s(\w+)')
            0         1
    0  RAVENA  COEYMANS
    1  RAVENA  COEYMANS
    2  RAVENA  COEYMANS
    3  COHOES      CITY
    4  COHOES      CITY
    

    To extract data in different columns, the respective regular expression needs to be enclosed in separate parentheses.

  • Filtering: If we want to filter rows with data on ELEMENTARY school, then the following command can be used:
    >>> df[df['GRADE LEVEL'] == 'ELEMENTARY']
    
  • Uppercase: To convert the area name to uppercase, we'll use the following command:
    >>> df['AREA NAME'][0:5].str.upper()
    0    RAVENA COEYMANS SELKIRK CENTRAL SCHOOL DISTRICT
    1    RAVENA COEYMANS SELKIRK CENTRAL SCHOOL DISTRICT
    2    RAVENA COEYMANS SELKIRK CENTRAL SCHOOL DISTRICT
    3                        COHOES CITY SCHOOL DISTRICT
    4                        COHOES CITY SCHOOL DISTRICT
    Name: AREA NAME, dtype: object
    

    Since the data strings are in uppercase already, there won't be any difference seen.

  • Lowercase: To convert Area Name to lowercase, we'll use the following command:
    >>> df['AREA NAME'][0:5].str.lower()
    0    ravena coeymans selkirk central school district
    1    ravena coeymans selkirk central school district
    2    ravena coeymans selkirk central school district
    3                        cohoes city school district
    4                        cohoes city school district
    Name: AREA NAME, dtype: object
    
  • Length: To find the length of each element of the Area Name column, we'll use the following command:
    >>> df['AREA NAME'][0:5].str.len()
    0    47
    1    47
    2    47
    3    27
    4    27
    Name: AREA NAME, dtype: int64
    
  • Split: To split Area Name based on a whitespace, we'll use the following command:
    >>> df['AREA NAME'][0:5].str.split(' ')
    
    0    [RAVENA, COEYMANS, SELKIRK, CENTRAL, SCHOOL, D...
    1    [RAVENA, COEYMANS, SELKIRK, CENTRAL, SCHOOL, D...
    2    [RAVENA, COEYMANS, SELKIRK, CENTRAL, SCHOOL, D...
    3                     [COHOES, CITY, SCHOOL, DISTRICT]
    4                     [COHOES, CITY, SCHOOL, DISTRICT]
    Name: AREA NAME, dtype: object
    
  • Replace: If we want to replace all the area names ending with DISTRICT with DIST, then the following command can be used:
    >>> df['AREA NAME'][0:5].str.replace(r'DISTRICT$', 'DIST', regex=True)
    
    0    RAVENA COEYMANS SELKIRK CENTRAL SCHOOL DIST
    1    RAVENA COEYMANS SELKIRK CENTRAL SCHOOL DIST
    2    RAVENA COEYMANS SELKIRK CENTRAL SCHOOL DIST
    3                        COHOES CITY SCHOOL DIST
    4                        COHOES CITY SCHOOL DIST
    Name: AREA NAME, dtype: object
    

    The first argument of the replace method is the regular expression used to identify the portion of the string to replace, and the second argument is the value to replace it with. The regex=True flag tells newer versions of pandas to treat the pattern as a regular expression rather than a literal string.

Merging data

To combine datasets, the concat function of pandas can be utilized. Let's take the AREA NAME and COUNTY columns with their first five rows:

>>> d[['AREA NAME', 'COUNTY']][0:5]

                                 AREA NAME            COUNTY
0  RAVENA COEYMANS SELKIRK CENTRAL SCHOOL DISTRICT    ALBANY
1  RAVENA COEYMANS SELKIRK CENTRAL SCHOOL DISTRICT    ALBANY
2  RAVENA COEYMANS SELKIRK CENTRAL SCHOOL DISTRICT    ALBANY
3                      COHOES CITY SCHOOL DISTRICT    ALBANY
4                      COHOES CITY SCHOOL DISTRICT    ALBANY

We can divide the data as follows:

>>> p1 = d[['AREA NAME', 'COUNTY']][0:2]
>>> p2 = d[['AREA NAME', 'COUNTY']][2:5]

The first two rows of the data are in p1 and the last three rows are in p2. These pieces can be combined using the concat() function:

>>> pd.concat([p1,p2])

                                 AREA NAME            COUNTY
0  RAVENA COEYMANS SELKIRK CENTRAL SCHOOL DISTRICT    ALBANY
1  RAVENA COEYMANS SELKIRK CENTRAL SCHOOL DISTRICT    ALBANY
2  RAVENA COEYMANS SELKIRK CENTRAL SCHOOL DISTRICT    ALBANY
3                      COHOES CITY SCHOOL DISTRICT    ALBANY
4                      COHOES CITY SCHOOL DISTRICT    ALBANY

The combined pieces can be identified by assigning a key:

>>> concatenated = pd.concat([p1,p2], keys = ['p1','p2'])
>>> concatenated

                                            AREA NAME  COUNTY
p1 0  RAVENA COEYMANS SELKIRK CENTRAL SCHOOL DISTRICT  ALBANY
   1  RAVENA COEYMANS SELKIRK CENTRAL SCHOOL DISTRICT  ALBANY
p2 2  RAVENA COEYMANS SELKIRK CENTRAL SCHOOL DISTRICT  ALBANY
   3                      COHOES CITY SCHOOL DISTRICT  ALBANY
   4                      COHOES CITY SCHOOL DISTRICT  ALBANY

Using the keys, the pieces can be extracted back from the concatenated data:

>>> concatenated.loc['p1']

                                        AREA NAME     COUNTY
0  RAVENA COEYMANS SELKIRK CENTRAL SCHOOL DISTRICT    ALBANY
1  RAVENA COEYMANS SELKIRK CENTRAL SCHOOL DISTRICT    ALBANY

Data operations

Once the missing data is handled, various operations can be performed on the data.

Aggregation operations

There are a number of aggregation operations, such as average, sum, and so on, that you might want to perform on a numerical field. The following methods are used to perform them:

  • Average: To find out the average number of obese students in ELEMENTARY schools, we'll first filter the ELEMENTARY data with the following command:
    >>> data = d[d['GRADE LEVEL'] == 'ELEMENTARY']
    

    Now, we'll find the mean using the following command:

    >>> data['NO. OBESE'].mean()
    213.41593780369291
    

    The elementary grade level data is filtered and stored in the data object. The NO. OBESE column, which contains the number of obese students, is selected, and the average is computed with the mean() method.

  • SUM: To find out the total number of elementary students who are obese across all the schools, use the following command:
    >>> data['NO. OBESE'].sum()
    219605.0
    
  • MAX: To get the maximum number of students that are obese in an elementary school, use the following command:
    >>> data['NO. OBESE'].max()
    48843.0
    
  • MIN: To get the minimum number of students that are obese in an elementary school, use the following command:
    >>> data['NO. OBESE'].min()
    5.0
    
  • STD: To get the standard deviation of the number of obese students, use the following command:

    >>> data['NO. OBESE'].std()
    1690.3831128098113
    
  • COUNT: To count the total number of schools with the ELEMENTARY grade in the DELAWARE county, use the following command:
    >>> data = d[(d['GRADE LEVEL'] == 'ELEMENTARY') & (d['COUNTY'] == 'DELAWARE')]
    >>> data['COUNTY'].count()
    19
    

    The table is filtered for the ELEMENTARY grade and the DELAWARE county. Note that each condition is enclosed in parentheses. This ensures that the individual conditions are evaluated before the & operator is applied; without the parentheses, Python throws an error because & binds more tightly than comparison operators.
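
When several of these statistics are needed at once, the describe() method summarizes the count, mean, standard deviation, quartiles, and extremes in a single call:

>>> data['NO. OBESE'].describe()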

Joins

SQL-like joins can be performed on the DataFrame using pandas. Let's define a lookup DataFrame, which assigns levels to each of the grades using the following command:

>>> grade_lookup = {'GRADE LEVEL': pd.Series(['ELEMENTARY', 'MIDDLE/HIGH', 'MISC']),
               'LEVEL': pd.Series([1, 2, 3])}

>>> grade_lookup = pd.DataFrame(grade_lookup)

Let's take the first five rows of the GRADE LEVEL column as an example for performing the joins:

>>> df[['GRADE LEVEL']][0:5]
     GRADE LEVEL
0  DISTRICT TOTAL
1      ELEMENTARY
2     MIDDLE/HIGH
3  DISTRICT TOTAL
4      ELEMENTARY

The inner join

[Figure: a sample of an inner join]

An inner join can be performed with the following command:

>>> d_sub = df[0:5].join(grade_lookup.set_index(['GRADE LEVEL']), on=['GRADE LEVEL'], how='inner')
>>> d_sub[['GRADE LEVEL', 'LEVEL']]

  GRADE LEVEL  LEVEL
1   ELEMENTARY      1
4   ELEMENTARY      1
2  MIDDLE/HIGH      2

The join takes place with the join() method. The first argument takes the DataFrame on which the lookup takes place. Note that the grade_lookup DataFrame's index is being set by the set_index() method. This is essential for the join, as without it, the join method won't know which column of the DataFrame to join to.

The second argument takes a column of the df DataFrame to join the data on. The third argument defines the join as an inner join.

The left outer join

[Figure: a sample of a left outer join]

A left outer join can be performed with the following commands:

>>> d_sub = df[0:5].join(grade_lookup.set_index(['GRADE LEVEL']), on=['GRADE LEVEL'], how='left')
>>> d_sub[['GRADE LEVEL', 'LEVEL']]

      GRADE LEVEL  LEVEL
0  DISTRICT TOTAL    NaN
1      ELEMENTARY      1
2     MIDDLE/HIGH      2
3  DISTRICT TOTAL    NaN
4      ELEMENTARY      1

Notice that DISTRICT TOTAL has missing values in the LEVEL column, as the grade_lookup DataFrame does not have an entry for DISTRICT TOTAL.

The full outer join

[Figure: a sample of a full outer join]

The full outer join can be performed with the following commands:

>>> d_sub = df[0:5].join(grade_lookup.set_index(['GRADE LEVEL']), on=['GRADE LEVEL'], how='outer')
>>> d_sub[['GRADE LEVEL', 'LEVEL']]

     GRADE LEVEL  LEVEL
0  DISTRICT TOTAL    NaN
3  DISTRICT TOTAL    NaN
1      ELEMENTARY      1
4      ELEMENTARY      1
2     MIDDLE/HIGH      2
4            MISC      3

The groupby function

It's easy to do an SQL-like group by operation with pandas. Let's say you want to find the sum of the number of obese students in each of the grades; you can use the following command:

>>> d['NO. OBESE'].groupby(d['GRADE LEVEL']).sum()
GRADE LEVEL
DISTRICT TOTAL    127101
ELEMENTARY         72880
MIDDLE/HIGH        53089

This command selects the column with the number of obese students, groups the data by grade level using the groupby method, and finally sums up the numbers with the sum method. The same can be achieved with the following function too:

>>> d['NO. OBESE'].groupby(d['GRADE LEVEL']).aggregate(sum)

Here, the aggregate method is utilized. The sum function is passed to obtain the required results.

It's also possible to obtain multiple kinds of aggregations on the same metric. This can be achieved by the following command:

>>> d['NO. OBESE'].groupby(d['GRADE LEVEL']).aggregate([np.sum, np.mean, np.std])
                  sum        mean         std
GRADE LEVEL                                   
DISTRICT TOTAL  127101  128.384848  158.933263
ELEMENTARY       72880   76.958817  100.289578
MIDDLE/HIGH      53089   59.251116   65.905591
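
Grouping is not limited to a single key; passing a list of columns produces one group per combination of values (no output shown here, as it depends on the data):

>>> d.groupby(['GRADE LEVEL', 'COUNTY'])['NO. OBESE'].sum()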

Summary

In this chapter, we familiarized ourselves with the NumPy and pandas packages. We understood the different data structures in pandas and how to utilize them. We learned how to perform data cleansing and manipulation, in which we handled missing values and performed string operations. This chapter gives us a foundation for data science; from here, you can dive deeper into NumPy and pandas on your own.

In the next chapter, we'll learn about the meaning of inferential statistics and what they do, and also how to make sense of the different concepts in inferential statistics.


Description

Data science is a relatively new knowledge domain that is used by various organizations to make data-driven decisions. Data scientists have to wear various hats to work with data and derive value from it. The Python programming language, beyond having conquered the scientific community in the last decade, is now an indispensable tool for the data science practitioner and a must-know tool for every aspiring data scientist. Using Python will offer you a fast, reliable, cross-platform, and mature environment for data analysis, machine learning, and algorithmic problem solving. This comprehensive guide helps you move beyond the hype and transcend the theory by providing you with a hands-on, advanced study of data science. Beginning with the essentials of Python in data science, you will learn to manage data and perform linear algebra in Python. You will move on to deriving inferences from the analysis by performing inferential statistics, and mining data to reveal hidden patterns and trends. You will use the matplotlib library to create high-end visualizations in Python and uncover the fundamentals of machine learning. Next, you will apply the linear regression technique and also learn to apply the logistic regression technique to your applications, before creating recommendation engines with various collaborative filtering algorithms and improving your predictions by applying the ensemble methods. Finally, you will perform K-means clustering, along with an analysis of unstructured data with different text mining techniques, and leverage the power of Python in big data analytics.

Who is this book for?

If you are a Python developer who wants to master the world of data science, then this book is for you. Some knowledge of data science is assumed.

What you will learn

  • Manage data and perform linear algebra in Python
  • Derive inferences from the analysis by performing inferential statistics
  • Solve data science problems in Python
  • Create high-end visualizations using Python
  • Evaluate and apply the linear regression technique to estimate the relationships among variables
  • Build recommendation engines with the various collaborative filtering algorithms
  • Apply the ensemble methods to improve your predictions
  • Work with big data technologies to handle data at scale

Product Details

Publication date : Aug 31, 2015
Length: 294 pages
Edition : 1st
Language : English
ISBN-13 : 9781784390150


Table of Contents

1. Getting Started with Raw Data
2. Inferential Statistics
3. Finding a Needle in a Haystack
4. Making Sense of Data through Advanced Visualization
5. Uncovering Machine Learning
6. Performing Predictions with a Linear Regression
7. Estimating the Likelihood of Events
8. Generating Recommendations with Collaborative Filtering
9. Pushing Boundaries with Ensemble Models
10. Applying Segmentation with k-means Clustering
11. Analyzing Unstructured Data with Text Mining
12. Leveraging Python in the World of Big Data
Index
