In recent times, machine learning (ML) and data science have gained popularity like never before, and the field is expected to grow exponentially in the coming years. First of all, what is machine learning? And why should someone take pains to understand its principles? Well, we have the answers for you. One simple example is book recommendations on e-commerce websites: when someone searches for a particular book, the site recommends other products that were frequently bought together, giving users an idea of what else they might like. Sounds like magic, right? In fact, utilizing machine learning, we can achieve much more than this.
Machine learning is a branch of study in which a model can learn automatically from experience based on data, without being explicitly programmed as in statistical models. Over a period of time and with more data, model predictions become better.
In this first chapter, we will introduce the basic concepts and terminology of both statistics and machine learning, to create a foundation for understanding the similarity between the two streams. It is aimed at readers who are either full-time statisticians or software engineers implementing machine learning and who would like to understand the statistical workings behind ML methods. We will quickly cover the fundamentals necessary for understanding the building blocks of models.
In this chapter, we will cover the following:
Statistics is the branch of mathematics dealing with the collection, analysis, interpretation, presentation, and organization of numerical data.
Statistics is mainly classified into two subbranches: descriptive statistics and inferential statistics.
Statistical modeling is applying statistics on data to find underlying hidden relationships by analyzing the significance of the variables.
Machine learning is the branch of computer science that utilizes past experience to learn from and use its knowledge to make future decisions. Machine learning is at the intersection of computer science, engineering, and statistics. The goal of machine learning is to generalize a detectable pattern or to create an unknown rule from given examples. An overview of machine learning landscape is as follows:
Machine learning is broadly classified into three categories (supervised, unsupervised, and reinforcement learning); nonetheless, based on the situation, these categories can be combined to achieve the desired results for particular applications:
In some cases, when the number of variables is very high, we initially perform unsupervised learning to reduce the dimensionality, followed by supervised learning. Similarly, in some artificial intelligence applications, supervised learning combined with reinforcement learning could be utilized for solving a problem; an example is self-driving cars in which, initially, images are converted to some numeric format using supervised learning and combined with driving actions (left, forward, right, and backward).
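As a minimal sketch of the first combination (using scikit-learn's built-in digits dataset and illustrative parameter values, not an example from this text), unsupervised dimensionality reduction can precede a supervised classifier:

from sklearn.datasets import load_digits
from sklearn.decomposition import PCA
from sklearn.linear_model import LogisticRegression
from sklearn.pipeline import Pipeline

digits = load_digits()                            # 64 input variables per observation
pipe = Pipeline([
    ('pca', PCA(n_components=10)),                # unsupervised: reduce 64 dimensions to 10
    ('clf', LogisticRegression(max_iter=1000))    # supervised: classify in the reduced space
])
pipe.fit(digits.data, digits.target)
print("Training accuracy:", round(pipe.score(digits.data, digits.target), 3))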
Though there are inherent similarities between statistical modeling and machine learning methodologies, the relationship is not always apparent to many practitioners. In the following table, we explain the differences succinctly to show the ways in which both streams are similar and where they differ:
| Statistical modeling | Machine learning |
| --- | --- |
| Formalization of relationships between variables in the form of mathematical equations. | Algorithm that can learn from the data without relying on rule-based programming. |
| Required to assume the shape of the model curve prior to fitting the model on the data (for example, linear, polynomial, and so on). | Does not need to assume the underlying shape, as machine learning algorithms can learn complex patterns automatically from the provided data. |
| A statistical model predicts the output with 85 percent accuracy and with 90 percent confidence about it. | A machine learning model just predicts the output with 85 percent accuracy. |
| In statistical modeling, various diagnostics of parameters are performed, such as p-values. | Machine learning models do not perform statistical significance tests on parameters. |
| Data is split 70 percent - 30 percent into training and testing data. The model is developed on training data and tested on testing data. | Data is split 50 percent - 25 percent - 25 percent into training, validation, and testing data. Models are developed on training data, hyperparameters are tuned on validation data, and the final model is evaluated against test data. |
| Statistical models can be developed on a single dataset, called training data, as diagnostics are performed at both the overall accuracy and individual variable level. | Due to the lack of diagnostics on variables, machine learning algorithms need to be trained on two datasets, called training and validation data, to ensure two-point validation. |
| Statistical modeling is mostly used for research purposes. | Machine learning is very apt for implementation in a production environment. |
| From the school of statistics and mathematics. | From the school of computer science. |
The development and deployment of machine learning models involves a series of steps similar to the statistical modeling process, in order to develop, validate, and implement the models. The steps are as follows:
Statistics itself is a vast subject on which a complete book could be written; however, here the attempt is to focus on key concepts that are essential from a machine learning perspective. In this section, a few fundamentals are covered; the remaining concepts will be covered in later chapters wherever they are necessary for understanding the statistical equivalents of machine learning.
Predictive analytics depends on one major assumption: that history repeats itself!
By fitting a predictive model on historical data after validating key measures, the same model will be utilized for predicting future events based on the same explanatory variables that were significant on past data.
The banking and pharmaceutical industries were the first movers in implementing statistical models; over a period of time, analytics expanded to other industries as well.
Statistical models are a class of mathematical models that are usually specified by mathematical equations that relate one or more variables to approximate reality. Assumptions embodied by statistical models describe a set of probability distributions, which distinguishes them from non-statistical, mathematical, or machine learning models.
Statistical models always start with some underlying assumptions that all the variables should satisfy; only then is the performance provided by the model statistically significant. Hence, knowing the various bits and pieces involved in all the building blocks provides a strong foundation for being a successful statistician.
In the following sections, we describe the various fundamentals with relevant code:
Usually, it is expensive to perform an analysis on an entire population; hence, most statistical methods are about drawing conclusions about a population by analyzing a sample.
The Python code for the calculation of mean, median, and mode using a numpy array and the stats package is as follows:
>>> import numpy as np
>>> from scipy import stats
>>> data = np.array([4,5,1,2,7,2,6,9,3])
# Calculate Mean
>>> dt_mean = np.mean(data) ; print ("Mean :",round(dt_mean,2))
# Calculate Median
>>> dt_median = np.median(data) ; print ("Median :",dt_median)
# Calculate Mode
>>> dt_mode = stats.mode(data); print ("Mode :",dt_mode[0][0])
The output of the preceding code is as follows:
We have used a NumPy array instead of a basic list as the data structure; the reason behind using this is the scikit-learn
package built on top of NumPy array in which all statistical models and machine learning algorithms have been built on NumPy array itself. The mode
function is not implemented in the numpy
package, hence we have used SciPy's stats
package. SciPy is also built on top of NumPy arrays.
The R code for descriptive statistics (mean, median, and mode) is given as follows:
data <- c(4,5,1,2,7,2,6,9,3)
dt_mean = mean(data) ; print(round(dt_mean,2))
dt_median = median (data); print (dt_median)
func_mode <- function (input_dt) {
  unq <- unique(input_dt)
  unq[which.max(tabulate(match(input_dt,unq)))]
}
dt_mode = func_mode (data); print (dt_mode)
We have used the default stats package for R; however, the mode function is not built in, hence we have written custom code for calculating the mode.
The Python code is as follows:
>>> from statistics import variance, stdev
>>> game_points = np.array([35,56,43,59,63,79,35,41,64,43,93,60,77,24,82])
# Calculate Variance
>>> dt_var = variance(game_points) ; print ("Sample variance:", round(dt_var,2))
# Calculate Standard Deviation
>>> dt_std = stdev(game_points) ; print ("Sample std.dev:", round(dt_std,2))
# Calculate Range
>>> dt_rng = np.max(game_points,axis=0) - np.min(game_points,axis=0) ; print ("Range:",dt_rng)
#Calculate percentiles
>>> print ("Quantiles:")
>>> for val in [20,80,100]:
...     dt_qntls = np.percentile(game_points,val)
...     print (str(val)+"%" ,dt_qntls)
# Calculate IQR
>>> q75, q25 = np.percentile(game_points, [75 ,25]); print ("Inter quartile range:",q75-q25)
The output of the preceding code is as follows:
The R code for dispersion (variance, standard deviation, range, quantiles, and IQR) is as follows:
game_points <- c(35,56,43,59,63,79,35,41,64,43,93,60,77,24,82)
dt_var = var(game_points); print(round(dt_var,2))
dt_std = sd(game_points); print(round(dt_std,2))
range_val<-function(x) return(diff(range(x)))
dt_range = range_val(game_points); print(dt_range)
dt_quantile = quantile(game_points,probs = c(0.2,0.8,1.0)); print(dt_quantile)
dt_iqr = IQR(game_points); print(dt_iqr)
The steps involved in hypothesis testing are as follows:
The null hypothesis is that μ ≥ 1000 (the mean weight of all chocolates is at least 1,000 g).
Collected sample:
Calculate test statistic:
t = (990 - 1000) / (12.5 / sqrt(30)) = -4.3818
Critical t value from the t table = t0.05,29 = 1.699 => -t0.05,29 = -1.699
p-value = 7.03e-05
Test statistic is -4.3818, which is less than the critical value of -1.699. Hence, we can reject the null hypothesis (your friend's claim) that the mean weight of a chocolate is above 1,000 g.
Also, another way of deciding on the claim is by using the p-value. A p-value less than 0.05 means the observed sample mean is significantly different from the claimed value, hence we can reject the null hypothesis:
The Python code is as follows:
>>> from scipy import stats
>>> xbar = 990; mu0 = 1000; s = 12.5; n = 30
# Test Statistic
>>> t_smple = (xbar-mu0)/(s/np.sqrt(float(n))); print ("Test Statistic:",round(t_smple,2))
# Critical value from t-table
>>> alpha = 0.05
>>> t_alpha = stats.t.ppf(alpha,n-1); print ("Critical value from t-table:",round(t_alpha,3))
#Lower tail p-value from t-table
>>> p_val = stats.t.sf(np.abs(t_smple), n-1); print ("Lower tail p-value from t-table", p_val)
The R code for T-distribution is as follows:
xbar = 990; mu0 = 1000; s = 12.5 ; n = 30
t_smple = (xbar - mu0)/(s/sqrt(n));print (round(t_smple,2))
alpha = 0.05
t_alpha = qt(alpha,df= n-1);print (round(t_alpha,3))
p_val = pt(t_smple,df = n-1);print (p_val)
Example: Assume that the test scores of an entrance exam fit a normal distribution. Furthermore, the mean test score is 52 and the standard deviation is 16.3. What is the percentage of students scoring 67 or more in the exam?
The Python code is as follows:
>>> from scipy import stats
>>> xbar = 67; mu0 = 52; s = 16.3
# Calculating z-score
>>> z = (xbar - mu0)/s
# Calculating probability under the curve
>>> p_val = 1- stats.norm.cdf(z)
>>> print ("Prob. to score more than 67 is ",round(p_val*100,2),"%")
The R code for normal distribution is as follows:
xbar = 67; mu0 = 52; s = 16.3
pr = 1- pnorm(67, mean=52, sd=16.3)
print(paste("Prob. to score more than 67 is ",round(pr*100,2),"%"))
The test is usually performed by calculating χ² from the data and comparing it with the critical χ² value with (m-1)(n-1) degrees of freedom from the table; if the calculated value is higher than the table value, the two variables are judged to be dependent:
Example: In the following table, calculate whether the smoking habit has an impact on exercise behavior:
The Python code is as follows:
>>> import pandas as pd
>>> from scipy import stats
>>> survey = pd.read_csv("survey.csv")
# Tabulating 2 variables with row & column variables respectively
>>> survey_tab = pd.crosstab(survey.Smoke, survey.Exer, margins = True)
While creating a table using the crosstab function, we also obtain the row and column totals as extra fields. However, in order to create the observed table, we need to extract just the variables part and ignore the totals:
# Creating observed table for analysis
>>> observed = survey_tab.iloc[0:4,0:3]
The chi2_contingency function in the stats package uses the observed table and subsequently calculates its expected table, followed by calculating the p-value in order to check whether the two variables are dependent or not. If the p-value is less than 0.05, there is a strong dependency between the two variables, whereas if it is greater than 0.05, there is no significant dependency between the variables:
>>> contg = stats.chi2_contingency(observed= observed)
>>> p_value = round(contg[1],3)
>>> print ("P-value is: ",p_value)
The p-value is 0.483, which means there is no significant dependency between the smoking habit and exercise behavior.
The R code for chi-square is as follows:
survey = read.csv("survey.csv",header=TRUE)
tbl = table(survey$Smoke,survey$Exer)
p_val = chisq.test(tbl)
print(p_val)
Example: A fertilizer company developed three new types of universal fertilizers after research that can be utilized to grow any type of crop. In order to find out whether all three have a similar crop yield, they randomly chose six crop types in the study. In accordance with the randomized block design, each crop type will be tested with all three types of fertilizer separately. The following table represents the yield in g/m². At the 0.05 level of significance, test whether the mean yields for the three new types of fertilizers are all equal:
| Fertilizer 1 | Fertilizer 2 | Fertilizer 3 |
| --- | --- | --- |
| 62 | 54 | 48 |
| 62 | 56 | 62 |
| 90 | 58 | 92 |
| 42 | 36 | 96 |
| 84 | 72 | 92 |
| 64 | 34 | 80 |
The Python code is as follows:
>>> import pandas as pd
>>> from scipy import stats
>>> fetilizers = pd.read_csv("fetilizers.csv")
Calculating the one-way ANOVA using the stats package:
>>> one_way_anova = stats.f_oneway(fetilizers["fertilizer1"], fetilizers["fertilizer2"], fetilizers["fertilizer3"])
>>> print ("Statistic :", round(one_way_anova[0],2),", p-value :",round(one_way_anova[1],3))
Result: The p-value came out less than 0.05, hence we can reject the null hypothesis that the mean crop yields of the fertilizers are equal. Fertilizers make a significant difference to crop yield.
The R code for ANOVA is as follows:
fetilizers = read.csv("fetilizers.csv",header=TRUE)
r = c(t(as.matrix(fetilizers)))
f = c("fertilizer1","fertilizer2","fertilizer3")
k = 3; n = 6
tm = gl(k,1,n*k,factor(f))
blk = gl(n,k,k*n)
av = aov(r ~ tm + blk)
smry = summary(av)
print(smry)
Some terms used in a confusion matrix are:
Precision = TP / (TP + FP)
Recall (sensitivity) = TP / (TP + FN)
Specificity = TN / (TN + FP)
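As a quick illustration of these formulas, the following sketch computes them from hypothetical confusion matrix counts (the numbers are made up for demonstration):

TP, FP, FN, TN = 45, 5, 10, 40   # hypothetical confusion matrix counts

precision   = TP / (TP + FP)     # of the predicted positives, how many are truly positive
recall      = TP / (TP + FN)     # of the actual positives, how many were caught (sensitivity)
specificity = TN / (TN + FP)     # of the actual negatives, how many were correctly rejected

print("Precision:", round(precision, 3))
print("Recall:", round(recall, 3))
print("Specificity:", round(specificity, 3))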
Area under curve is utilized for setting the threshold of cut-off probability to classify the predicted probability into various classes; we will be covering how this method works in upcoming chapters.
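As a minimal sketch of computing the area under the curve with scikit-learn's roc_auc_score (the labels and predicted probabilities here are made up):

from sklearn.metrics import roc_auc_score

y_true = [0, 0, 1, 1, 0, 1, 1, 0]                    # hypothetical actual classes
y_prob = [0.1, 0.4, 0.35, 0.8, 0.2, 0.9, 0.65, 0.3]  # hypothetical predicted probabilities
print("AUC:", round(roc_auc_score(y_true, y_prob), 3))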
In order to answer this question, a probability of default model (or behavioral scorecard, in technical terms) needs to be developed using independent variables from the past 24 months and a dependent variable from the next 12 months. After preparing the data with X and Y variables, it is split randomly 70 percent - 30 percent into train and test data; this method is called in-time validation, as both train and test samples are from the same time period:
Adjusted R² = 1 - [(1 - R²)(n - 1) / (n - k - 1)]. Here, R² = sample R-squared value, n = sample size, and k = number of predictors (or variables).
The adjusted R-squared value is the key metric for evaluating the quality of linear regressions. Any linear regression model having an adjusted R² >= 0.7 is considered good enough to implement.
Example: The R-squared value of a sample is 0.5, the sample size is 50, and there are 10 independent variables. The calculated adjusted R-squared is 1 - (1 - 0.5) × 49 / 39 ≈ 0.372.
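The following snippet verifies the worked example in code:

R2, n, k = 0.5, 50, 10
adj_R2 = 1 - (1 - R2) * (n - 1) / (n - k - 1)
print("Adjusted R-squared:", round(adj_R2, 4))   # prints 0.3718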
AIC = -2 × ln(L) + 2k. Here, L = the maximized value of the likelihood function and k = number of predictors or variables.
The idea of AIC is to penalize the objective function if extra variables without strong predictive abilities are included in the model. This is a kind of regularization in logistic regression.
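A minimal sketch of how the penalty works (the log-likelihood values are hypothetical): a larger model with only a marginally better likelihood ends up with a worse (higher) AIC:

def aic(log_likelihood, k):
    # AIC = -2*ln(L) + 2k; lower is better, and extra variables are penalized via k
    return -2 * log_likelihood + 2 * k

print(aic(log_likelihood=-120.0, k=5))   # 250.0
print(aic(log_likelihood=-119.5, k=8))   # 255.0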
Entropy = -Σ p_i log2(p_i), where the sum runs over the n classes; here, n = number of classes. Entropy is maximal at the middle, with the value of 1, and minimal at the extremes, with the value of 0. A low value of entropy is desirable, as it will segregate classes better:
Example: Given two types of coin in which the first one is a fair one (1/2 head and 1/2 tail probabilities) and the other is a biased one (1/3 head and 2/3 tail probabilities), calculate the entropy for both and justify which one is better with respect to modeling:
From both values (1.0 for the fair coin and approximately 0.918 for the biased coin), the decision tree algorithm chooses the biased coin rather than the fair coin as an observation splitter, due to the fact that its entropy value is lower.
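The following sketch reproduces the two entropy values:

import math

def entropy(probs):
    # Shannon entropy in bits: -sum(p * log2(p))
    return -sum(p * math.log2(p) for p in probs if p > 0)

print("Fair coin entropy:", round(entropy([1/2, 1/2]), 4))     # 1.0
print("Biased coin entropy:", round(entropy([1/3, 2/3]), 4))   # 0.9183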
Information gain = entropy of parent - Σ (weighted % × entropy of child)
Weighted % = number of observations in a particular child / sum of observations in all child nodes
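As a worked illustration of the information gain formula, with hypothetical split counts:

import math

def entropy(probs):
    return -sum(p * math.log2(p) for p in probs if p > 0)

# Parent node: 10 observations, 5 per class; split into children of 6 (5 vs 1) and 4 (0 vs 4)
parent = entropy([5/10, 5/10])
child1 = entropy([5/6, 1/6])
child2 = entropy([0/4, 4/4])
info_gain = parent - (6/10 * child1 + 4/10 * child2)
print("Information gain:", round(info_gain, 4))   # about 0.61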
Gini impurity = 1 - Σ p_i², where the sum runs over the i classes; here, i = number of classes. The similarity between Gini and entropy is shown as follows:
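The similarity can also be verified numerically with a small sketch that evaluates both measures across class probabilities; both peak at p = 0.5 and shrink toward the extremes:

import math

def entropy(probs):
    return -sum(p * math.log2(p) for p in probs if p > 0)

def gini(probs):
    # Gini impurity: 1 - sum(p_i^2)
    return 1 - sum(p ** 2 for p in probs)

for p in [0.1, 0.3, 0.5, 0.7, 0.9]:
    print("p =", p, "Gini:", round(gini([p, 1 - p]), 3), "Entropy:", round(entropy([p, 1 - p]), 3))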
Every model has both bias and variance error components in addition to white noise. Bias and variance are inversely related to each other; while trying to reduce one component, the other component of the model will increase. The true art lies in creating a good fit by balancing both. The ideal model will have both low bias and low variance.
Errors from the bias component come from erroneous assumptions in the underlying learning algorithm. High bias can cause an algorithm to miss the relevant relations between features and target outputs; this phenomenon causes an underfitting problem.
On the other hand, errors from the variance component come from the model's sensitivity to even small changes in the training data; high variance can cause an overfitting problem:
An example of a high bias model is logistic or linear regression, in which the fit of the model is merely a straight line; it may have a high error component because a linear model cannot approximate the underlying data well.
An example of a high variance model is a decision tree, in which the model may create an overly wiggly curve as a fit, and even a small change in the training data will cause a drastic change in the fit of the curve.
At the moment, state-of-the-art models utilize high variance models such as decision trees and perform ensembling on top of them to reduce the errors caused by high variance, while at the same time not compromising on increased errors due to the bias component. The best example of this category is random forest, in which many decision trees are grown independently and ensembled in order to come up with the best fit; we will cover this in upcoming chapters:
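A minimal sketch of this variance reduction effect (on a synthetic scikit-learn dataset, with illustrative parameters) compares a single deep decision tree against a random forest of 100 trees:

from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeClassifier

X, y = make_classification(n_samples=500, n_features=20, random_state=42)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=42)

tree = DecisionTreeClassifier(random_state=42).fit(X_train, y_train)
forest = RandomForestClassifier(n_estimators=100, random_state=42).fit(X_train, y_train)

print("Single tree test accuracy:", round(tree.score(X_test, y_test), 3))
print("Random forest test accuracy:", round(forest.score(X_test, y_test), 3))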
In practice, data is usually split randomly 70-30 or 80-20 into train and test datasets in statistical modeling, where the training data is utilized for building the model and its effectiveness is checked on the test data:
In the following code, we split the original data into train and test data by 70 percent - 30 percent. An important point to consider here is that we set the seed value for the random number generator in order to obtain the same observations in the training and testing data every time the sampling is repeated. Repeatability is very much needed in order to reproduce the results:
# Train & Test split
>>> import pandas as pd
>>> from sklearn.model_selection import train_test_split
>>> original_data = pd.read_csv("mtcars.csv")
In the following code, train_size is 0.7, which means 70 percent of the data is put into the training dataset and the remaining 30 percent into the testing dataset. The random state is the seed used for pseudo-random number generation, which makes the results reproducible by producing exactly the same observations in each split every time the code is run:
>>> train_data,test_data = train_test_split(original_data,train_size = 0.7,random_state=42)
The R code for the train and test split for statistical modeling is as follows:
full_data = read.csv("mtcars.csv",header=TRUE)
set.seed(123)
numrow = nrow(full_data)
trnind = sample(1:numrow,size = as.integer(0.7*numrow))
train_data = full_data[trnind,]
test_data = full_data[-trnind,]
There seems to be an analogy between statistical modeling and machine learning that we will cover in subsequent chapters in depth. However, a quick view is provided as follows: in statistical modeling, linear regression with two independent variables tries to fit the best plane with the least errors, whereas in machine learning the same problem is posed as minimizing a sum of squared error terms (squaring ensures the function becomes convex, which enables faster convergence and also guarantees a global optimum) and is optimized over the coefficient values rather than the independent variables:
Machine learning utilizes optimization for tuning all the parameters of various algorithms. Hence, it is a good idea to know some basics about optimization.
Before stepping into gradient descent, an introduction to convex and non-convex functions is very helpful. Convex functions are functions in which a line segment drawn between any two random points on the function lies on or above the function, whereas this isn't true for non-convex functions. It is important to know whether the function is convex or non-convex because, for convex functions, any local optimum is also the global optimum, whereas for non-convex functions, a local optimum does not guarantee the global optimum:
Does it seem like a tough problem? One workaround is to initiate the search process from several different random locations; the search will then usually converge to the global optimum:
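A minimal sketch of this multi-start idea, using scipy.optimize.minimize on a made-up non-convex function:

import numpy as np
from scipy.optimize import minimize

f = lambda x: np.sin(3 * x[0]) + 0.1 * x[0] ** 2    # non-convex, several local minima

# Run a local optimizer from several random starting points and keep the best result
rng = np.random.RandomState(0)
results = [minimize(f, x0=rng.uniform(-5, 5, size=1)) for _ in range(10)]
best = min(results, key=lambda r: r.fun)
print("Best x:", best.x[0], "f(x):", best.fun)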
In the following code, a comparison has been made between applying linear regression in a statistical way and gradient descent in a machine learning way on the same dataset:
>>> import numpy as np
>>> import pandas as pd
The following code describes reading data using a pandas DataFrame:
>>> train_data = pd.read_csv("mtcars.csv")
Converting the DataFrame variables into NumPy arrays in order to process them in scikit-learn, as scikit-learn itself is built on NumPy arrays:
>>> X = np.array(train_data["hp"]) ; y = np.array(train_data["mpg"])
>>> X = X.reshape(32,1); y = y.reshape(32,1)
Importing linear regression from the scikit-learn package; this works on the least squares method:
>>> from sklearn.linear_model import LinearRegression
>>> model = LinearRegression(fit_intercept = True)
Fitting a linear regression model on the data and displaying the intercept and the coefficient of the single variable (the hp variable):
>>> model.fit(X,y)
>>> print ("Linear Regression Results" )
>>> print ("Intercept",model.intercept_[0] ,"Coefficient", model.coef_[0])
Now we will apply gradient descent from scratch; in future chapters, we will use the scikit-learn built-in modules rather than working from first principles. Here, however, an illustration is provided of the internal workings of the optimization method on which the whole of machine learning is built.
Defining the gradient descent function gradient_descent with the following parameters:

x: Independent variable.
y: Dependent variable.
learn_rate: Learning rate with which gradients are updated; too low causes slower convergence and too high causes overshooting of the gradients.
conv_threshold: Convergence threshold on the change in error; iterations stop once the improvement falls below it.
batch_size: Number of observations considered at each iteration for updating the gradients; a high number causes a lower number of iterations and a lower number causes an erratic decrease in errors. Ideally, the batch size should be a minimum of 30 due to statistical significance. However, various settings need to be tried to check which one is better.
max_iter: Maximum number of iterations, beyond which the algorithm gets auto-terminated:

>>> def gradient_descent(x, y, learn_rate, conv_threshold, batch_size, max_iter):
... converged = False
... iter = 0
... m = batch_size
... t0 = np.random.random(x.shape[1])
... t1 = np.random.random(x.shape[1])
Mean square error calculation: squaring of the error has been performed to create the convex function, which has nice convergence properties:

... MSE = (sum([(t0 + t1*x[i] - y[i])**2 for i in range(m)])/ m)
The following code runs the algorithm until it meets the convergence criteria:
... while not converged:
... grad0 = 1.0/m * sum([(t0 + t1*x[i] - y[i]) for i in range(m)])
... grad1 = 1.0/m * sum([(t0 + t1*x[i] - y[i])*x[i] for i in range(m)])
... temp0 = t0 - learn_rate * grad0
... temp1 = t1 - learn_rate * grad1
... t0 = temp0
... t1 = temp1
Calculate the new error with the updated parameters, in order to check whether the change from the previous error is within the predefined convergence threshold; if it is, stop the iterations and return the parameters:
... MSE_New = (sum( [ (t0 + t1*x[i] - y[i])**2 for i in range(m)] ) / m)
... if abs(MSE - MSE_New ) <= conv_threshold:
...             print ('Converged, iterations: ', iter)
... converged = True
... MSE = MSE_New
... iter += 1
... if iter == max_iter:
...             print ('Maximum iterations reached')
... converged = True
... return t0,t1
The following code describes running the gradient descent function with the defined values: learning rate = 0.00003, convergence threshold = 1e-8, batch size = 32, and maximum number of iterations = 1500000:
>>> if __name__ == '__main__':
... Inter, Coeff = gradient_descent(x = X,y = y,learn_rate=0.00003 , conv_threshold = 1e-8, batch_size=32,max_iter=1500000)
... print ('Gradient Descent Results')
... print (('Intercept = %s Coefficient = %s') %(Inter, Coeff))
The R code for linear regression versus gradient descent is as follows:
# Linear Regression
train_data = read.csv("mtcars.csv",header=TRUE)
model <- lm(mpg ~ hp, data = train_data)
print (coef(model))

# Gradient descent
gradDesc <- function(x, y, learn_rate, conv_threshold, batch_size, max_iter) {
  m <- runif(1, 0, 1)
  c <- runif(1, 0, 1)
  ypred <- m * x + c
  MSE <- sum((y - ypred) ^ 2) / batch_size
  converged = F
  iterations = 0
  while (converged == F) {
    m_new <- m - learn_rate * ((1 / batch_size) * (sum((ypred - y) * x)))
    c_new <- c - learn_rate * ((1 / batch_size) * (sum(ypred - y)))
    m <- m_new
    c <- c_new
    ypred <- m * x + c
    MSE_new <- sum((y - ypred) ^ 2) / batch_size
    if (MSE - MSE_new <= conv_threshold) {
      converged = T
      return(paste("Iterations:", iterations, "Optimal intercept:", c, "Optimal slope:", m))
    }
    iterations = iterations + 1
    if (iterations > max_iter) {
      converged = T
      return(paste("Iterations:", iterations, "Optimal intercept:", c, "Optimal slope:", m))
    }
    MSE = MSE_new
  }
}

gradDesc(x = train_data$hp, y = train_data$mpg, learn_rate = 0.00003,
         conv_threshold = 1e-8, batch_size = 32, max_iter = 1500000)
The loss function or cost function in machine learning is a function that maps the values of variables onto a real number intuitively representing some cost associated with the variable values. Optimization methods are applied to minimize the loss function by changing the parameter values, which is the central theme of machine learning.
Zero-one loss is L0-1 = 1 if m < 0 and 0 if m >= 0, where m is the margin. The difficult part of this loss is that it is neither differentiable nor convex, and optimizing it directly is NP-hard. Hence, in order to make optimization feasible and solvable, it is replaced by different surrogate losses for different problems.
Surrogate losses are used in machine learning in place of the zero-one loss; since the zero-one loss is not differentiable, these approximated losses, chosen per problem, are used instead.
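As an illustration, the following sketch evaluates each loss as a function of the margin m = y * f(x) (hinge, logistic, and squared losses are common surrogate choices; the margin grid is made up):

import numpy as np

m = np.linspace(-2, 2, 9)             # margin values

zero_one = (m < 0).astype(float)      # 1 for misclassification, 0 otherwise
hinge    = np.maximum(0, 1 - m)       # used by SVMs
logistic = np.log2(1 + np.exp(-m))    # used by logistic regression
squared  = (1 - m) ** 2               # used by least squares classification

for name, loss in [("zero-one", zero_one), ("hinge", hinge),
                   ("logistic", logistic), ("squared", squared)]:
    print(name, np.round(loss, 2))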
When to stop tuning the hyperparameters of a machine learning model is a million-dollar question. This problem can mostly be solved by keeping tabs on the training and testing errors. While increasing the complexity of a model, the following stages occur:
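Broadly, the model first underfits, then fits well, then overfits. A minimal sketch of these stages (on a synthetic dataset with illustrative parameters): as tree depth grows, the training error keeps falling while the test error eventually starts rising, signalling overfitting:

from sklearn.datasets import make_classification
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeClassifier

X, y = make_classification(n_samples=400, n_features=15, random_state=7)
X_tr, X_ts, y_tr, y_ts = train_test_split(X, y, random_state=7)

for depth in [1, 2, 4, 8, 16]:
    model = DecisionTreeClassifier(max_depth=depth, random_state=7).fit(X_tr, y_tr)
    print("depth", depth,
          "train error", round(1 - model.score(X_tr, y_tr), 3),
          "test error", round(1 - model.score(X_ts, y_ts), 3))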
Cross-validation is not popular in the statistical modeling world for many reasons: statistical models are linear in nature and robust, and do not have a high variance/overfitting problem, so the model fit remains much the same on train and test data, which does not hold true in the machine learning world. Also, in statistical modeling, many tests are performed at the individual parameter level apart from the aggregated metrics, whereas in machine learning we do not have visibility at the individual parameter level:
In the following code, both the R and Python implementations are provided. If no percentages are provided, the default split is 50 percent for train data, 25 percent for validation data, and 25 percent for the remaining test data.
The Python implementation has only a single train and test split function, hence we use it twice, and we use the number of observations rather than a percentage to perform the split (unlike in the previous train and test split example). Hence, a customized function is needed to split the data into three datasets:
>>> import pandas as pd
>>> from sklearn.model_selection import train_test_split
>>> original_data = pd.read_csv("mtcars.csv")
>>> def data_split(dat,trf = 0.5,vlf=0.25,tsf = 0.25):
... nrows = dat.shape[0]
... trnr = int(nrows*trf)
... vlnr = int(nrows*vlf)
The following Python code splits the data into training and the remaining data. The remaining data will be further split into validation and test datasets:
... tr_data,rmng = train_test_split(dat,train_size = trnr,random_state=42)
... vl_data, ts_data = train_test_split(rmng,train_size = vlnr,random_state=45)
... return (tr_data,vl_data,ts_data)
Implementation of the split function on the original data to create three datasets (by 50 percent, 25 percent, and 25 percent splits) is as follows:
>>> train_data, validation_data, test_data = data_split(original_data, trf=0.5, vlf=0.25, tsf=0.25)
The R code for the train, validation, and test split is as follows:
# Train Validation & Test samples
trvaltest <- function(dat,prop = c(0.5,0.25,0.25)){
nrw = nrow(dat)
trnr = as.integer(nrw *prop[1])
vlnr = as.integer(nrw*prop[2])
set.seed(123)
trni = sample(1:nrow(dat),trnr)
trndata = dat[trni,]
rmng = dat[-trni,]
vlni = sample(1:nrow(rmng),vlnr)
valdata = rmng[vlni,]
tstdata = rmng[-vlni,]
mylist = list("trn" = trndata,"val"= valdata,"tst" = tstdata)
return(mylist)
}
outdata = trvaltest(mtcars,prop = c(0.5,0.25,0.25))
train_data = outdata$trn; valid_data = outdata$val; test_data = outdata$tst
Cross-validation is another way of ensuring robustness in the model at the expense of computation. In the ordinary modeling methodology, a model is developed on train data and evaluated on test data. In some extreme cases, train and test might not have been homogeneously selected and some unseen extreme cases might appear in the test data, which will drag down the performance of the model.
In the cross-validation methodology, on the other hand, the data is divided into equal parts, and training is performed on all parts of the data except one, on which the performance is evaluated. This process is repeated as many times as there are parts.
Example: In five-fold cross-validation, data will be divided into five parts, subsequently trained on four parts of the data, and tested on the one part of the data. This process will run five times, in order to cover all points in the data. Finally, the error calculated will be the average of all the errors:
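A minimal sketch of five-fold cross-validation with scikit-learn (on the built-in iris dataset, for illustration):

from sklearn.datasets import load_iris
from sklearn.model_selection import cross_val_score
from sklearn.tree import DecisionTreeClassifier

iris = load_iris()
scores = cross_val_score(DecisionTreeClassifier(random_state=42),
                         iris.data, iris.target, cv=5)   # five equal folds
print("Fold accuracies:", scores.round(3))
print("Mean accuracy:", round(scores.mean(), 3))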
Grid search in machine learning is a popular way to tune the hyperparameters of the model in order to find the best combination for determining the best fit:
In the following code, implementation has been performed to determine whether a particular user will click an ad or not. Grid search has been implemented using a decision tree classifier for classification purposes. Tuning parameters are the depth of the tree, the minimum number of observations in terminal node, and the minimum number of observations required to perform the node split:
# Grid search
>>> import pandas as pd
>>> from sklearn.tree import DecisionTreeClassifier
>>> from sklearn.model_selection import train_test_split
>>> from sklearn.metrics import classification_report,confusion_matrix,accuracy_score
>>> from sklearn.pipeline import Pipeline
>>> from sklearn.model_selection import GridSearchCV
>>> input_data = pd.read_csv("ad.csv",header=None)
>>> X_columns = set(input_data.columns.values)
>>> y = input_data[len(input_data.columns.values)-1]
>>> X_columns.remove(len(input_data.columns.values)-1)
>>> X = input_data[list(X_columns)]
Split the data into train and test sets:
>>> X_train, X_test,y_train,y_test = train_test_split(X,y,train_size = 0.7,random_state=33)
Create a pipeline wrapping the classifier, over which the grid search will explore parameter combinations:
>>> pipeline = Pipeline([
... ('clf', DecisionTreeClassifier(criterion='entropy')) ])
Combinations to explore are given as parameters in Python dictionary format:
>>> parameters = {
... 'clf__max_depth': (50,100,150),
... 'clf__min_samples_split': (2, 3),
... 'clf__min_samples_leaf': (1, 2, 3)}
The n_jobs field is for selecting the number of cores in a computer; -1 means it uses all the cores in the computer. The scoring methodology here is accuracy, and many other options can be chosen, such as precision, recall, and f1:
>>> grid_search = GridSearchCV(pipeline, parameters, n_jobs=-1, verbose=1, scoring='accuracy')
>>> grid_search.fit(X_train, y_train)
Predict using the best parameters of grid search:
>>> y_pred = grid_search.predict(X_test)
The output is as follows:
>>> print ('\n Best score: \n', grid_search.best_score_)
>>> print ('\n Best parameters set: \n')
>>> best_parameters = grid_search.best_estimator_.get_params()
>>> for param_name in sorted(parameters.keys()):
...     print ('\t%s: %r' % (param_name, best_parameters[param_name]))
>>> print ("\n Confusion Matrix on Test data \n",confusion_matrix(y_test,y_pred))
>>> print ("\n Test Accuracy \n",accuracy_score(y_test,y_pred))
>>> print ("\nPrecision Recall f1 table \n",classification_report(y_test, y_pred))
The R code for grid searches on decision trees is as follows:
# Grid Search on Decision Trees
library(rpart)
input_data = read.csv("ad.csv",header=FALSE)
input_data$V1559 = as.factor(input_data$V1559)
set.seed(123)
numrow = nrow(input_data)
trnind = sample(1:numrow,size = as.integer(0.7*numrow))
train_data = input_data[trnind,];test_data = input_data[-trnind,]
minspset = c(2,3);minobset = c(1,2,3)
initacc = 0
for (minsp in minspset){
for (minob in minobset){
tr_fit = rpart(V1559 ~.,data = train_data,method = "class",minsplit = minsp, minbucket = minob)
tr_predt = predict(tr_fit,newdata = train_data,type = "class")
tble = table(tr_predt,train_data$V1559)
acc = (tble[1,1]+tble[2,2])/sum(tble)
acc
if (acc > initacc){
tr_predtst = predict(tr_fit,newdata = test_data,type = "class")
tblet = table(test_data$V1559,tr_predtst)
acct = (tblet[1,1]+tblet[2,2])/sum(tblet)
acct
print(paste("Best Score"))
print( paste("Train Accuracy ",round(acc,3),"Test Accuracy",round(acct,3)))
print( paste(" Min split ",minsp," Min obs per node ",minob))
print(paste("Confusion matrix on test data"))
print(tblet)
precsn_0 = (tblet[1,1])/(tblet[1,1]+tblet[2,1])
precsn_1 = (tblet[2,2])/(tblet[1,2]+tblet[2,2])
print(paste("Precision_0: ",round(precsn_0,3),"Precision_1: ",round(precsn_1,3)))
rcall_0 = (tblet[1,1])/(tblet[1,1]+tblet[1,2])
rcall_1 = (tblet[2,2])/(tblet[2,1]+tblet[2,2])
print(paste("Recall_0: ",round(rcall_0,3),"Recall_1: ",round(rcall_1,3)))
initacc = acc
}
}
}
Machine learning models are classified mainly into supervised, unsupervised, and reinforcement learning methods. We will be covering detailed discussions about each technique in later chapters; here is a very basic summary of them:
In this chapter, we have gained a high-level view of the various basic building blocks and subcomponents involved in statistical modeling and machine learning, such as mean, variance, interquartile range, p-value, the bias versus variance trade-off, AIC, Gini, and area under the curve with respect to the statistics context, and cross-validation, gradient descent, and grid search concepts with respect to machine learning. We have explained all the concepts with the support of both Python and R code, using various libraries such as numpy, scipy, pandas, and scikit-learn in Python and the basic stats package in R. In the next chapter, we will learn to draw parallels between statistical models and machine learning models, using linear regression problems and ridge/lasso regression in machine learning, with both Python and R code.