Search icon
Arrow left icon
All Products
Best Sellers
New Releases
Books
Videos
Audiobooks
Learning Hub
Newsletters
Free Learning
Arrow right icon
Machine Learning with Python

You're reading from  Machine Learning with Python

Product type Book
Published in Mar 2024
Publisher Packt
ISBN-13 9781835461969
Pages 146 pages
Edition 1st Edition
Languages
Author (1):
Oliver Theobald Oliver Theobald
Profile icon Oliver Theobald

Table of Contents (18) Chapters

FOREWORD
DATASETS USED IN THIS BOOK
INTRODUCTION
DEVELOPMENT ENVIRONMENT
MACHINE LEARNING LIBRARIES
EXPLORATORY DATA ANALYSIS
DATA SCRUBBING
PRE-MODEL ALGORITHMS
SPLIT VALIDATION
MODEL DESIGN
LINEAR REGRESSION
LOGISTIC REGRESSION
SUPPORT VECTOR MACHINES
k-NEAREST NEIGHBORS
TREE-BASED METHODS
NEXT STEPS
APPENDIX 1: INTRODUCTION TO PYTHON
APPENDIX 2: PRINT COLUMNS

SPLIT VALIDATION

 

A crucial part of machine learning is partitioning the data into two separate sets using a technique called split validation. The first set is called the training data and is used to build the prediction model. The second set is called the test data and is kept in reserve and used to assess the accuracy of the model developed from the training data. The training and test data is typically split 70/30 or 80/20 with the training data representing the larger portion. Once the model has been optimized and validated against the test data for accuracy, it’s ready to generate predictions using new input data.

Although the model is used on both the training and test sets, it’s from the training data alone that the model is built. The test data is used as input to form predictions and assess the model’s accuracy, but it is never decoded and should not be used to create the model. As the test data cannot be used to build and optimize the model, data...

lock icon The rest of the chapter is locked
You have been reading a chapter from
Machine Learning with Python
Published in: Mar 2024 Publisher: Packt ISBN-13: 9781835461969
Register for a free Packt account to unlock a world of extra content!
A free Packt account unlocks extra newsletters, articles, discounted offers, and much more. Start advancing your knowledge today.
Unlock this book and the full library FREE for 7 days
Get unlimited access to 7000+ expert-authored eBooks and videos courses covering every tech area you can think of
Renews at $15.99/month. Cancel anytime}