Search icon
Subscription
0
Cart icon
Close icon
You have no products in your basket yet
Arrow left icon
All Products
Best Sellers
New Releases
Books
Videos
Audiobooks
Learning Hub
Newsletters
Free Learning
Arrow right icon
The Machine Learning Workshop - Second Edition

You're reading from  The Machine Learning Workshop - Second Edition

Product type Book
Published in Jul 2020
Publisher Packt
ISBN-13 9781839219061
Pages 286 pages
Edition 2nd Edition
Languages
Author (1):
Hyatt Saleh Hyatt Saleh
Profile icon Hyatt Saleh

3. Supervised Learning – Key Steps

Activity 3.01: Data Partitioning on a Handwritten Digit Dataset

Solution:

  1. Import all the required elements to split a dataset, as well as the load_digits function from scikit-learn to load the digits dataset. Use the following code to do so:
    from sklearn.datasets import load_digits
    import pandas as pd
    from sklearn.model_selection import train_test_split
    from sklearn.model_selection import KFold
  2. Load the digits dataset and create Pandas DataFrames containing the features and target matrices:
    digits = load_digits()
    X = pd.DataFrame(digits.data)
    Y = pd.DataFrame(digits.target)
    print(X.shape, Y.shape)

    The shape of your features and target matrices should be as follows, respectively:

    (1797, 64) (1797, 1)
  3. Perform the conventional split approach, using a split ratio of 60/20/20%.

    Using the train_test_split function, split the data into an initial train set and a test set:

    X_new, X_test, \
    Y_new, Y_test = train_test_split(X, Y, test_size...
lock icon The rest of the chapter is locked
arrow left Previous Chapter
Register for a free Packt account to unlock a world of extra content!
A free Packt account unlocks extra newsletters, articles, discounted offers, and much more. Start advancing your knowledge today.
Unlock this book and the full library FREE for 7 days
Get unlimited access to 7000+ expert-authored eBooks and videos courses covering every tech area you can think of
Renews at $15.99/month. Cancel anytime}