Search icon
Arrow left icon
All Products
Best Sellers
New Releases
Books
Videos
Audiobooks
Learning Hub
Newsletters
Free Learning
Arrow right icon
Hands-On Data Analysis with Pandas - Second Edition

You're reading from  Hands-On Data Analysis with Pandas - Second Edition

Product type Book
Published in Apr 2021
Publisher Packt
ISBN-13 9781800563452
Pages 788 pages
Edition 2nd Edition
Languages
Concepts
Author (1):
Stefanie Molin Stefanie Molin
Profile icon Stefanie Molin

Table of Contents (21) Chapters

Preface Section 1: Getting Started with Pandas
Chapter 1: Introduction to Data Analysis Chapter 2: Working with Pandas DataFrames Section 2: Using Pandas for Data Analysis
Chapter 3: Data Wrangling with Pandas Chapter 4: Aggregating Pandas DataFrames Chapter 5: Visualizing Data with Pandas and Matplotlib Chapter 6: Plotting with Seaborn and Customization Techniques Section 3: Applications – Real-World Analyses Using Pandas
Chapter 7: Financial Analysis – Bitcoin and the Stock Market Chapter 8: Rule-Based Anomaly Detection Section 4: Introduction to Machine Learning with Scikit-Learn
Chapter 9: Getting Started with Machine Learning in Python Chapter 10: Making Better Predictions – Optimizing Models Chapter 11: Machine Learning Anomaly Detection Section 5: Additional Resources
Chapter 12: The Road Ahead Solutions
Other Books You May Enjoy Appendix

Exercises

Complete the following exercises to practice the skills covered in this chapter. Be sure to consult the Machine learning workflow section in the Appendix as a refresher on the process of building models:

  1. Predict star temperature with elastic net linear regression as follows:

    a) Using the data/stars.csv file, build a pipeline to normalize the data with a MinMaxScaler object and then run elastic net linear regression using all the numeric columns to predict the temperature of the star.

    b) Run grid search on the pipeline to find the best values for alpha, l1_ratio, and fit_intercept for the elastic net in the search space of your choice.

    c) Train the model on 75% of the initial data.

    d) Calculate the R2 of your model.

    e) Find the coefficients for each regressor and the intercept.

    f) Visualize the residuals using the plot_residuals() function from the ml_utils.regression module.

  2. Perform multiclass classification of white wine quality using a support vector machine and feature...
lock icon The rest of the chapter is locked
Register for a free Packt account to unlock a world of extra content!
A free Packt account unlocks extra newsletters, articles, discounted offers, and much more. Start advancing your knowledge today.
Unlock this book and the full library FREE for 7 days
Get unlimited access to 7000+ expert-authored eBooks and videos courses covering every tech area you can think of
Renews at $15.99/month. Cancel anytime}