In this chapter, you've examined how to use classification models and some of the strategies for improving model performance. In addition to transforming categorical features, you've looked at the interpretation of logistic regression accuracy using the ROC curve. In attempting to improve model performance, we demonstrated the use of SVMs and were able to increase performance on the training set the cost of overfitting. Finally, we were able to achieve good performance on the test set through gradient boosted decision trees. Taken together with the material in Chapter 4, Connecting the Dots with Models – Regression Methods, you should now have a full toolkit of methods for continuous and categorical outcomes, which you can apply to problems in main domains.
- Tech Categories
- Best Sellers
- New Releases
- Books
- Videos
- Audiobooks
Tech Categories Popular Audiobooks
- Articles
- Newsletters
- Free Learning
You're reading from Mastering Predictive Analytics with Python
Joseph Babcock has spent more than a decade working with big data and AI in the e-commerce, digital streaming, and quantitative finance domains. Through his career he has worked on recommender systems, petabyte scale cloud data pipelines, A/B testing, causal inference, and time series analysis. He completed his PhD studies at Johns Hopkins University, applying machine learning to the field of drug discovery and genomics.
Read more about Joseph Babcock
Unlock this book and the full library FREE for 7 days
Author (1)
Joseph Babcock has spent more than a decade working with big data and AI in the e-commerce, digital streaming, and quantitative finance domains. Through his career he has worked on recommender systems, petabyte scale cloud data pipelines, A/B testing, causal inference, and time series analysis. He completed his PhD studies at Johns Hopkins University, applying machine learning to the field of drug discovery and genomics.
Read more about Joseph Babcock