Packt+ | Advance your knowledge in tech

You're reading from Applied Supervised Learning with Python Use scikit-learn to build predictive models from real-world datasets and prepare yourself for the future of machine learning

Product type Paperback

Published in Apr 2019

Publisher

ISBN-13 9781789954920

Length 404 pages

Edition 1st Edition

Languages

Python

Tools

Scikit-learn

Concepts

Machine Learning

Authors (2):

Benjamin Johnston

Ishita Mathur

View More author details

Table of Contents (9) Chapters

Applied Supervised Learning with Python

Preface

1. Python Machine Learning Toolkit FREE CHAPTER

2. Exploratory Data Analysis and Visualization

3. Regression Analysis

4. Classification

5. Ensemble Modeling

6. Model Evaluation

Appendix

Chapter 5: Ensemble Modeling

Activity 14: Stacking with Standalone and Ensemble Algorithms

Solution

Import the relevant libraries:

import pandas as pd
import numpy as np
import seaborn as sns

%matplotlib inline
import matplotlib.pyplot as plt

from sklearn.model_selection import train_test_split
from sklearn.metrics import mean_absolute_error
from sklearn.model_selection import KFold

from sklearn.linear_model import LinearRegression
from sklearn.tree import DecisionTreeRegressor
from sklearn.neighbors import KNeighborsRegressor
from sklearn.ensemble import GradientBoostingRegressor, RandomForestRegressor

Read the data and print the first five rows:
```
data = pd.read_csv('house_prices.csv')
data.head()
```
The output will be as follows:
Figure 5.19: The first 5 rows
Preprocess the dataset to remove null values and one-hot encode categorical variables to prepare the data for modeling.
First, we remove all columns where more than 10% of the values are null. To do this, calculate the fraction of missing values...