Gradient boosting and class imbalance

Ensembles of models (several models combined) fall into two main groups: bagging and boosting. Bagging stands for bootstrap aggregation: several submodels are trained on datasets generated by bootstrapping (resampling with replacement) from the original data. Each resampled dataset is different, so each submodel yields different results, and their predictions are aggregated. Boosting, on the other hand, relies on training each subsequent model on the residuals from the previous step: at each step we take the current aggregated model, fit a new model to its residuals, and combine the two so that the overall predictions are as good as possible.
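The residual-fitting idea behind boosting can be made concrete in a few lines of R. The following is a minimal sketch, not the book's own code: it assumes only the rpart package (shipped with R as a recommended package), and the simulated data, shrinkage value of 0.1, and shallow-tree submodels are all illustrative choices.

library(rpart)

set.seed(1)
df <- data.frame(x = runif(200))
df$y <- sin(2 * pi * df$x) + rnorm(200, sd = 0.2)

n_trees   <- 50
shrinkage <- 0.1
pred <- rep(mean(df$y), nrow(df))   # start the ensemble from the mean

for (i in seq_len(n_trees)) {
  df$resid <- df$y - pred          # residuals of the current aggregated model
  tree <- rpart(resid ~ x, data = df,
                control = rpart.control(maxdepth = 2))
  pred <- pred + shrinkage * predict(tree, df)  # add the new model's contribution
}

Each pass fits a small tree to whatever the ensemble still gets wrong, then adds a damped version of that tree's predictions; the shrinkage factor keeps any single submodel from dominating.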

The most famous bagging technique is random forests, which we have used previously in this chapter. Several boosting techniques have enjoyed enormous popularity...
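Since the section title pairs boosting with class imbalance, it is worth noting that a common way to handle imbalance in a boosted classifier is to up-weight the minority class. The sketch below is an illustration under stated assumptions, not the book's code: it assumes the classic xgboost R interface and uses simulated data with roughly 10-15% positive cases; the parameter values are arbitrary.

library(xgboost)

set.seed(1)
n <- 1000
X <- matrix(rnorm(n * 5), ncol = 5)
y <- rbinom(n, 1, plogis(X[, 1] - 2))   # imbalanced: few positives

# Up-weight the positive (minority) class by the negative/positive ratio
spw <- sum(y == 0) / sum(y == 1)

fit <- xgboost(data = X, label = y,
               objective = "binary:logistic",
               scale_pos_weight = spw,
               nrounds = 100, max_depth = 3, eta = 0.1,
               verbose = 0)

Here scale_pos_weight makes errors on the rare class cost proportionally more during training, which pushes the boosted trees to pay attention to it rather than defaulting to the majority class.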
