Abstract
Boosting takes on various forms, with different programs using different loss functions, different base models, and different optimization schemes. The gbm package takes the approach described in [2] and [3]. Some of the terminology differs, mostly due to an effort to cast boosting terms into more standard statistical terminology (e.g. deviance). In addition, the gbm package implements boosting for models commonly used in statistics but not commonly associated with boosting. The Cox proportional hazards model, for example, is an incredibly useful model, and the boosting framework applies quite readily with only slight modification [5]. Also, some algorithms implemented in the gbm package differ from the standard implementation. The AdaBoost algorithm [1] has a particular loss function and a particular optimization algorithm associated with it. The gbm implementation of AdaBoost adopts AdaBoost's exponential loss function (its bound on the misclassification rate) but uses Friedman's gradient descent algorithm rather than the one originally proposed. So the main purpose of this document is to spell out in detail what the gbm package implements.

1 Gradient boosting

This section essentially presents the derivation of boosting described in [2]. The gbm package also adopts the stochastic gradient boosting strategy, a small but important tweak on the basic algorithm, described in [3].

1.1 Friedman's gradient boosting machine

Friedman (2001) and the companion paper Friedman (2002) extended the work of Friedman, Hastie, and Tibshirani (2000) and laid the groundwork for a new generation of boosting algorithms. Using the connection between boosting and optimization, this new work proposes the Gradient Boosting Machine.

In any function estimation problem we wish to find a regression function, f̂(x), that minimizes the expectation of some loss function, Ψ(y, f), as shown in (4).

    \hat{f}(x) = \arg\min_{f(x)} E_{y,x}\,\Psi(y, f(x))    (4)

The gradient boosting algorithm proceeds as follows.

Initialize \hat{f}(x) to be a constant, \hat{f}(x) = \arg\min_{\rho} \sum_{i=1}^{N} \Psi(y_i, \rho).
For t in 1, ..., T do

1. Compute the negative gradient as the working response

    z_i = -\left.\frac{\partial}{\partial f(x_i)} \Psi(y_i, f(x_i)) \right|_{f(x_i) = \hat{f}(x_i)}
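To make the loop above concrete, here is a minimal sketch of Friedman's gradient boosting iteration for squared-error loss, using small rpart trees as the base learner. It is only an illustration under stated assumptions: the function gradient_boost and its arguments are hypothetical names, and this is not the gbm package's internal implementation (gbm supports many loss functions and, following [3], fits each tree to a random subsample of the data).

```r
library(rpart)

## Minimal sketch of Friedman's gradient boosting loop for squared-error loss.
## For this loss the negative gradient z_i is simply the residual y_i - f_hat(x_i),
## and the line-search step size is 1, so only the shrunken update remains.
gradient_boost <- function(x, y, n_trees = 100, shrinkage = 0.1, depth = 2) {
  ## Initialize f_hat(x) to the constant minimizing sum_i Psi(y_i, rho);
  ## for squared-error loss that constant is mean(y).
  f_hat <- rep(mean(y), length(y))
  trees <- vector("list", n_trees)

  for (t in seq_len(n_trees)) {
    ## Step 1: negative gradient of squared-error loss = ordinary residuals.
    z <- y - f_hat
    dat <- data.frame(z = z, x)
    ## Step 2: fit the base learner (a small regression tree) to the working response z.
    trees[[t]] <- rpart(z ~ ., data = dat,
                        control = rpart.control(maxdepth = depth, cp = 0))
    ## Step 3: take a shrunken step in the direction of the fitted gradient.
    f_hat <- f_hat + shrinkage * predict(trees[[t]], newdata = dat)
  }
  list(init = mean(y), trees = trees, shrinkage = shrinkage)
}

## Toy usage: recover a smooth signal from noisy observations.
set.seed(1)
x <- data.frame(x1 = runif(200))
y <- sin(2 * pi * x$x1) + rnorm(200, sd = 0.2)
fit <- gradient_boost(x, y, n_trees = 200)
```

Changing the loss function changes only the working response (step 1) and the line-search step; the rest of the loop is the same for every distribution the package supports.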