Abstract

We compare the performance of three types of neural network-based ensemble techniques to that of a single neural network. The ensemble algorithms are two versions of boosting and committees of neural networks trained independently. For each of the four algorithms, we experimentally determine the test and training error curves on an optical character recognition (OCR) problem as a function of both training set size and computational cost, using three architectures. We show that a single machine is best for small training set sizes, while for large training set sizes some version of boosting is best. However, for a given computational cost, boosting is always best. Furthermore, we show a surprising result for the original boosting algorithm: namely, that as the training set size increases, the training error decreases until it asymptotes to the test error rate. This has potential implications in the search for better training algorithms.
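As a rough illustration of one of the ensemble techniques the abstract names, the sketch below trains a committee of small neural networks independently (differing only in random initialization) and combines their predictions by majority vote, comparing it against a single network. This is only a hedged, minimal sketch: the data set (scikit-learn's 8x8 digits), the network sizes, the committee size, and the voting rule are assumptions for illustration, not the paper's OCR architectures or its boosting procedures.

```python
# Minimal sketch (assumed setup, not the paper's): single network vs. a
# committee of independently trained networks combined by majority vote.
import numpy as np
from sklearn.datasets import load_digits
from sklearn.model_selection import train_test_split
from sklearn.neural_network import MLPClassifier

X, y = load_digits(return_X_y=True)
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.5, random_state=0)

# Baseline: a single small network.
single = MLPClassifier(hidden_layer_sizes=(32,), max_iter=500, random_state=0)
single.fit(X_train, y_train)
print("single network test accuracy:", single.score(X_test, y_test))

# Committee: the same architecture trained independently from five different
# random initializations; the committee prediction is a per-example majority vote.
committee = [
    MLPClassifier(hidden_layer_sizes=(32,), max_iter=500, random_state=seed)
    .fit(X_train, y_train)
    for seed in range(5)
]
votes = np.stack([m.predict(X_test) for m in committee])  # shape: (members, examples)
majority = np.apply_along_axis(lambda col: np.bincount(col).argmax(), 0, votes)
print("committee test accuracy:", np.mean(majority == y_test))
```

The committee here only varies the random seed; the paper's boosted ensembles instead train each subsequent network on examples filtered by the errors of the earlier ones.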

Keywords

Boosting (machine learning), Artificial neural network, Gradient boosting, Computer science, Machine learning, Artificial intelligence, Asymptote, Word error rate, Training set, Test set, Ensemble learning, Algorithm, Pattern recognition (psychology), Mathematics

Publication Info

Year: 1994
Type: article
Volume: 6
Issue: 6
Pages: 1289-1301
Citations: 352
Access: Closed

Citation Metrics

352 citations (OpenAlex)

Cite This

Harris Drucker, Corinna Cortes, L. D. Jackel et al. (1994). Boosting and Other Ensemble Methods. Neural Computation, 6(6), 1289-1301. https://doi.org/10.1162/neco.1994.6.6.1289

Identifiers

DOI
10.1162/neco.1994.6.6.1289