Abstract

Recent work has shown that learning an ensemble consisting of multiple models and then making classifications by combining the classifications of the models often leads to more accurate classifications then those based on a single model learned from the same data. However, the amount of error reduction achieved varies from data set to data set. This paper provides empirical evidence that there is a linear relationship between the degree of error reduction and the degree to which patterns of errors made by individual models are uncorrelated. Ensemble error rate is most reduced in ensembles whose constituents make individual errors in a less correlated manner. The second result of the work is that some of the greatest error reductions occur on domains for which many ties in information gain occur during learning. The third result is that ensembles consisting of models that make errors in a dependent but "negatively correlated" manner will have lower ensemble error rates than ensembles wh...

Keywords

UncorrelatedReduction (mathematics)Set (abstract data type)Computer scienceTree (set theory)Word error rateArtificial intelligenceCorrelationEnsemble learningMathematicsVariation (astronomy)AlgorithmStatisticsMachine learning

Related Publications

Publication Info

Year
1995
Type
article
Citations
73
Access
Closed

External Links

Citation Metrics

73
OpenAlex

Cite This

Kamal Ali, Michael J. Pazzani (1995). On the link between error correlation and error reduction in decision tree ensembles. eScholarship (California Digital Library) .