How Biased is the Apparent Error Rate of a Prediction Rule?

Abstract

Abstract A regression model is fitted to an observed set of data. How accurate is the model for predicting future observations? The apparent error rate tends to underestimate the true error rate because the data have been used twice, both to fit the model and to check its accuracy. We provide simple estimates for the downward bias of the apparent error rate. The theory applies to general exponential family linear models and general measures of prediction error. Special attention is given to the case of logistic regression on binary data, with error rates measured by the proportion of misclassified cases. Several connected ideas are compared: Mallows's Cp , cross-validation, generalized cross-validation, the bootstrap, and Akaike's information criterion.

Keywords

Akaike information criterionStatisticsMathematicsLogistic regressionInformation CriteriaMean squared prediction errorRegressionWord error rateData setSet (abstract data type)Observational errorEconometricsModel selectionComputer scienceArtificial intelligence

Affiliated Institutions

Related Publications

Model Selection and Akaike's Information Criterion (AIC): The General Theory and its Analytical Extensions

Hamparsum Bozdogan

During the last fifteen years, Akaike's entropy-based Information Criterion (AIC) has had a fundamental impact in statistical model evaluation problems. This paper studies the g...

1987 Psychometrika 4411 citations

Partial least squares regression and projection on latent structure regression (PLS Regression)

Hervé Abdi

Abstract Partial least squares (PLS) regression ( a.k.a. projection on latent structures) is a recent technique that combines features from and generalizes principal component a...

2010 Wiley Interdisciplinary Reviews Compu... 1363 citations

Further analysis of the data by Akaike's information criterion and the finite corrections

Nariaki Sugiura

Using Akaike's information criterion, three examples of statistical data are reanalyzed and show reasonably definite conclusions. One is concerned with the multiple comparison p...

1978 Communication in Statistics- Theory a... 2090 citations

Lattice-based optimization of sequence classification criteria for neural-network acoustic modeling

Brian Kingsbury

Acoustic models used in hidden Markov model/neural-network (HMM/NN) speech recognition systems are usually trained with a frame-based cross-entropy error criterion. In contrast,...

2009 238 citations

Cross-Validatory Choice and Assessment of Statistical Predictions

M. Stone

Summary A generalized form of the cross-validation criterion is applied to the choice and assessment of prediction using the data-analytic concept of a prescription. The example...

1974 Journal of the Royal Statistical Soci... 10037 citations

Publication Info

Year: 1986
Type: article
Volume: 81
Issue: 394
Pages: 461-461
Citations: 419
Access: Closed

External Links