Abstract
This paper studies summary measures of the predictive power of a generalized linear model, paying special attention to a generalization of the multiple correlation coefficient from ordinary linear regression. The population value is the correlation between the response and its conditional expectation given the predictors, and the sample value is the correlation between the observed response and the model predicted value. We compare four estimators of the measure in terms of bias, mean squared error and behaviour in the presence of overparameterization. The sample estimator and a jack-knife estimator usually behave adequately, but a cross-validation estimator has a large negative bias with large mean squared error. One can use bootstrap methods to construct confidence intervals for the population value of the correlation measure and to estimate the degree to which a model selection procedure may provide an overly optimistic measure of the actual predictive power.
Keywords
Affiliated Institutions
Related Publications
On the degrees of freedom in shape-restricted regression
For the problem of estimating a regression function, $\\mu$ say,\nsubject to shape constraints, like monotonicity or convexity, it is argued that\nthe divergence of the maximum ...
Theory for penalised spline regression
Penalised spline regression is a popular new approach to smoothing, but its theoretical properties are not yet well understood. In this paper, mean squared error expressions and...
Extension of the Gauss-Markov Theorem to Include the Estimation of Random Effects
The general mixed linear model can be written $y = X\\alpha + Zb$, where $\\alpha$ is a vector of fixed effects and $b$ is a vector of random variables. Assume that $E(b) = 0$ a...
Fast Stable Restricted Maximum Likelihood and Marginal Likelihood Estimation of Semiparametric Generalized Linear Models
Summary Recent work by Reiss and Ogden provides a theoretical basis for sometimes preferring restricted maximum likelihood (REML) to generalized cross-validation (GCV) for smoot...
How Biased is the Apparent Error Rate of a Prediction Rule?
Abstract A regression model is fitted to an observed set of data. How accurate is the model for predicting future observations? The apparent error rate tends to underestimate th...
Publication Info
- Year
- 2000
- Type
- article
- Volume
- 19
- Issue
- 13
- Pages
- 1771-1781
- Citations
- 272
- Access
- Closed
External Links
Social Impact
Social media, news, blog, policy document mentions
Citation Metrics
Cite This
Identifiers
- DOI
- 10.1002/1097-0258(20000715)19:13<1771::aid-sim485>3.0.co;2-p