Abstract

Summary The biases and shortcomings of stepwise multiple regression are well established within the statistical literature. However, an examination of papers published in 2004 by three leading ecological and behavioural journals suggested that the use of this technique remains widespread: of 65 papers in which a multiple regression approach was used, 57% of studies used a stepwise procedure. The principal drawbacks of stepwise multiple regression include bias in parameter estimation, inconsistencies among model selection algorithms, an inherent (but often overlooked) problem of multiple hypothesis testing, and an inappropriate focus or reliance on a single best model. We discuss each of these issues with examples. We use a worked example of data on yellowhammer distribution collected over 4 years to highlight the pitfalls of stepwise regression. We show that stepwise regression allows models containing significant predictors to be obtained from each year's data. In spite of the significance of the selected models, they vary substantially between years and suggest patterns that are at odds with those determined by analysing the full, 4‐year data set. An information theoretic (IT) analysis of the yellowhammer data set illustrates why the varying outcomes of stepwise analyses arise. In particular, the IT approach identifies large numbers of competing models that could describe the data equally well, showing that no one model should be relied upon for inference.

Keywords

Stepwise regressionModel selectionInferenceRegressionRegression analysisSet (abstract data type)StatisticsSelection (genetic algorithm)Statistical inferenceEcologyComputer scienceEconometricsMachine learningArtificial intelligenceMathematicsBiology

Affiliated Institutions

Related Publications

Introduction to Econometrics

Foreword. Preface to the Second Edition. Preface to the Third Edition. Obituary. INTRODUCTION AND THE LINEAR REGRESSION MODEL. What is Econometrics? Statistical Background and M...

2020 WORLD SCIENTIFIC eBooks 3511 citations

Publication Info

Year
2006
Type
review
Volume
75
Issue
5
Pages
1182-1189
Citations
1478
Access
Closed

External Links

Social Impact

Social media, news, blog, policy document mentions

Citation Metrics

1478
OpenAlex

Cite This

Mark J. Whittingham, Philip A. Stephens, Richard B. Bradbury et al. (2006). Why do we still use stepwise modelling in ecology and behaviour?. Journal of Animal Ecology , 75 (5) , 1182-1189. https://doi.org/10.1111/j.1365-2656.2006.01141.x

Identifiers

DOI
10.1111/j.1365-2656.2006.01141.x