Abstract
Missing data frequently complicates data analysis for scientific investigations. The development of statistical methods to address missing data has been an active area of research in recent decades. Multiple imputation, originally proposed by Rubin in a public use dataset setting, is a general purpose method for analyzing datasets with missing data that is broadly applicable to a variety of missing data settings. We review multiple imputation as an analytic strategy for missing data. We describe and evaluate a number of software packages that implement this procedure, and contrast the interface, features, and results. We compare the packages, and detail shortcomings and useful features. The comparisons are illustrated using examples from an artificial dataset and a study of child psychopathology. We suggest additional features and discuss limitations and cautions to consider when using multiple imputation as an analytic strategy for incomplete data settings.
Publication Info
- Year: 2001
- Type: article
- Volume: 55
- Issue: 3
- Pages: 244-254
- Citations: 597
- Access: Closed
Identifiers
- DOI: 10.1198/000313001317098266