Abstract

Two algorithms for producing multiple imputations for missing data are evaluated with simulated data. Software using a propensity score classifier with the approximate Bayesian bootstrap produces badly biased estimates of regression coefficients when data on predictor variables are missing at random or missing completely at random. On the other hand, a regression-based method employing the data augmentation algorithm produces estimates with little or no bias.

Keywords

Missing dataImputation (statistics)StatisticsComputer scienceBayesian probabilityPropensity score matchingRegressionRegression analysisData miningEconometricsMathematics

Affiliated Institutions

Related Publications

Applied Missing Data Analysis

Part 1. An Introduction to Missing Data. 1.1 Introduction. 1.2 Chapter Overview. 1.3 Missing Data Patterns. 1.4 A Conceptual Overview of Missing Data heory. 1.5 A More Formal De...

2010 6888 citations

Multiple Imputation of Missing Values

Following the seminal publications of Rubin about thirty years ago, statisticians have become increasingly aware of the inadequacy of “complete-case” analysis of datasets with m...

2004 The Stata Journal Promoting communica... 2310 citations

Publication Info

Year
2000
Type
article
Volume
28
Issue
3
Pages
301-309
Citations
786
Access
Closed

External Links

Social Impact

Altmetric

Social media, news, blog, policy document mentions

Citation Metrics

786
OpenAlex

Cite This

Paul D. Allison (2000). Multiple Imputation for Missing Data. Sociological Methods & Research , 28 (3) , 301-309. https://doi.org/10.1177/0049124100028003003

Identifiers

DOI
10.1177/0049124100028003003