Academic Commons

Articles

Multiple Imputation with Diagnostics (mi) in R: Opening Windows into the Black Box

Su, Yu-Sung; Gelman, Andrew E.; Hill, Jennifer; Yajima, Masanao

Our mi package in R has several features that allow the user to get inside the imputation process and evaluate the reasonableness of the resulting models and imputations. These features include: choice of predictors, models, and transformations for chained imputation models; standard and binned residual plots for checking the fit of the conditional distributions used for imputation; and plots for comparing the distributions of observed and imputed data. In addition, we use Bayesian models and weakly informative prior distributions to construct more stable estimates of imputation models. Our goal is to have a demonstration package that (a) avoids many of the practical problems that arise with existing multivariate imputation programs, and (b) demonstrates state-of-the-art diagnostics that can be applied more generally and can be incorporated into the software of others.

Subjects

Files

Also Published In

Title
Journal of Statistical Software

More About This Work

Academic Units
Statistics
Published Here
November 20, 2012
Academic Commons provides global access to research and scholarship produced at Columbia University, Barnard College, Teachers College, Union Theological Seminary and Jewish Theological Seminary. Academic Commons is managed by the Columbia University Libraries.