Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2016 Jan;4(2):30.
doi: 10.3978/j.issn.2305-5839.2015.12.63.

Multiple imputation with multivariate imputation by chained equation (MICE) package

Affiliations

Multiple imputation with multivariate imputation by chained equation (MICE) package

Zhongheng Zhang. Ann Transl Med. 2016 Jan.

Abstract

Multiple imputation (MI) is an advanced technique for handing missing values. It is superior to single imputation in that it takes into account uncertainty in missing value imputation. However, MI is underutilized in medical literature due to lack of familiarity and computational challenges. The article provides a step-by-step approach to perform MI by using R multivariate imputation by chained equation (MICE) package. The procedure firstly imputed m sets of complete dataset by calling mice() function. Then statistical analysis such as univariate analysis and regression model can be performed within each dataset by calling with() function. This function sets the environment for statistical analysis. Lastly, the results obtained from each analysis are combined by using pool() function.

Keywords: Big-data clinical trial; R; imputed complete dataset; multiple imputation (MI); multivariate imputation by chained equation (MICE) package.

PubMed Disclaimer

Conflict of interest statement

Conflicts of Interest: The author has no conflicts of interest to declare.

Figures

None
Zhongheng Zhang, MMed.
Figure 1
Figure 1
Schematic illustration of how MICE package works with data frame with missing values. Note the sequential use of mice(), with() and pool() functions.

Similar articles

Cited by

References

    1. Donders AR, van der Heijden GJ, Stijnen T, et al. Review: a gentle introduction to imputation of missing values. J Clin Epidemiol 2006;59:1087-91. - PubMed
    1. Morris TP, White IR, Royston P. Tuning multiple imputation by predictive mean matching and local residual draws. BMC Med Res Methodol 2014;14:75. - PMC - PubMed
    1. White IR, Royston P, Wood AM. Multiple imputation using chained equations: Issues and guidance for practice. Stat Med 2011;30:377-99. - PubMed
    1. Moons KG, Donders RA, Stijnen T, et al. Using the outcome for imputation of missing predictor values was preferred. J Clin Epidemiol 2006;59:1092-101. - PubMed
    1. Zhang Z, Chen K, Ni H, et al. Predictive value of lactate in unselected critically ill patients: an analysis using fractional polynomials. J Thorac Dis 2014;6:995-1003. - PMC - PubMed