How many imputations are really needed? Some practical clarifications of multiple imputation theory.

abstract

Multiple imputation (MI) and full information maximum likelihood (FIML) are the two most common approaches to missing data analysis. In theory, MI and FIML are equivalent when identical models are tested using the same variables, and when m, the number of imputations performed with MI, approaches infinity. However, it is important to know how many imputations are necessary before MI and FIML are sufficiently equivalent in ways that are important to prevention scientists. MI theory suggests that small values of m, even on the order of three to five imputations, yield excellent results. Previous guidelines for sufficient m are based on relative efficiency, which involves the fraction of missing information (gamma) for the parameter being estimated, and m. In the present study, we used a Monte Carlo simulation to test MI models across several scenarios in which gamma and m were varied. Standard errors and p-values for the regression coefficient of interest varied as a function of m, but not at the same rate as relative efficiency. Most importantly, statistical power for small effect sizes diminished as m became smaller, and the rate of this power falloff was much greater than predicted by changes in relative efficiency. Based our findings, we recommend that researchers using MI should perform many more imputations than previously considered sufficient. These recommendations are based on gamma, and take into consideration one's tolerance for a preventable power falloff (compared to FIML) due to using too few imputations.

authors

Gilreath, Tamika

published proceedings

Prev Sci

altmetric score

14.5

author list (cited authors)

Graham, J. W., Olchowski, A. E., & Gilreath, T. D.

citation count

1804

complete list of authors

Graham, John W||Olchowski, Allison E||Gilreath, Tamika D

publication date

January 2007

publisher

Springer Nature Publisher

published in

Prevention Science Journal

keywords

Data Interpretation, Statistical
Humans
Likelihood Functions
Models, Statistical
Monte Carlo Method
Preventive Medicine
Sample Size

Digital Object Identifier (DOI)

10.1007/s11121-007-0070-9

start page

206

end page

213

volume

8

issue

3

URL

http://dx.doi.org/10.1007/s11121-007-0070-9

How many imputations are really needed? Some practical clarifications of multiple imputation theory. Academic Article

Overview

abstract

authors

published proceedings

altmetric score

author list (cited authors)

citation count

complete list of authors

publication date

publisher

published in

Research

keywords

Identity

Digital Object Identifier (DOI)

Additional Document Info

start page

end page

volume

issue

Other

URL