Statistical Methods

Bias in regression coefficient estimates when assumptions for handling missing data are violated: a simulation study


Abstract


Background

The purpose of this simulation study is to assess the performance of multiple imputation compared to complete case analysis when assumptions of missing data mechanisms are violated.

Methods

The authors performed a stochastic simulation study to assess the performance of Complete Case (CC) analysis and Multiple Imputation (MI) with different missing data mechanisms (missing completely at random (MCAR), at random (MAR), and not at random (MNAR)). The study focused on the point estimation of regression coefficients and standard errors.

Results

When data were MAR conditional on Y, CC analysis resulted in biased regression coefficients; they were all underestimated in our scenarios. In these scenarios, analysis after MI gave correct estimates. Yet, in case of MNAR MI yielded biased regression coefficients, while CC analysis performed well.
Conclusion

The authors demonstrated that MI was only superior to CC analysis in case of MCAR or MAR. In some scenarios CC may be superior over MI. Often it is not feasible to identify the reason why data in a given dataset are missing. Therefore, emphasis should be put on reporting the extent of missing values, the method used to address them, and the assumptions that were made about the mechanism that caused missing data.

Full Text:

PDF


DOI: https://doi.org/10.2427/11598

NBN: http://nbn.depositolegale.it/urn%3Anbn%3Ait%3Aprex-16690

References



Article Metrics

Metrics Loading ...

Metrics powered by PLOS ALM

Refbacks

  • There are currently no refbacks.


Copyright (c) 2016 Epidemiology, Biostatistics and Public Health

Creative Commons License
This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.
We use cookies to ensure that we give you the best experience on our website. If you continue to use this site we will assume that you are happy with it (Read more).
Ok


EBPH Epidemiology, Biostatistics and Public Health | ISSN 2282-0930

Creative Commons License
This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.