Sensitivity analysis for multivariable missing data using multiple imputation: a tutorial

  • Cattram D Nguyen*
  • , Katherine J Lee
  • , Ian R White
  • , Stef van Buuren
  • , Margarita Moreno-Betancur
  • *Corresponding author for this work

Research output: Working paperPreprintAcademic

Abstract

Multiple imputation is a popular method for handling missing data, with fully conditional specification (FCS) being one of the predominant imputation approaches for multivariable missingness. Unbiased estimation with standard implementations of multiple imputation depends on assumptions concerning the missingness mechanism (e.g. that data are "missing at random"). The plausibility of these assumptions can only be assessed using subject-matter knowledge, and not data alone. It is therefore important to perform sensitivity analyses to explore the robustness of results to violations of these assumptions (e.g. if the data are in fact "missing not at random"). In this tutorial, we provide a roadmap for conducting sensitivity analysis using the Not at Random Fully Conditional Specification (NARFCS) procedure for multivariate imputation. Using a case study from the Longitudinal Study of Australian Children, we work through the steps involved, from assessing the need to perform the sensitivity analysis, and specifying the NARFCS models and sensitivity parameters, through to implementing NARFCS using FCS procedures in R and Stata.
Original languageEnglish
PublisherarXiv
Number of pages24
DOIs
Publication statusPublished - 5 Feb 2025

Bibliographical note

24 pages, 3 figures, 2 tables

Keywords

  • stat.ME

Fingerprint

Dive into the research topics of 'Sensitivity analysis for multivariable missing data using multiple imputation: a tutorial'. Together they form a unique fingerprint.

Cite this