Multiple Imputation of Predictor Variables Using Generalized Additive Models

Roel de Jong, Stef van Buuren, Martin Spiess*

*Corresponding author for this work

Research output: Contribution to journalArticleAcademicpeer-review

Abstract

The sensitivity of multiple imputation methods to deviations from their distributional assumptions is investigated using simulations, where the parameters of scientific interest are the coefficients of a linear regression model, and values in predictor variables are missing at random. The performance of a newly proposed imputation method based on generalized additive models for location, scale, and shape (GAMLSS) is investigated. Although imputation methods based on predictive mean matching are virtually unbiased, they suffer from mild to moderate under-coverage, even in the experiment where all variables are jointly normal distributed. The GAMLSS method features better coverage than currently available methods.

Original languageEnglish
Pages (from-to)968-985
JournalCommunications in Statistics - Simulation and Computation
Volume45
DOIs
Publication statusPublished - 2016

Keywords

  • Comparison of imputation methods
  • Generalized additive models for location
  • Linear model
  • Multiple imputation
  • Predictive mean matching
  • Robust imputation models
  • Scale and shape.

Fingerprint

Dive into the research topics of 'Multiple Imputation of Predictor Variables Using Generalized Additive Models'. Together they form a unique fingerprint.

Cite this