Generalizing Univariate Predictive Mean Matching to Impute Multiple Variables Simultaneously

Mingyang Cai*, Stef van Buuren, Gerko Vink

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionAcademicpeer-review

Abstract

Predictive mean matching (PMM) is an easy-to-use and versatile univariate imputation approach. It is robust against transformations of the incomplete variable and violation of the normal model. However, univariate imputation methods cannot directly preserve multivariate relations in the imputed data. We wish to extend PMM to a multivariate method to produce imputations that are consistent with the knowledge of derived data (e.g., data transformations, interactions, sum restrictions, range restrictions, and polynomials). This paper proposes multivariate predictive mean matching (MPMM), which can impute incomplete variables simultaneously. Instead of the normal linear model, we apply canonical regression analysis to calculate the predicted value used for donor selection. To evaluate the performance of MPMM, we compared it with other imputation approaches under four scenarios: 1) multivariate normal distributed data, 2) linear regression with quadratic terms; 3) linear regression with interaction terms; 4) incomplete data with inequality restrictions. The simulation study shows that with moderate missingness patterns, MPMM provides plausible imputations at the univariate level and preserves relations in the data.

Original languageEnglish
Title of host publicationIntelligent Computing
Subtitle of host publicationProceedings of the 2022 Computing Conference, Volume 1
EditorsKohei Arai
Place of PublicationCham
PublisherSpringer
Pages75-91
Number of pages17
ISBN (Electronic)978-3-031-10461-9
ISBN (Print)978-3-031-10460-2
DOIs
Publication statusPublished - 7 Jul 2022
EventComputing Conference, 2022 - Virtual, Online
Duration: 14 Jul 202215 Jul 2022

Publication series

NameLecture Notes in Networks and Systems
PublisherSpringer
Volume506
ISSN (Print)2367-3370
ISSN (Electronic)2367-3389

Conference

ConferenceComputing Conference, 2022
CityVirtual, Online
Period14/07/2215/07/22

Keywords

  • Block imputation
  • Canonical regression analysis
  • Missing data
  • Multiple imputation
  • Multivariate analysis
  • Predictive mean matching

Fingerprint

Dive into the research topics of 'Generalizing Univariate Predictive Mean Matching to Impute Multiple Variables Simultaneously'. Together they form a unique fingerprint.

Cite this