Multiple Imputation with Predictive Mean Matching Method for Numerical Missing Data

被引:10
作者
Akmam, Emha Fathul [1 ]
Siswantining, Titin [1 ]
Soemartojo, Saskya Mary [1 ]
Sarwinda, Devvi [1 ]
机构
[1] Univ Indonesia, Dept Math, Fac Math & Nat Sci, Depok, Indonesia
来源
2019 3RD INTERNATIONAL CONFERENCE ON INFORMATICS AND COMPUTATIONAL SCIENCES (ICICOS 2019) | 2019年
关键词
linear regression analysis; multiple imputation; missing values; predictive mean matching;
D O I
10.1109/icicos48119.2019.8982510
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Missing data are condition when there are some missing values or empty entries on several observations on data. It could inhibit statistical analysis process and might give a bias conclusion from the analysis if couldn't be handled properly. This problem can be found on some linear regression analysis. One way to handle this problem is using multiple imputation (MI) method named Predictive Mean Matching (PMM). PMM will matching the predictive mean distance of incomplete observations with the complete observations. To get the multiple imputation concept, the predictive mean of incomplete observations were estimated by Bayesian approach while the complete observations were estimated with ordinary least square. Thus, the complete observation that has the closest distance will be a donor value for the incomplete one. Simulation data with two variable (x and y), univariate missing data pattern (on y), and MAR mechanism is used to analyzed the effectiveness of PMM based on relative efficiency estimation result of missing covariate data. Regression analysis used x as independent variable and y as dependent variable. The result showed that PMM give a significant coefficient regression parameter at 5% level of significance and only loss 1% of relative efficiency.
引用
收藏
页数:6
相关论文
共 50 条
[41]   Generalizing Univariate Predictive Mean Matching to Impute Multiple Variables Simultaneously [J].
Cai, Mingyang ;
van Buuren, Stef ;
Vink, Gerko .
INTELLIGENT COMPUTING, VOL 1, 2022, 506 :75-91
[42]   MULTIPLY ROBUST NONPARAMETRIC MULTIPLE IMPUTATION FOR THE TREATMENT OF MISSING DATA [J].
Chen, Sixia ;
Haziza, David .
STATISTICA SINICA, 2019, 29 (04) :2035-2053
[43]   A Noise-Aware Multiple Imputation Algorithm for Missing Data [J].
Li, Fangfang ;
Sun, Hui ;
Gu, Yu ;
Yu, Ge .
MATHEMATICS, 2023, 11 (01)
[44]   Multiple Imputation of Missing Data in Cross-Cultural Samples [J].
Dow, Malcolm M. ;
Eff, E. Anthon .
CROSS-CULTURAL RESEARCH, 2009, 43 (03) :206-229
[45]   Best practices for addressing missing data through multiple imputation [J].
Woods, Adrienne D. ;
Gerasimova, Daria ;
Van Dusen, Ben ;
Nissen, Jayson ;
Bainter, Sierra ;
Uzdavines, Alex ;
Davis-Kean, Pamela E. ;
Halvorson, Max ;
King, Kevin M. ;
Logan, Jessica A. R. ;
Xu, Menglin ;
Vasilev, Martin R. ;
Clay, James M. ;
Moreau, David ;
Joyal-Desmarais, Keven ;
Cruz, Rick A. ;
Brown, Denver M. Y. ;
Schmidt, Kathleen ;
Elsherif, Mahmoud M. .
INFANT AND CHILD DEVELOPMENT, 2024, 33 (01)
[46]   A bias-corrected estimator in multiple imputation for missing data [J].
Tomita, Hiroaki ;
Fujisawa, Hironori ;
Henmi, Masayuki .
STATISTICS IN MEDICINE, 2018, 37 (23) :3373-3386
[47]   Partial distance evidential clustering for missing data with multiple imputation [J].
Tian, Hong-Peng ;
Zhang, Zhen .
KNOWLEDGE-BASED SYSTEMS, 2025, 310
[48]   Multiple Imputation of Missing Data for Multilevel Models: Simulations and Recommendations [J].
Grund, Simon ;
Luedtke, Oliver ;
Robitzsch, Alexander .
ORGANIZATIONAL RESEARCH METHODS, 2018, 21 (01) :111-149
[49]   Multiple Imputation for Missing Data in Inventory on Dominant Tree Height [J].
Zhang Xiangyun ;
Song Hongfeng .
RECENT ADVANCE IN STATISTICS APPLICATION AND RELATED AREAS, VOLS I AND II, 2009, :463-468
[50]   Multiple imputation of missing fMRI data in whole brain analysis [J].
Vaden, Kenneth I., Jr. ;
Gebregziabher, Mulugeta ;
Kuchinsky, Stefanie E. ;
Eckert, Marl A. .
NEUROIMAGE, 2012, 60 (03) :1843-1855