Principled missing data methods for researchers

被引:1415
作者
Dong, Yiran [1 ]
Peng, Chao-Ying Joanne [1 ]
机构
[1] Indiana Univ, Bloomington, IN 47405 USA
来源
SPRINGERPLUS | 2013年 / 2卷
关键词
Missing data; Listwise deletion; MI; Gamma IML; EM; MAR; MCAR; MNAR; MULTIPLE IMPUTATION; MAXIMUM-LIKELIHOOD; CHAINED EQUATIONS; PERFORMANCE; SOFTWARE; UPDATE; VALUES; STATE;
D O I
10.1186/2193-1801-2-222
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
The impact of missing data on quantitative research can be serious, leading to biased estimates of parameters, loss of information, decreased statistical power, increased standard errors, and weakened generalizability of findings. In this paper, we discussed and demonstrated three principled missing data methods: multiple imputation, full information maximum likelihood, and expectation-maximization algorithm, applied to a real-world data set. Results were contrasted with those obtained from the complete data set and from the listwise deletion method. The relative merits of each method are noted, along with common features they share. The paper concludes with an emphasis on the importance of statistical assumptions, and recommendations for researchers. Quality of research will be enhanced if (a) researchers explicitly acknowledge missing data problems and the conditions under which they occurred, (b) principled methods are employed to handle missing data, and (c) the appropriate treatment of missing data is incorporated into review standards of manuscripts submitted for publication.
引用
收藏
页码:1 / 17
页数:17
相关论文
共 50 条
  • [41] Missing data in bioarchaeology II: A test of ordinal and continuous data imputation
    Wissler, Amanda
    Blevins, Kelly E.
    Buikstra, Jane E.
    AMERICAN JOURNAL OF BIOLOGICAL ANTHROPOLOGY, 2022, 179 (03): : 349 - 364
  • [42] Methods of solving missing data issues in credit risk scoring and comparison of its effectiveness
    Soldatyuk, Nataliya
    Sopko, Stanislav
    MATHEMATICAL METHODS IN ECONOMICS (MME 2014), 2014, : 926 - 931
  • [43] Identifying reprioritization response shift in a stroke caregiver population: a comparison of missing data methods
    Sajobi, Tolulope T.
    Lix, Lisa M.
    Singh, Gurbakhshash
    Lowerison, Mark
    Engbers, Jordan
    Mayo, Nancy E.
    QUALITY OF LIFE RESEARCH, 2015, 24 (03) : 529 - 540
  • [44] Imputation methods for addressing missing data in short-term monitoring of air pollutants
    Hadeed, Steven J.
    O'Rourke, Mary Kay
    Burgess, Jefferey L.
    Harris, Robin B.
    Canales, Robert A.
    SCIENCE OF THE TOTAL ENVIRONMENT, 2020, 730 (730)
  • [45] Tensor-Based Methods for Handling Missing Data in Quality-of-Life Questionnaires
    Garg, Lalit
    Dauwels, Justin
    Earnest, Arul
    Leong, Khai Pang
    IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2014, 18 (05) : 1571 - 1580
  • [46] The Effects of Missing Data Handling Methods on Reliability Coefficients: A Monte Carlo Simulation Study
    Kacak, Tugay
    Kilic, Abdullah Faruk
    JOURNAL OF MEASUREMENT AND EVALUATION IN EDUCATION AND PSYCHOLOGY-EPOD, 2024, 15 (02): : 166 - 182
  • [47] A novel model to optimize multiple imputation algorithm for missing data using evolution methods
    Mohammed, Yasser Salaheldin
    Abdelkader, Hatem
    Plawiak, Pawel
    Hammad, Mohamed
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2022, 76
  • [48] Analyzing data sets with missing data: An empirical evaluation of imputation methods and likelihood-based methods
    Myrtveit, I
    Stensrud, E
    Olsson, UH
    IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, 2001, 27 (11) : 999 - 1013
  • [49] An Investigation of Missing Data Methods for Classification Trees Applied to Binary Response Data
    Ding, Yufeng
    Simonoff, Jeffrey S.
    JOURNAL OF MACHINE LEARNING RESEARCH, 2010, 11 : 131 - 170
  • [50] Missing in Action: A Case Study of the Application of Methods for Dealing With Missing Data to Trauma System Benchmarking
    O'Reilly, Gerard M.
    Jolley, Damien J.
    Cameron, Peter A.
    Gabbe, Belinda
    ACADEMIC EMERGENCY MEDICINE, 2010, 17 (10) : 1122 - 1129