Graphical Representation of Missing Data Problems

被引:46
|
作者
Thoemmes, Felix [1 ]
Mohan, Karthika [2 ]
机构
[1] Cornell Univ, Ithaca, NY 14853 USA
[2] Univ Calif Los Angeles, Los Angeles, CA 90024 USA
基金
美国国家科学基金会;
关键词
auxiliary variables; full information; graphical models; maximum likelihood; missing data; multiple imputation; MULTIPLE IMPUTATION; CAUSAL;
D O I
10.1080/10705511.2014.937378
中图分类号
O1 [数学];
学科分类号
0701 ; 070101 ;
摘要
Rubin's classic missingness mechanisms are central to handling missing data and minimizing biases that can arise due to missingness. However, the formulaic expressions that posit certain independencies among missing and observed data are difficult to grasp. As a result, applied researchers often rely on informal translations of these assumptions. We present a graphical representation of missing data mechanism, formalized in Mohan, Pearl, and Tian (2013). We show that graphical models provide a tool for comprehending, encoding, and communicating assumptions about the missingness process. Furthermore, we demonstrate on several examples how graph-theoretical criteria can determine if biases due to missing data might emerge in some estimates of interests and which auxiliary variables are needed to control for such biases, given assumptions about the missingness process.
引用
收藏
页码:631 / 642
页数:12
相关论文
共 50 条
  • [31] Performance of Missing Data Approaches Under Non ignorable Missing Data Conditions
    Pohl, Steffi
    Becker, Benjamin
    METHODOLOGY-EUROPEAN JOURNAL OF RESEARCH METHODS FOR THE BEHAVIORAL AND SOCIAL SCIENCES, 2020, 16 (02) : 147 - 165
  • [32] Missing data: Discussion points from the PSI missing data expert group
    Burzykowski, Tomasz
    Carpenter, James
    Coens, Corneel
    Evans, Daniel
    France, Lesley
    Kenward, Mike
    Lane, Peter
    Matcham, James
    Morgan, David
    Phillips, Alan
    Roger, James
    Sullivan, Brian
    White, Ian
    Yu, Ly-Mee
    PHARMACEUTICAL STATISTICS, 2010, 9 (04) : 288 - 297
  • [33] Subsample ignorable likelihood for regression analysis with missing data
    Little, Roderick J.
    Zhang, Nanhua
    JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES C-APPLIED STATISTICS, 2011, 60 : 591 - 605
  • [34] A Likelihood-Based Approach for Missing Genotype Data
    D'Angelo, Gina M.
    Kamboh, M. Ilyas
    Feingold, Eleanor
    HUMAN HEREDITY, 2010, 69 (03) : 171 - 183
  • [35] Regularized extreme learning machine for regression with missing data
    Yu, Qi
    Miche, Yoan
    Eirola, Emil
    van Heeswijk, Mark
    Severin, Eric
    Lendasse, Amaury
    NEUROCOMPUTING, 2013, 102 : 45 - 51
  • [36] Methods for addressing missing data in psychiatric and developmental research
    Croy, CD
    Novins, DK
    JOURNAL OF THE AMERICAN ACADEMY OF CHILD AND ADOLESCENT PSYCHIATRY, 2005, 44 (12) : 1230 - 1240
  • [37] Missing data methods for arbitrary missingness with small samples
    McNeish, Daniel
    JOURNAL OF APPLIED STATISTICS, 2017, 44 (01) : 24 - 39
  • [38] Factorization of posteriors and partial imputation algorithm for graphical models with missing data
    Geng, Z
    Li, KC
    STATISTICS & PROBABILITY LETTERS, 2003, 64 (04) : 369 - 379
  • [39] Why Missing Data Matter in the Longitudinal Study of Adolescent Development: Using the 4-H Study to Understand the Uses of Different Missing Data Methods
    Helena Jeličić
    Erin Phelps
    Richard M. Lerner
    Journal of Youth and Adolescence, 2010, 39 : 816 - 835
  • [40] Why Missing Data Matter in the Longitudinal Study of Adolescent Development: Using the 4-H Study to Understand the Uses of Different Missing Data Methods
    Jelicic, Helena
    Phelps, Erin
    Lerner, Richard M.
    JOURNAL OF YOUTH AND ADOLESCENCE, 2010, 39 (07) : 816 - 835