Graphical Representation of Missing Data Problems

被引:46
|
作者
Thoemmes, Felix [1 ]
Mohan, Karthika [2 ]
机构
[1] Cornell Univ, Ithaca, NY 14853 USA
[2] Univ Calif Los Angeles, Los Angeles, CA 90024 USA
基金
美国国家科学基金会;
关键词
auxiliary variables; full information; graphical models; maximum likelihood; missing data; multiple imputation; MULTIPLE IMPUTATION; CAUSAL;
D O I
10.1080/10705511.2014.937378
中图分类号
O1 [数学];
学科分类号
0701 ; 070101 ;
摘要
Rubin's classic missingness mechanisms are central to handling missing data and minimizing biases that can arise due to missingness. However, the formulaic expressions that posit certain independencies among missing and observed data are difficult to grasp. As a result, applied researchers often rely on informal translations of these assumptions. We present a graphical representation of missing data mechanism, formalized in Mohan, Pearl, and Tian (2013). We show that graphical models provide a tool for comprehending, encoding, and communicating assumptions about the missingness process. Furthermore, we demonstrate on several examples how graph-theoretical criteria can determine if biases due to missing data might emerge in some estimates of interests and which auxiliary variables are needed to control for such biases, given assumptions about the missingness process.
引用
收藏
页码:631 / 642
页数:12
相关论文
共 50 条
  • [41] CDRM: Causal disentangled representation learning for missing data
    Chen, Mingjie
    Wang, Hongcheng
    Wang, Ruxin
    Peng, Yuzhong
    Zhang, Hao
    KNOWLEDGE-BASED SYSTEMS, 2024, 299
  • [42] Data variability in the imputation quality of missing data
    Stochero, Elisandra Lucia Moro
    Lucio, Alessandro Dal'Col
    Jacobi, Luciane Flores
    ACTA SCIENTIARUM-AGRONOMY, 2024, 46
  • [43] On weighting approaches for missing data
    Li, Lingling
    Shen, Changyu
    Li, Xiaochun
    Robins, James M.
    STATISTICAL METHODS IN MEDICAL RESEARCH, 2013, 22 (01) : 14 - 30
  • [44] Dealing with deficient and missing data
    Dohoo, Ian R.
    PREVENTIVE VETERINARY MEDICINE, 2015, 122 (1-2) : 221 - 228
  • [45] Methods for clustered encouragement design studies with noncompliance and missing data
    Taylor, Leslie
    Zhou, Xiao-Hua
    BIOSTATISTICS, 2011, 12 (02) : 313 - 326
  • [46] Multiple imputation for missing data
    Patrician, PA
    RESEARCH IN NURSING & HEALTH, 2002, 25 (01) : 76 - 84
  • [47] Missing Data Imputation: A Survey
    Kelkar, Bhagyashri Abhay
    INTERNATIONAL JOURNAL OF DECISION SUPPORT SYSTEM TECHNOLOGY, 2022, 14 (01)
  • [48] Missing data matter: an empirical evaluation of the impacts of missing EHR data in comparative effectiveness research
    Zhou, Yizhao
    Shi, Jiasheng
    Stein, Ronen
    Liu, Xiaokang
    Baldassano, Robert N.
    Forrest, Christopher B.
    Chen, Yong
    Huang, Jing
    JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2023, 30 (07) : 1246 - 1256
  • [49] Missing data in cross-sectional networks - An extensive comparison of missing data treatment methods
    Krause, Robert W.
    Huisman, Mark
    Steglich, Christian
    Snijders, Tom
    SOCIAL NETWORKS, 2020, 62 : 99 - 112
  • [50] Rank Estimation in Missing Data Matrix Problems
    Carme Julià
    Angel D. Sappa
    Felipe Lumbreras
    Joan Serrat
    Antonio López
    Journal of Mathematical Imaging and Vision, 2011, 39 : 140 - 160