Graphical Representation of Missing Data Problems

被引:46
|
作者
Thoemmes, Felix [1 ]
Mohan, Karthika [2 ]
机构
[1] Cornell Univ, Ithaca, NY 14853 USA
[2] Univ Calif Los Angeles, Los Angeles, CA 90024 USA
基金
美国国家科学基金会;
关键词
auxiliary variables; full information; graphical models; maximum likelihood; missing data; multiple imputation; MULTIPLE IMPUTATION; CAUSAL;
D O I
10.1080/10705511.2014.937378
中图分类号
O1 [数学];
学科分类号
0701 ; 070101 ;
摘要
Rubin's classic missingness mechanisms are central to handling missing data and minimizing biases that can arise due to missingness. However, the formulaic expressions that posit certain independencies among missing and observed data are difficult to grasp. As a result, applied researchers often rely on informal translations of these assumptions. We present a graphical representation of missing data mechanism, formalized in Mohan, Pearl, and Tian (2013). We show that graphical models provide a tool for comprehending, encoding, and communicating assumptions about the missingness process. Furthermore, we demonstrate on several examples how graph-theoretical criteria can determine if biases due to missing data might emerge in some estimates of interests and which auxiliary variables are needed to control for such biases, given assumptions about the missingness process.
引用
收藏
页码:631 / 642
页数:12
相关论文
共 50 条
  • [1] Bayesian methods for dealing with missing data problems
    Ma, Zhihua
    Chen, Guanghui
    JOURNAL OF THE KOREAN STATISTICAL SOCIETY, 2018, 47 (03) : 297 - 313
  • [2] Graphical Models for Processing Missing Data
    Mohan, Karthika
    Pearl, Judea
    JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2021, 116 (534) : 1023 - 1037
  • [4] Handling of missing data in psychological research:: Problems and solutions
    Luedtke, Oliver
    Robitzsch, Alexander
    Trautwein, Ulrich
    Koeller, Olaf
    PSYCHOLOGISCHE RUNDSCHAU, 2007, 58 (02) : 103 - 117
  • [5] Handling Missing Data with Graph Representation Learning
    You, Jiaxuan
    Ma, Xiaobai
    Ding, Daisy Yi
    Kochenderfer, Mykel
    Leskovec, Jure
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020, 2020, 33
  • [6] Using causal diagrams to guide analysis in missing data problems
    Daniel, Rhian M.
    Kenward, Michael G.
    Cousens, Simon N.
    De Stavola, Bianca L.
    STATISTICAL METHODS IN MEDICAL RESEARCH, 2012, 21 (03) : 243 - 256
  • [7] Mixed graphical models with missing data and the partial imputation EM algorithm
    Geng, Z
    Wan, K
    Tao, F
    SCANDINAVIAN JOURNAL OF STATISTICS, 2000, 27 (03) : 433 - 444
  • [8] Principled Missing Data Treatments
    Lang, Kyle M.
    Little, Todd D.
    PREVENTION SCIENCE, 2018, 19 (03) : 284 - 294
  • [9] Auxiliary Variables in Multiple Imputation When Data Are Missing Not at Random
    Mustillo, Sarah
    Kwon, Soyoung
    JOURNAL OF MATHEMATICAL SOCIOLOGY, 2015, 39 (02) : 73 - 91
  • [10] High-dimensional, outcome-dependent missing data problems: Models for the human KIR loci
    van der Burg, Lars Leonardus Joannes
    Putter, Hein
    Baldauf, Henning
    Sauter, Juergen
    Schetelig, Johannes
    de Wreede, Liesbeth C.
    Boehringer, Stefan
    STATISTICAL METHODS IN MEDICAL RESEARCH, 2025, : 440 - 456