An Automated Approach to Causal Inference in Discrete Settings

被引:5
作者
Duarte, Guilherme [1 ]
Finkelstein, Noam [2 ]
Knox, Dean [1 ]
Mummolo, Jonathan [3 ,4 ]
Shpitser, Ilya [2 ]
机构
[1] Univ Penn, Wharton Sch, Operat Informat & Decis Dept, Philadelphia, PA 19104 USA
[2] Johns Hopkins Univ, Whiting Sch Engn, Dept Comp Sci, Baltimore, MD USA
[3] Princeton Univ, Dept Polit, Princeton, NJ USA
[4] Princeton Univ, Sch Publ & Int Affairs, Princeton, NJ USA
基金
美国国家卫生研究院; 美国国家科学基金会;
关键词
Causal inference; Constrained optimization; Partial identification; Linear programming; Polynomial programming; PRINCIPAL STRATIFICATION; BOUNDS; MARGINS;
D O I
10.1080/01621459.2023.2216909
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Applied research conditions often make it impossible to point-identify causal estimands without untenable assumptions. Partial identification-bounds on the range of possible solutions-is a principled alternative, but the difficulty of deriving bounds in idiosyncratic settings has restricted its application. We present a general, automated numerical approach to causal inference in discrete settings. We show causal questions with discrete data reduce to polynomial programming problems, then present an algorithm to automatically bound causal effects using efficient dual relaxation and spatial branch-and-bound techniques. The user declares an estimand, states assumptions, and provides data-however incomplete or mismeasured. The algorithm then searches over admissible data-generating processes and outputs the most precise possible range consistent with available information-that is, sharp bounds-including a point-identified solution if one exists. Because this search can be computationally intensive, our procedure reports and continually refines non-sharp ranges guaranteed to contain the truth at all times, even when the algorithm is not run to completion. Moreover, it offers an epsilon-sharpness guarantee, characterizing the worst-case looseness of the incomplete bounds. These techniques are implemented in our Python package, autobounds. Analytically validated simulations show the method accommodates classic obstacles-including confounding, selection, measurement error, noncompliance, and nonresponse. Supplementary materials for this article are available online.
引用
收藏
页码:1778 / 1793
页数:16
相关论文
共 42 条
  • [11] Graphs for Margins of Bayesian Networks
    Evans, Robin J.
    [J]. SCANDINAVIAN JOURNAL OF STATISTICS, 2016, 43 (03) : 625 - 648
  • [12] Finkelstein N., 2021, WORKING PAPER
  • [13] Finkelstein N., 2020, ARXIV
  • [14] Principal stratification in causal inference
    Frangakis, CE
    Rubin, DB
    [J]. BIOMETRICS, 2002, 58 (01) : 21 - 29
  • [15] Causal Bounds for Outcome-Dependent Sampling in Observational Studies
    Gabriel, Erin E.
    Sachs, Michael C.
    Sjolander, Arvid
    [J]. JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2022, 117 (538) : 939 - 950
  • [16] Geiger D, 1999, UNCERTAINTY IN ARTIFICIAL INTELLIGENCE, PROCEEDINGS, P226
  • [17] IDENTIFIABILITY, EXCHANGEABILITY, AND EPIDEMIOLOGIC CONFOUNDING
    GREENLAND, S
    ROBINS, JM
    [J]. INTERNATIONAL JOURNAL OF EPIDEMIOLOGY, 1986, 15 (03) : 413 - 419
  • [18] An introduction to variational methods for graphical models
    Jordan, MI
    Ghahramani, Z
    Jaakkola, TS
    Saul, LK
    [J]. MACHINE LEARNING, 1999, 37 (02) : 183 - 233
  • [19] Survivor-Complier Effects in the Presence of Selection on Treatment, With Application to a Study of Prompt ICU Admission
    Kennedy, Edward H.
    Harris, Steve
    Keele, Luke J.
    [J]. JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2019, 114 (525) : 93 - 104
  • [20] Administrative Records Mask Racially Biased Policing
    Knox, Dean
    Lowe, Will
    Mummolo, Jonathan
    [J]. AMERICAN POLITICAL SCIENCE REVIEW, 2020, 114 (03) : 619 - 637