A structured comparison of causal machine learning methods to assess heterogeneous treatment effects in spatial data

被引:11
作者
Credit, Kevin [1 ]
Lehnert, Matthew [2 ]
机构
[1] Maynooth Univ, Natl Ctr Geocomputat, Maynooth, Co Kildare, Ireland
[2] Satelytics, Perrysburg, OH USA
关键词
Causal forest; Heterogeneous treatment effects; Machine learning; Causal inference; Spatial; CO2; emissions; Transit; INFERENCE; LAND; LESSONS; MODELS; CO2;
D O I
10.1007/s10109-023-00413-0
中图分类号
P9 [自然地理学]; K9 [地理];
学科分类号
0705 ; 070501 ;
摘要
The development of the "causal" forest by Wager and Athey (J Am Stat Assoc 113(523): 1228-1242, 2018) represents a significant advance in the area of explanatory/causal machine learning. However, this approach has not yet been widely applied to geographically referenced data, which present some unique issues: the random split of the test and training sets in the typical causal forest design fractures the spatial fabric of geographic data. To help solve this issue, we use a simulated dataset with known properties for average treatment effects and conditional average treatment effects to compare the performance of CF models across different definitions of the test/train split. We also develop a new "spatial" T-learner that can be implemented using predictive methods like random forest to provide estimates of heterogeneous treatment effects across all units. Our results show that all of the machine learning models outperform traditional ordinary least squares regression at identifying the true average treatment effect, but are not significantly different from one another. We then apply the preferred causal forest model in the context of analysing the treatment effect of the construction of the Valley Metro light rail (tram) system on on-road CO2 emissions per capita at the block group level in Maricopa County, Arizona, and find that the neighbourhoods most likely to benefit from treatment are those with higher pre-treatment proportions of transit and pedestrian commuting and lower proportions of auto commuting.
引用
收藏
页码:483 / 510
页数:28
相关论文
共 70 条
  • [1] Anselin L., 2014, MODERN SPATIAL ECONO
  • [2] Athey S., 2019, Observ Stud, V5, P37, DOI DOI 10.1353/OBS.2019.0001
  • [3] GENERALIZED RANDOM FORESTS
    Athey, Susan
    Tibshirani, Julie
    Wager, Stefan
    [J]. ANNALS OF STATISTICS, 2019, 47 (02) : 1148 - 1178
  • [4] Recursive partitioning for heterogeneous causal effects
    Athey, Susan
    Imbens, Guido
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2016, 113 (27) : 7353 - 7360
  • [5] Bailey L., 2008, The broader connection between public transportation, energy conservation and greenhouse gas reduction
  • [6] Baylis K, 2015, 2015 AAEA WAEA JOINT
  • [7] Do high income households reduce driving more when living near rail transit?
    Boarnet, Marlon G.
    Bostic, Raphael W.
    Rodnyansky, Seva
    Burinskiy, Evgeny
    Eisenlohr, Andrew
    Jamme, Hue-Tam
    Santiago-Bartolomei, Raul
    [J]. TRANSPORTATION RESEARCH PART D-TRANSPORT AND ENVIRONMENT, 2020, 80
  • [8] Systemic And Structural Racism: Definitions, Examples, Health Damages, And Approaches To Dismantling
    Braveman, Paula A.
    Arkin, Elaine
    Proctor, Dwayne
    Kauh, Tina
    Holm, Nicole
    [J]. HEALTH AFFAIRS, 2022, 41 (02) : 171 - 178
  • [9] Butts K., 2021, ARXIV
  • [10] Calthorpe P, 1993, The next American metropolis: ecology, community, and the American dream