Ranking the importance of demographic, socioeconomic, and underlying health factors on US COVID-19 deaths: A geographical random forest approach

被引:90
作者
Grekousis, George [1 ,2 ,3 ]
Feng, Zhixin [1 ]
Marakakis, Ioannis [4 ]
Lu, Yi [5 ,6 ]
Wang, Ruoyu [7 ]
机构
[1] Sun Yat Sen Univ, Sch Geog & Planning, Dept Urban & Reg Planning, Xingang Xi Rd, Guangzhou 510275, Peoples R China
[2] Guangdong Key Lab Urbanizat & Geosimulat, Guangzhou, Peoples R China
[3] Guangdong Prov Engn Res Ctr Publ Secur & Disaster, Guangzhou, Peoples R China
[4] Natl Tech Univ Athens, Dept Geog & Reg Planning, Sch Rural & Surveying Engn, Zografou Campus, Athens 15780, Greece
[5] City Univ Hong Kong, Dept Architecture & Civil Engn, Hong Kong, Peoples R China
[6] City Univ Hong Kong, Shenzhen Res Inst, Shenzhen, Peoples R China
[7] Univ Edinburgh, Inst Geog, Sch Geosci, Edinburgh, Scotland
关键词
spatial Machine learning; Random forest; COVID-19; USA; CLASSIFICATION; MORTALITY; WILL;
D O I
10.1016/j.healthplace.2022.102744
中图分类号
R1 [预防医学、卫生学];
学科分类号
1004 ; 120402 ;
摘要
A growing number of studies show that the uneven spatial distribution of COVID-19 deaths is related to demographic and socioeconomic disparities across space. However, most studies fail to assess the relative importance of each factor to COVID-19 death rate and, more importantly, how this importance varies spatially. Here, we assess the variables that are more important locally using Geographical Random Forest (GRF), a local nonlinear regression method. Through GRF, we estimated the non-linear relationships between the COVID-19 death rate and 29 socioeconomic and health-related factors during the first year of the pandemic in the USA (county level). GRF outputs are compared to global (Random Forest and OLS) and local (Geographically Weighted Regression) models. Results show that GRF outperforms all models and that the importance of variables highly varies by location. For example, lack of health insurance is the most important factor in one-third (34.86%) of the US counties. Most of these counties are (concentrated mainly in the Midwest region and South region). On the other hand, no leisure-time physical activity is the most important primary factor for 19.86% of the US counties. These counties are found in California, Oregon, Washington, and parts of the South region. Understanding the location-based characteristics and spatial patterns of socioeconomic and health factors linked to COVID-19 deaths is paramount for policy designing and decision making. In this way, interventions can be designed and implemented based on the most important factors locally, avoiding thus general guidelines addressed for the entire nation.
引用
收藏
页数:12
相关论文
共 83 条
[1]   Why inequality could spread COVID-19 [J].
Ahmed, Faheem ;
Ahmed, Na'eem ;
Pissarides, Christopher ;
Stiglitz, Joseph .
LANCET PUBLIC HEALTH, 2020, 5 (05) :E240-E240
[2]  
Ali J., 2012, INT J COMPUT SCI ISS, V9, P272
[3]   POINTS OF SIGNIFICANCE Ensemble methods: bagging and random forests [J].
Altman, Naomi ;
Krzywinski, Martin .
NATURE METHODS, 2017, 14 (10) :933-934
[4]   Analyzing the spatial determinants of local Covid-19 transmission in the United States [J].
Andersen, Lauren M. ;
Harden, Stella R. ;
Sugg, Margaret M. ;
Runkle, Jennifer D. ;
Lundquist, Taylor E. .
SCIENCE OF THE TOTAL ENVIRONMENT, 2021, 754
[5]   Could Historical Mortality Data Predict Mortality Due to Unexpected Events? [J].
Andreopoulos, Panagiotis ;
Kalogeropoulos, Kleomenis ;
Tragaki, Alexandra ;
Stathopoulos, Nikolaos .
ISPRS INTERNATIONAL JOURNAL OF GEO-INFORMATION, 2021, 10 (05)
[6]   Ensemble modelling in descriptive epidemiology: burden of disease estimation [J].
Bannick, Marlena S. ;
McGaughey, Madeline ;
Flaxman, Abraham D. .
INTERNATIONAL JOURNAL OF EPIDEMIOLOGY, 2020, 49 (06) :2065-2073
[7]   The Potential Health Care Costs And Resource Use Associated With COVID-19 In The United States [J].
Bartsch, Sarah M. ;
Ferguson, Marie C. ;
McKinnell, James A. ;
O'Shea, Kelly J. ;
Wedlock, Patrick T. ;
Siegmund, Sheryl S. ;
Lee, Bruce Y. .
HEALTH AFFAIRS, 2020, 39 (06) :927-935
[8]   The Coronavirus crisis: What will the post-pandemic city look like? [J].
Batty, Michael .
ENVIRONMENT AND PLANNING B-URBAN ANALYTICS AND CITY SCIENCE, 2020, 47 (04) :547-552
[9]  
Baum C.F., 2020, BOSTON COLL WORKING, P2020
[10]   Random forests [J].
Breiman, L .
MACHINE LEARNING, 2001, 45 (01) :5-32