Ranking sociodemographic, health behavior, prevention, and environmental factors in predicting neighborhood cardiovascular health: A Bayesian machine learning approach

被引:19
作者
Hu, Liangyuan [1 ]
Liu, Bian [1 ]
Li, Yan [1 ,2 ]
机构
[1] Icahn Sch Med Mt Sinai, Dept Populat Hlth Sci & Policy, New York, NY 10029 USA
[2] Icahn Sch Med Mt Sinai, Dept Obstet Gynecol & Reprod Sci, New York, NY 10029 USA
关键词
Health behaviors; Prevention; Cardiovascular health; Neighborhood; Machine learning; HEART; REGRESSION;
D O I
10.1016/j.ypmed.2020.106240
中图分类号
R1 [预防医学、卫生学];
学科分类号
1004 ; 120402 ;
摘要
Cardiovascular disease is the leading cause of death in the United States. While abundant research has been conducted to identify risk factors for cardiovascular disease at the individual level, less is known about factors that may influence population cardiovascular health outcomes at the neighborhood level. The purpose of this study is to use Bayesian Additive Regression Trees, a state-of-the-art machine learning approach, to rank sociodemographic, health behavior, prevention, and environmental factors in predicting neighborhood cardiovascular health. We created a new neighborhood health dataset by combining three datasets at the census tract level, including the 500 Cities Data from the Centers for Disease Control and Prevention, the 2011-2015 American Community Survey 5-Year Estimates from the Census Bureau, and the 2015-2016 Environmental Justice Screening database from the Environmental Protection Agency in the United States. Results showed that neighborhood behavioral factors such as the proportions of people who are obese, do not have leisure-time physical activity, and have binge drinking emerged as top five predictors for most of the neighborhood cardiovascular health outcomes. Findings from this study would allow public health researchers and policymakers to prioritize community-based interventions and efficiently use limited resources to improve neighborhood cardiovascular health.
引用
收藏
页数:4
相关论文
共 17 条
[1]   TyG-er: An ensemble Regression Forest approach for identification of clinical factors related to insulin resistance condition using Electronic Health Records [J].
Bernardini, Michele ;
Morettini, Micaela ;
Romeo, Luca ;
Frontoni, Emanuele ;
Burattini, Laura .
COMPUTERS IN BIOLOGY AND MEDICINE, 2019, 112
[2]   VARIABLE SELECTION FOR BART: AN APPLICATION TO GENE REGULATION [J].
Bleich, Justin ;
Kapelner, Adam ;
George, Edward I. ;
Jensen, Shane T. .
ANNALS OF APPLIED STATISTICS, 2014, 8 (03) :1750-1781
[3]  
Centers for Disease Control and Prevention (CDC), 2017, National Vital Statistics Reports: Deaths: Final Data for 2017
[4]   BART: BAYESIAN ADDITIVE REGRESSION TREES [J].
Chipman, Hugh A. ;
George, Edward I. ;
McCulloch, Robert E. .
ANNALS OF APPLIED STATISTICS, 2010, 4 (01) :266-298
[5]  
EPA, 2016, ENV JUST MAPP SCREEN
[6]   Bayesian Nonparametric Modeling for Causal Inference [J].
Hill, Jennifer L. .
JOURNAL OF COMPUTATIONAL AND GRAPHICAL STATISTICS, 2011, 20 (01) :217-240
[7]   Estimation of causal effects of multiple treatments in observational studies with a binary outcome [J].
Hu, Liangyuan ;
Gu, Chenyang ;
Lopez, Michael ;
Ji, Jiayi ;
Wisnivesky, Juan .
STATISTICAL METHODS IN MEDICAL RESEARCH, 2020, 29 (11) :3218-3234
[8]  
Kapelner A, 2016, J STAT SOFTW, V70, P1
[9]  
Kumar NK, 2020, INT CONF ADVAN COMPU, P15, DOI [10.1109/ICACCS48705.2020.9074183, 10.1109/icaccs48705.2020.9074183]
[10]   Unhealthy Behaviors, Prevention Measures, and Neighborhood Cardiovascular Health: A Machine Learning Approach [J].
Li, Yan ;
Liu, Shelley H. ;
Niu, Li ;
Liu, Bian .
JOURNAL OF PUBLIC HEALTH MANAGEMENT AND PRACTICE, 2019, 25 (01) :E25-E28