Utilizing big data without domain knowledge impacts public health decision-making

被引:0
作者
Zhang, Miao [1 ]
Rahman, Salman [1 ]
Mhasawade, Vishwali [1 ]
Chunara, Rumi [1 ]
机构
[1] Tandon Sch Engn, Dept Comp Sci & Engn, Brooklyn, NY 11201 USA
关键词
obesity; diabetes; data; Google Street View;
D O I
10.1073/pnas.2402387121
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
New data sources and AI methods for extracting information are increasingly abundant and relevant to decision-making across societal applications. A notable example is street view imagery, available in over 100 countries, and purported to inform built environment interventions (e.g., adding sidewalks) for community health outcomes. However, biases can arise when decision-making does not account for data robustness or relies on spurious correlations. To investigate this risk, we analyzed 2.02 million Google Street View (GSV) images alongside health, demographic, and socioeconomic data from New York City. Findings demonstrate robustness challenges; built environment characteristics inferred from GSV labels at the intracity level often do not align with ground truth. Moreover, as average individual-level behavior of physical inactivity significantly mediates the impact of built environment features by census tract, intervention on features measured by GSV would be misestimated without proper model specification and consideration of this mediation mechanism. Using a causal framework accounting for these mediators, we determined that intervening by improving 10% of samples in the two lowest tertiles of physical inactivity would lead to a 4.17 (95% CI 3.84-4.55) or 17.2 (95% CI 14.4-21.3) times greater decrease in the prevalence of obesity or diabetes, respectively, compared to the same proportional intervention on the number of crosswalks by census tract. This study highlights critical issues of robustness and model specification in using emergent data sources, showing the data may not measure what is intended, and ignoring mediators can result in biased intervention effect estimates.
引用
收藏
页数:3
相关论文
共 15 条
[1]  
Buolamwini Joy, 2018, C FAIRN ACC TRANSP, V81, P1, DOI DOI 10.2147/OTT.S126905
[2]   AI for radiographic COVID-19 detection selects shortcuts over signal [J].
DeGrave, Alex J. ;
Janizek, Joseph D. ;
Lee, Su-In .
NATURE MACHINE INTELLIGENCE, 2021, 3 (07) :610-619
[3]   PLACES: Local Data for Better Health [J].
Greenlund, Kurt J. ;
Lu, Hua ;
Wang, Yan ;
Matthews, Kevin A. ;
LeClercq, Jennifer M. ;
Lee, Benjamin ;
Carlson, Susan A. ;
Jm, LeClercq .
PREVENTING CHRONIC DISEASE, 2022, 19
[4]   Obesity and Physical Activity [J].
Jakicic, John M. ;
Davis, Kelliann K. .
PSYCHIATRIC CLINICS OF NORTH AMERICA, 2011, 34 (04) :829-+
[5]   Health and the built environment in United States cities: measuring associations using Google Street View-derived indicators of the built environment [J].
Keralis, Jessica M. ;
Javanmardi, Mehran ;
Khanna, Sahil ;
Dwivedi, Pallavi ;
Huang, Dina ;
Tasdizen, Tolga ;
Nguyen, Quynh C. .
BMC PUBLIC HEALTH, 2020, 20 (01)
[6]   Decoding urban landscapes: Google street view and measurement sensitivity [J].
Kim, Jae Hong ;
Lee, Sugie ;
Hipp, John R. ;
Ki, Donghwan .
COMPUTERS ENVIRONMENT AND URBAN SYSTEMS, 2021, 88
[7]   Leisure-Time Running Reduces All-Cause and Cardiovascular Mortality Risk [J].
Lee, Duck-chul ;
Pate, Russell R. ;
Lavie, Carl J. ;
Sui, Xuemei ;
Church, Timothy S. ;
Blair, Steven N. .
JOURNAL OF THE AMERICAN COLLEGE OF CARDIOLOGY, 2014, 64 (05) :472-481
[8]   Google Street View Images as Predictors of Patient Health Outcomes, 2017-2019 [J].
Nguyen, Quynh C. ;
Belnap, Tom ;
Dwivedi, Pallavi ;
Deligani, Amir Hossein Nazem ;
Kumar, Abhinav ;
Li, Dapeng ;
Whitaker, Ross ;
Keralis, Jessica ;
Mane, Heran ;
Yue, Xiaohe ;
Nguyen, Thu T. ;
Tasdizen, Tolga ;
Brunisholz, Kim D. .
BIG DATA AND COGNITIVE COMPUTING, 2022, 6 (01)
[9]   Leveraging 31 Million Google Street View Images to Characterize Built Environments and Examine County Health Outcomes [J].
Nguyen, Quynh C. ;
Keralis, Jessica M. ;
Dwivedi, Pallavi ;
Ng, Amanda E. ;
Javanmardi, Mehran ;
Khanna, Sahil ;
Huang, Yuru ;
Brunisholz, Kimberly D. ;
Kumar, Abhinav ;
Tasdizen, Tolga .
PUBLIC HEALTH REPORTS, 2021, 136 (02) :201-211
[10]   Neighbourhood blue space, health and wellbeing: The mediating role of different types of physical activity [J].
Pasanen, Tytti P. ;
White, Mathew P. ;
Wheeler, Benedict W. ;
Garrett, Joanne K. ;
Elliott, Lewis R. .
ENVIRONMENT INTERNATIONAL, 2019, 131