A scoping review on the use of machine learning in research on social determinants of health: Trends and research prospects

被引:47
作者
Kino, Shiho [1 ,2 ]
Hsu, Yu-Tien [1 ]
Shiba, Koichiro [3 ]
Chien, Yung-Shin [1 ]
Mita, Carol [4 ]
Kawachi, Ichiro [1 ]
Daoud, Adel [5 ,6 ,7 ,8 ]
机构
[1] Harvard TH Chan Sch Publ Hlth, Dept Social & Behav Sci, Boston, MA USA
[2] Kyoto Univ, Dept Social Epidemiol, Kyoto, Japan
[3] Harvard TH Chan Sch Publ Hlth, Dept Epidemiol, Boston, MA USA
[4] Harvard Univ, Countway Lib Med, Boston, MA 02115 USA
[5] Harvard Univ, Harvard TH Chan Sch Publ Hlth, Ctr Populat & Dev Studies, Boston, MA 02115 USA
[6] Univ Gothenburg, Dept Sociol & Work Sci, Gothenburg, Sweden
[7] Chalmers Univ Technol, Div Data Sci & Artificial Intelligence, Dept Comp Sci & Engn, Gothenburg, Sweden
[8] Linkoping Univ, Inst Analyt Sociol, Linkoping, Sweden
关键词
Review; Machine learning; Social determinants of health; QUALITY-OF-LIFE; SOCIOECONOMIC-STATUS; DISEASE RISK; CAUSAL INFERENCE; TREE ANALYSIS; PREDICTORS; CLASSIFICATION; INDICATORS; NETWORKS; OPPORTUNITIES;
D O I
10.1016/j.ssmph.2021.100836
中图分类号
R1 [预防医学、卫生学];
学科分类号
1004 ; 120402 ;
摘要
Background: Machine learning (ML) has spread rapidly from computer science to several disciplines. Given the predictive capacity of ML, it offers new opportunities for health, behavioral, and social scientists. However, it remains unclear how and to what extent ML is being used in studies of social determinants of health (SDH). Methods: Using four search engines, we conducted a scoping review of studies that used ML to study SDH (published before May 1, 2020). Two independent reviewers analyzed the relevant studies. For each study, we identified the research questions, Results, data, and algorithms. We synthesized our findings in a narrative report. Results: Of the initial 8097 hits, we identified 82 relevant studies. The number of publications has risen during the past decade. More than half of the studies (n = 46) used US data. About 80% (n = 66) utilized surveys, and 70% (n = 57) employed ML for common prediction tasks. Although the number of studies in ML and SDH is growing rapidly, only a few studies used ML to improve causal inference, curate data, or identify social bias in predictions (i.e., algorithmic fairness). Conclusions: While ML equips researchers with new ways to measure health outcomes and their determinants from non-conventional sources such as text, audio, and image data, most studies still rely on traditional surveys. Although there are no guarantees that ML will lead to better social epidemiological research, the potential for innovation in SDH research is evident as a result of harnessing the predictive power of ML for causality, data curation, or algorithmic fairness.
引用
收藏
页数:20
相关论文
共 128 条
[11]  
Bastaminia A, 2017, JAMBA-J DISASTER RIS, V9
[12]   A Prediction Model for Uncontrolled Type 2 Diabetes Mellitus Incorporating Area-level Social Determinants of Health [J].
Basu, Sanjay ;
Narayanaswamy, Rajiv .
MEDICAL CARE, 2019, 57 (08) :592-600
[13]   Disability and Chronic Disease Among Older Adults in India: Detecting Vulnerable Populations Through the WHO SAGE Study [J].
Basu, Sanjay ;
King, Abby C. .
AMERICAN JOURNAL OF EPIDEMIOLOGY, 2013, 178 (11) :1620-1628
[14]  
Bellavia A., 2020, EPIDEMIOLOGIC METHOD, V1
[15]   The impact of social housing on mental health: longitudinal analyses using marginal structural models and machine learning-generated weights [J].
Bentley, Rebecca ;
Baker, Emma ;
Simons, Koen ;
Simpson, Julie A. ;
Blakely, Tony .
INTERNATIONAL JOURNAL OF EPIDEMIOLOGY, 2018, 47 (05) :1414-1422
[16]   Association between access to social service resources and cardiometabolic risk factors: a machine learning and multilevel modeling analysis [J].
Berkowitz, Seth A. ;
Basu, Sanjay ;
Venkataramani, Atheendar ;
Reznor, Gally ;
Fleegler, Eric W. ;
Atlas, Steven J. .
BMJ OPEN, 2019, 9 (03)
[17]   Value of Neighborhood Socioeconomic Status in Predicting Risk of Outcomes in Studies That Use Electronic Health Record Data [J].
Bhavsar, Nrupen A. ;
Gao, Aijing ;
Phelan, Matthew ;
Pagidipati, Neha J. ;
Goldstein, Benjamin A. .
JAMA NETWORK OPEN, 2018, 1 (05) :e182716
[18]   What is Machine Learning? A Primer for the Epidemiologist [J].
Bi, Qifang ;
Goodman, Katherine E. ;
Kaminsky, Joshua ;
Lessler, Justin .
AMERICAN JOURNAL OF EPIDEMIOLOGY, 2019, 188 (12) :2222-2239
[19]   Machine learning as a strategy to account for dietary synergy: an illustration based on dietary intake and adverse pregnancy outcomes [J].
Bodnar, Lisa M. ;
Cartus, Abigail R. ;
Kirkpatrick, Sharon, I ;
Himes, Katherine P. ;
Kennedy, Edward H. ;
Simhan, Hyagriv N. ;
Grobman, William A. ;
Duffy, Jennifer Y. ;
Silver, Robert M. ;
Parry, Samuel ;
Naimi, Ashley, I .
AMERICAN JOURNAL OF CLINICAL NUTRITION, 2020, 111 (06) :1235-1243
[20]  
Boerstler H, 1991, J Ment Health Adm, V18, P27, DOI 10.1007/BF02521131