Large language models to identify social determinants of health in electronic health records

Cited by: 53

Authors:

Guevara, Marco [1,2]

Chen, Shan [1,2]

Thomas, Spencer [1,2,3]

Chaunzwa, Tafadzwa L. [1,2]

Franco, Idalid [2]

Kann, Benjamin H. [1,2]

Moningi, Shalini [2]

Qian, Jack M. [1,2]

Goldstein, Madeleine [4]

Harper, Susan [4]

Aerts, Hugo J. W. L. [1,2,5,6]

Catalano, Paul J. [7,8]

Savova, Guergana K. [3]

Mak, Raymond H. [1,2]

Bitterman, Danielle S. [1,2]

Affiliations:

[1] Harvard Med Sch, Artificial Intelligence Med AIM Program, Mass Gen Brigham, Boston, MA 02115 USA

[2] Brigham & Womens Hosp, Dana Farber Canc Inst, Dept Radiat Oncol, Boston, MA 02115 USA

[3] Harvard Med Sch, Boston Childrens Hosp, Computat Hlth Informat Program, Boston, MA USA

[4] Dana Farber Canc Inst, Adult Resource Off, Boston, MA USA

[5] Maastricht Univ, Radiol & Nucl Med, GROW, Maastricht, Netherlands

[6] Maastricht Univ, CARIM, Maastricht, Netherlands

[7] Dana Farber Canc Inst, Dept Data Sci, Boston, MA USA

[8] Harvard TH Chan Sch Publ Hlth, Dept Biostat, Boston, MA USA

Source:

NPJ DIGITAL MEDICINE | 2024 / Vol. 7 / Issue 01

Funding:

European Research Council;

Keywords:

ADVERSE CHILDHOOD EXPERIENCES; UNITED-STATES; SUPPORT; MORTALITY; SURVIVAL; WOMEN;

DOI:

10.1038/s41746-023-00970-0

Chinese Library Classification (CLC) number:

R19 [Health care organizations and services (health services administration)];

Subject classification code:

Abstract:

Social determinants of health (SDoH) play a critical role in patient outcomes, yet their documentation is often missing or incomplete in the structured data of electronic health records (EHRs). Large language models (LLMs) could enable high-throughput extraction of SDoH from the EHR to support research and clinical care. However, class imbalance and data limitations present challenges for this sparsely documented yet critical information. Here, we investigated the optimal methods for using LLMs to extract six SDoH categories from narrative text in the EHR: employment, housing, transportation, parental status, relationship, and social support. The best-performing models were fine-tuned Flan-T5 XL for any SDoH mentions (macro-F1 0.71) and Flan-T5 XXL for adverse SDoH mentions (macro-F1 0.70). The effect of adding LLM-generated synthetic data to training varied across models and architectures, but it improved the performance of smaller Flan-T5 models (delta F1 +0.12 to +0.23). Our best fine-tuned models outperformed ChatGPT-family models in the zero- and few-shot settings, except GPT4 with 10-shot prompting for adverse SDoH. Fine-tuned models were less likely than ChatGPT to change their prediction when race/ethnicity and gender descriptors were added to the text, suggesting less algorithmic bias (p < 0.05). Our models identified 93.8% of patients with adverse SDoH, while ICD-10 codes captured 2.0%. These results demonstrate the potential of LLMs in improving real-world evidence on SDoH and assisting in identifying patients who could benefit from resource support.
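The abstract reports macro-F1, a metric that averages per-class F1 scores without weighting by class frequency, so rare SDoH categories count as much as common ones. The following is a minimal sketch of that computation, assuming a simplified single-label setting; the labels and predictions are hypothetical illustrations and are not data or code from the study, whose exact evaluation setup may differ.

```python
from collections import Counter


def macro_f1(y_true, y_pred):
    """Return the unweighted mean of per-class F1 scores."""
    labels = sorted(set(y_true) | set(y_pred))
    tp, fp, fn = Counter(), Counter(), Counter()
    for truth, pred in zip(y_true, y_pred):
        if truth == pred:
            tp[truth] += 1
        else:
            fp[pred] += 1   # predicted class gains a false positive
            fn[truth] += 1  # true class gains a false negative
    scores = []
    for label in labels:
        # F1 = 2*TP / (2*TP + FP + FN); defined as 0 when the denominator is 0
        denom = 2 * tp[label] + fp[label] + fn[label]
        scores.append(2 * tp[label] / denom if denom else 0.0)
    return sum(scores) / len(scores)


# Hypothetical, imbalanced example: most sentences mention no SDoH.
y_true = ["none", "none", "none", "none", "housing", "employment"]
y_pred = ["none", "none", "none", "housing", "housing", "none"]

accuracy = sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)
print(f"accuracy = {accuracy:.3f}")
print(f"macro-F1 = {macro_f1(y_true, y_pred):.3f}")
```

In this toy case the missed "employment" label contributes an F1 of 0 for its class, which pulls macro-F1 well below accuracy. That sensitivity to rare classes is why macro-F1 is a common choice when class imbalance is a concern, as the abstract notes it is here.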


Pages: 14

