AD-BERT: Using pre-trained language model to predict the progression from mild cognitive impairment to Alzheimer's disease

被引:22
作者
Mao, Chengsheng [1 ]
Xu, Jie [2 ,3 ]
Rasmussen, Luke [1 ]
Li, Yikuan [1 ]
Adekkanattu, Prakash [3 ]
Pacheco, Jennifer [1 ]
Bonakdarpour, Borna [4 ]
Vassar, Robert [4 ]
Shen, Li [5 ]
Jiang, Guoqian [6 ]
Wang, Fei [3 ]
Pathak, Jyotishman [3 ]
Luo, Yuan [1 ,7 ]
机构
[1] Northwestern Univ, Feinberg Sch Med, Dept Prevent Med, Chicago, IL USA
[2] Univ Florida, Dept Hlth Outcomes & Biomed Informat, Gainesville, FL USA
[3] Weill Cornell Med, New York, NY USA
[4] Northwestern Univ, Feinberg Sch Med, Dept Neurol, Chicago, IL USA
[5] Univ Penn, Dept Biostat Epidemiol & Informat, Philadelphia, PA USA
[6] Mayo Clin, Rochester, MN USA
[7] 750 N Lake Shore Dr,11th Floor, Chicago, IL 60611 USA
关键词
Alzheimer 's disease; Mild cognitive impairment; Pre-trained language model; Electronic health records; ASSOCIATION WORKGROUPS; DIAGNOSTIC GUIDELINES; NATIONAL INSTITUTE; RECOMMENDATIONS; DEMENTIA; RISK;
D O I
10.1016/j.jbi.2023.104442
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Objective: We develop a deep learning framework based on the pre-trained Bidirectional Encoder Representations from Transformers (BERT) model using unstructured clinical notes from electronic health records (EHRs) to predict the risk of disease progression from Mild Cognitive Impairment (MCI) to Alzheimer's Disease (AD). Methods: We identified 3657 patients diagnosed with MCI together with their progress notes from Northwestern Medicine Enterprise Data Warehouse (NMEDW) between 2000 and 2020. The progress notes no later than the first MCI diagnosis were used for the prediction. We first preprocessed the notes by deidentification, cleaning and splitting into sections, and then pre-trained a BERT model for AD (named AD-BERT) based on the publicly available Bio+Clinical BERT on the preprocessed notes. All sections of a patient were embedded into a vector representation by AD-BERT and then combined by global MaxPooling and a fully connected network to compute the probability of MCI-to-AD progression. For validation, we conducted a similar set of experiments on 2563 MCI patients identified at Weill Cornell Medicine (WCM) during the same timeframe.Results: Compared with the 7 baseline models, the AD-BERT model achieved the best performance on both datasets, with Area Under receiver operating characteristic Curve (AUC) of 0.849 and F1 score of 0.440 on NMEDW dataset, and AUC of 0.883 and F1 score of 0.680 on WCM dataset.Conclusion: The use of EHRs for AD-related research is promising, and AD-BERT shows superior predictive performance in modeling MCI-to-AD progression prediction. Our study demonstrates the utility of pre-trained language models and clinical notes in predicting MCI-to-AD progression, which could have important implica-tions for improving early detection and intervention for AD.
引用
收藏
页数:8
相关论文
共 70 条
[1]   Predicting dementia from spontaneous speech using large language models [J].
Agbavor, Felix ;
Liang, Hualou .
PLOS DIGITAL HEALTH, 2022, 1 (12)
[2]   The diagnosis of mild cognitive impairment due to Alzheimer's disease: Recommendations from the National Institute on Aging-Alzheimer's Association workgroups on diagnostic guidelines for Alzheimer's disease [J].
Albert, Marilyn S. ;
DeKosky, Steven T. ;
Dickson, Dennis ;
Dubois, Bruno ;
Feldman, Howard H. ;
Fox, Nick C. ;
Gamst, Anthony ;
Holtzman, David M. ;
Jagust, William J. ;
Petersen, Ronald C. ;
Snyder, Peter J. ;
Carrillo, Maria C. ;
Thies, Bill ;
Phelps, Creighton H. .
ALZHEIMERS & DEMENTIA, 2011, 7 (03) :270-279
[3]  
Alsentzer E., 2019, arXiv
[4]   2021 Alzheimer's disease facts and figures [J].
不详 .
ALZHEIMERS & DEMENTIA, 2021, 17 (03) :327-406
[5]   Deep Learning to Predict Hospitalization at Triage: Integration of Structured Data and Unstructured Text [J].
Arnaud, Emilien ;
Elbattah, Mahmoud ;
Gignon, Maxime ;
Dequen, Gilles .
2020 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2020, :4836-4841
[6]  
Bottou Leon, 2012, Neural Networks: Tricks of the Trade. Second Edition: LNCS 7700, P421, DOI 10.1007/978-3-642-35289-8_25
[7]   Information Extraction From Electronic Health Records to Predict Readmission Following Acute Myocardial Infarction: Does Natural Language Processing Using Clinical Notes Improve Prediction of Readmission? [J].
Brown, Jeremiah R. ;
Ricket, Iben M. ;
Reeves, Ruth M. ;
Shah, Rashmee U. ;
Goodrich, Christine A. ;
Gobbel, Glen ;
Stabler, Meagan E. ;
Perkins, Amy M. ;
Minter, Freneka ;
Cox, Kevin C. ;
Dorn, Chad ;
Denton, Jason ;
Bray, Bruce E. ;
Gouripeddi, Ramkiran ;
Higgins, John ;
Chapman, Wendy W. ;
MacKenzie, Todd ;
Matheny, Michael E. .
JOURNAL OF THE AMERICAN HEART ASSOCIATION, 2022, 11 (07)
[8]  
Brown TB, 2020, ADV NEUR IN, V33
[9]   Challenges in adapting existing clinical natural language processing systems to multiple, diverse health care settings [J].
Carrell, David S. ;
Schoen, Robert E. ;
Leffler, Daniel A. ;
Morris, Michele ;
Rose, Sherri ;
Baer, Andrew ;
Crockett, Seth D. ;
Gourevitch, Rebecca A. ;
Dean, Katie M. ;
Mehrotra, Ateev .
JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2017, 24 (05) :986-991
[10]   Emergency department disposition prediction using a deep neural network with integrated clinical narratives and structured data [J].
Chen, Chien-Hua ;
Hsieh, Jer-Guang ;
Cheng, Shu-Ling ;
Lin, Yih-Lon ;
Lin, Po-Hsiang ;
Jeng, Jyh-Horng .
INTERNATIONAL JOURNAL OF MEDICAL INFORMATICS, 2020, 139