Automatic extraction and assessment of lifestyle exposures for Alzheimer's disease using natural language processing

Cited by: 15
Authors
Zhou, Xin [1]
Wang, Yanshan [1]
Sohn, Sunghwan [1]
Therneau, Terry M. [1]
Liu, Hongfang [1]
Knopman, David S. [2]
Affiliations
[1] Mayo Clin, Dept Hlth Sci Res, Rochester, MN 55902 USA
[2] Mayo Clin, Dept Neurol, Rochester, MN USA
Keywords
Alzheimer's disease; Electronic health records; Natural language processing; Lifestyle exposure; CLINICAL INFORMATION; RISK-FACTORS; VITAMIN-D; DEMENTIA; DEFICIENCY; ADHERENCE; SYSTEM; MASS
DOI
10.1016/j.ijmedinf.2019.08.003
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Introduction: Previous biomedical studies identified many lifestyle exposures that could possibly represent risk factors for dementia in general or dementia due to Alzheimer's disease (AD). These lifestyle exposures are mainly mentioned in free-text electronic health records (EHRs). However, automatic extraction and assessment of these exposures using EHRs remains understudied. Methods: A natural language processing (NLP) approach was adopted to extract lifestyle exposures and intervention strategies from the clinical notes of 260 patients with clinical diagnoses of AD dementia and 260 age-matched cognitively unimpaired persons. Statistics of lifestyle exposures were compared between these two groups. The mapping results of the NLP extraction were evaluated by comparing them with data captured independently by clinicians. Results: Thirty of the fifty-five potentially relevant lifestyle exposures were mentioned in our clinical note dataset. Twenty-two dietary factors and three forms of substance abuse that were potentially relevant were not found in the clinical notes. Patients with AD dementia were exposed to significantly more of the potential risk factors than the cognitively unimpaired subjects (χ² = 120.31, p-value < 0.001). The average accuracy of the automated extraction was 74.0% in comparison with the manual review of 50 randomly selected sample documents. Discussion and conclusion: We illustrated the feasibility of NLP techniques for the automated evaluation of a large number of lifestyle habits using free-text EHR data. We found that AD dementia patients were exposed to more of the potential risk factors than the comparison group. Our results also demonstrated the feasibility and accuracy of investigating putative risk factors using NLP techniques.
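Illustrative sketch (not the authors' pipeline): the abstract describes three steps, namely extracting exposure mentions from notes, comparing exposure counts between two groups with a chi-squared statistic, and scoring extraction accuracy against manual review. The Python below shows one minimal way these steps could look. The lexicon, its regular expressions, and the function names extract_exposures, chi2_2x2, and accuracy are all assumptions made for illustration; the record does not state which NLP tools, contingency table, or accuracy definition the study used, so this sketch does not attempt to reproduce the reported values.

```python
import re
from typing import Dict, List, Sequence, Set

# Hypothetical lexicon: exposure name -> surface patterns (illustrative only,
# not the 55 exposures examined in the study).
LEXICON: Dict[str, List[str]] = {
    "smoking": [r"\bsmok(?:es|ed|ing|er)\b", r"\btobacco\b"],
    "alcohol use": [r"\balcohol\b"],
    "physical inactivity": [r"\bsedentary\b"],
}


def extract_exposures(note: str) -> Set[str]:
    """Return the lexicon exposures whose patterns occur in a note.

    Simplification: no negation or context handling, so a phrase such as
    "denies smoking" would still count as a mention.
    """
    text = note.lower()
    return {
        name
        for name, patterns in LEXICON.items()
        if any(re.search(p, text) for p in patterns)
    }


def chi2_2x2(a: int, b: int, c: int, d: int) -> float:
    """Pearson chi-squared statistic (no continuity correction) for the
    2x2 table [[a, b], [c, d]], e.g. exposed/unexposed by case/control."""
    n = a + b + c + d
    denom = (a + b) * (c + d) * (a + c) * (b + d)
    if denom == 0:
        raise ValueError("a row or column total is zero")
    return n * (a * d - b * c) ** 2 / denom


def accuracy(predicted: Sequence[Set[str]], gold: Sequence[Set[str]]) -> float:
    """Fraction of documents whose extracted set exactly matches the
    manually reviewed set (one possible definition of accuracy)."""
    if len(predicted) != len(gold) or not gold:
        raise ValueError("inputs must be non-empty and of equal length")
    hits = sum(p == g for p, g in zip(predicted, gold))
    return hits / len(gold)
```

A realistic system would need negation and temporality handling and a curated vocabulary; the exact-match accuracy above is only one of several plausible scoring choices.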
Pages: 9